Published September 7, 2022 | Version v1
Conference paper Open

The State of Unpaywall: Analyzing the Consistency of Open Access Data

  • 1. German Centre for Higher Education Research and Science Studies

Description

These result highlight the difficulties of identifying the open access status of a publication. Especially for less rigid OA subgroups like Hybrid and Bronze OA the classification task is a process of iterating over improved algorithms. Generally, it can be assumed that these iterations lead towards a more accurate reflection of the true OA status. This process, however, has implications for the academic users of Unpaywall data. Studies that use these data to analyze OA status and especially OA subgroups should be aware that the reliability of the data and reproducibility of the results are dependent on time and infrastructural design choice. This observation poses essential background information for OA studies that rely on Unpaywall data at a single point in time.

For the OA transformation, the results also highlight the importance of author-choice based contributions. Publisher-choice based contributions appear to be harder to identify but also volatile in their status over time. For Open Access studies, these findings provide empirical reasons for caution when including data on Bronze OA into their analysis. For the OA transformation in general, the findings highlight authors as the key contributors to a successful transformation.

Files

127.pdf

Files (548.0 kB)

Name Size Download all
md5:a6daac4abf5a75ecf4002f6653586fe2
548.0 kB Preview Download

Additional details

Related works

Is described by
Presentation: 10.5281/zenodo.7142361 (DOI)