Dataset Open Access

Data From: The Future of OA: A large-scale analysis projecting Open Access publication and readership

Piwowar, Heather; Priem, Jason; Orr, Richard

This is the raw data behind the publication on bioRxiv at https://doi.org/10.1101/795310: 

Piwowar, Priem, Orr (2019) The Future of OA: A large-scale analysis projecting Open Access publication and readership. bioRxiv: https://doi.org/10.1101/795310

The jupyter notebook that produces the manuscript using the data here is available at: https://github.com/Impactstory/future-oa

 

Summary:

Understanding the growth of open access (OA) is important for deciding funder policy, subscription allocation, and infrastructure planning.

This study analyses the number of papers available as OA over time. The models includes both OA embargo data and the relative growth rates of different OA types over time, based on the OA status of 70 million journal articles published between 1950 and 2019.

The study also looks at article usage data, analyzing the proportion of views to OA articles vs views to articles which are closed access. Signal processing techniques are used to model how these viewership patterns change over time. Viewership data is based on 2.8 million uses of the Unpaywall browser extension in July 2019.

We found that Green, Gold, and Hybrid papers receive more views than their Closed or Bronze counterparts, particularly Green papers made available within a year of publication. We also found that the proportion of Green, Gold, and Hybrid articles is growing most quickly.

In 2019:

  • 31% of all journal articles are available as OA

  • 52% of article views are to OA articles

Given existing trends, we estimate that by 2025:

  • 44% of all journal articles will be available as OA

  • 70% of article views will be to OA articles

The declining relevance of closed access articles is likely to change the landscape of scholarly communication in the years to come.

The jupyter notebook that produces the manuscript using the data here is available at: https://github.com/Impactstory/future-oa.
Files (2.8 MB)
Name Size
articles_by_color_by_year_with_embargos.csv
md5:60bfaa2c68059e1cbcd1df4a087daa60
40.4 kB Download
articles_by_graph_type_by_year.csv
md5:7082c573b299138fff8e541ae3627082
12.0 kB Download
biorxiv_growth_otherwise_closed.csv
md5:f860130b1dcda01c2452fcccb6d5671e
82 Bytes Download
delayed_bronze_after_embargos_age_months.csv
md5:97b8b90c9021d09b46ef4ba2c64190f1
480.8 kB Download
delayed_bronze_after_embargos_age_years.csv
md5:3c795e446a0875569ab8071cce8dc0d5
39.1 kB Download
delayed_bronze_empirical_list.csv
md5:b3466c07d050760ca5c9e6f2a619fcd3
7.6 kB Download
delayed_bronze_extracted_policies.csv
md5:8f26cd562e5a4fca9d4bab7088dcb77f
13.5 kB Download
delayed_bronze_sql_parts.zip
md5:fbf339a7ba26c49a160cdba07ad711cb
168.8 kB Download
gold_oa_empirical_list.csv
md5:9453d4bf4e39e9329833d35f13bee778
1.0 MB Download
green_oa_with_dates_by_availability.csv
md5:14863473eb7dd6f8c1b9c3e264d89bd9
951.0 kB Download
views_by_age_months.csv
md5:254d9014441b9a7ea97dc4e0bfcfd42c
45.8 kB Download
views_by_age_months_no_color_full_year.csv
md5:ba0c4a4488c46875c383c166df59c5cd
8.7 kB Download
views_by_age_years.csv
md5:48a22c941e051d77f09ba648519e6b5b
7.1 kB Download
491
2,832
views
downloads
All versions This version
Views 491491
Downloads 2,8322,832
Data volume 2.5 GB2.5 GB
Unique views 425425
Unique downloads 1,5561,556

Share

Cite as