Dataset Open Access

Biomedical preprints per month, by source and as a fraction of total literature

Polka, Jessica K.; Penfold, Naomi C.

This is a snapshot of a Google sheet containing counts of biomedical preprints by source and as a fraction of the total biomedical literature through 2020-06.

Note: this snapshot does not yet include the more than 6,000 preprints on relevant OSF platforms.

Data sources

arXiv q-bio, PeerJ Preprints, and bioRxiv counts through 2018-12 were sourced from Jordan Anaya's PrePubMed. Following that, arXiv q-bio counts were sourced from arXiv statistics, and PeerJ Preprints and bioRxiv counts were taken from searches of EuropePMC.

All counts from F1000 & Open Research platforms, ChemRxiv, and medRxiv were taken from searches of EuropePMC.
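The exact EuropePMC searches behind these counts are not given in this description. As a rough sketch only, a monthly search for one preprint server could be built as below. The endpoint, the query fields (`SRC:PPR`, `PUBLISHER`, `FIRST_PDATE`), and the helper name `month_query_url` are assumptions for illustration, not the authors' documented method. The code only constructs the URL and makes no network request.

```python
from calendar import monthrange
from urllib.parse import urlencode

# Assumed Europe PMC REST search endpoint (not stated in the dataset description).
BASE_URL = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def month_query_url(server: str, year: int, month: int) -> str:
    """Build a search URL for preprints from one server in one calendar month.

    Hypothetical helper: the query fields used here are assumptions about
    how such a search might be phrased.
    """
    last_day = monthrange(year, month)[1]  # number of days in the month
    query = (
        f'SRC:PPR AND PUBLISHER:"{server}" '
        f"AND FIRST_PDATE:[{year}-{month:02d}-01 TO {year}-{month:02d}-{last_day:02d}]"
    )
    # pageSize=1 because only the total number of hits matters, not the records.
    return BASE_URL + "?" + urlencode({"query": query, "format": "json", "pageSize": 1})


print(month_query_url("medRxiv", 2020, 6))
```

The monthly count would then be read from the total hit count in the response, and it is worth checking by hand against the EuropePMC web search before relying on it.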

Counts for Research Square are derived from Crossref through 2020-03, thereafter from EuropePMC. Counts for the Lancet and Sneak Peek were taken from web searches.

Total biomedical literature is from PubMed.
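The spreadsheet's exact formula for "fraction of total literature" is not spelled out here. A minimal sketch, assuming it is the summed preprint count for a month divided by that month's PubMed count, might look like this. The function name and the example numbers are placeholders, not values from the dataset.

```python
def preprint_fraction(preprints_by_source: dict[str, int], pubmed_total: int) -> float:
    """Return monthly preprints (summed over sources) as a fraction of PubMed records.

    Assumption: the denominator is the PubMed count alone, with preprints not added to it.
    """
    if pubmed_total <= 0:
        raise ValueError("pubmed_total must be positive")
    return sum(preprints_by_source.values()) / pubmed_total


# Placeholder numbers for illustration only; they are not taken from the dataset.
example_counts = {"source_a": 300, "source_b": 200}
example_pubmed_total = 10_000

print(f"{preprint_fraction(example_counts, example_pubmed_total):.1%}")  # prints 5.0%
```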

Files (946.8 kB)
Preprints per month through 2020-06.PNG (123.2 kB)
Preprints per month through 2020-06.xlsx (717.7 kB)
Preprints vs PubMed 2020-06.PNG (105.9 kB)
  • Anaya (2018): PrePubMed.

  • Penfold and Polka (2019): Preprints in biology as a fraction of the biomedical literature. 10.5281/zenodo.3256298

                   All versions   This version
Views              2,675          597
Downloads          422            122
Data volume        142.3 MB       31.9 MB
Unique views       2,032          491
Unique downloads   283            89

