There is a newer version of this record available.

Dataset Open Access

Biomedical preprints per month, by source and as a fraction of total literature

Polka, Jessica K.; Penfold, Naomi C.

This is a snapshot of a Google sheet containing counts of biomedical preprints by source and as a fraction of the total biomedical literature through 2020-04.

Note: this does not yet include >6,000 preprints on relevant OSF platforms.

Data sources

arXiv q-bio, PeerJ Preprints, and bioRxiv counts through 2018-12 were sourced from Jordan Anaya's PrePubMed. Following that, arXiv q-bio counts were sourced from arXiv statistics, and PeerJ Preprints and bioRxiv counts were taken from searches of EuropePMC.

All counts from F1000 & Open Research platforms,, ChemRxiv, and medRxiv were taken from searches of EuropePMC.

Counts for Research Square are derived from Crossref through 2020-03, thereafter from EuropePMC. Counts for the Lancet and Sneak Peek were taken from web searches.

Total biomedical literature is from PubMed.

Files (1.3 MB)
Name Size
Biomedical preprints per month through 2020-04.PNG
274.9 kB Download
Papers per month through 2020-04.PNG
291.8 kB Download
Preprints per month through 2020-04.xlsx
724.5 kB Download
  • Anaya (2018): PrePubMed.

  • Penfold and Polka (2019): Preprints in biology as a fraction of the biomedical literature. 10.5281/zenodo.3256298

All versions This version
Views 3,0881,454
Downloads 462151
Data volume 155.6 MB64.5 MB
Unique views 2,4011,256
Unique downloads 316111


Cite as