There is a newer version of this record available.

Dataset Open Access

Complete Rxivist dataset of scraped bioRxiv data

Abdill, Richard J.; Blekhman, Ran

Rxivist allows readers to sort and filter the tens of thousands of preprints posted to bioRxiv. Rxivist uses a custom web crawler to index all papers on bioRxiv; this is a snapshot of the Rxivist production database. The version number indicates the date on which the snapshot was taken. See the included "" file (also available on GitHub) for instructions on how to use the "rxivist.backup" file to import data into a PostgreSQL database server.
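As a rough illustration of the import step, the sketch below builds the commands one might use to load "rxivist.backup" into a local PostgreSQL server. It is not taken from the dataset's own instructions, which remain the authoritative guide. It assumes the backup is an archive that `pg_restore` can read (a plain SQL dump would need `psql` instead), that the `createdb` and `pg_restore` client tools are installed, and that the database name `rxivist`, user `postgres`, and host `localhost` are acceptable; all of those names, and the helper functions themselves, are hypothetical.

```python
import shlex
import subprocess
from pathlib import Path


def build_restore_commands(backup_path, dbname="rxivist", user="postgres", host="localhost"):
    """Return the createdb and pg_restore invocations as argument lists.

    All defaults here are assumptions for illustration, not values
    documented by the dataset.
    """
    backup = Path(backup_path)
    # Create an empty database to restore into.
    create = ["createdb", "--host", host, "--username", user, dbname]
    # Restore the archive; --no-owner skips commands that set object
    # ownership, so the original role names need not exist locally.
    restore_cmd = [
        "pg_restore",
        "--host", host,
        "--username", user,
        "--dbname", dbname,
        "--no-owner",
        str(backup),
    ]
    return [create, restore_cmd]


def restore(backup_path, dry_run=True, **kwargs):
    """Print the commands (dry run) or execute them in order."""
    commands = build_restore_commands(backup_path, **kwargs)
    for cmd in commands:
        if dry_run:
            print(shlex.join(cmd))
        else:
            # check=True raises CalledProcessError if a command fails.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Dry run by default: shows what would be executed without touching a server.
    restore("rxivist.backup")
```

Running the script as-is only prints the two commands; calling `restore("rxivist.backup", dry_run=False)` would execute them against whatever server the assumed host and user point to.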

Please note that this repository is different from the one used for the Rxivist manuscript, which is in a separate Zenodo repository. You're welcome (and encouraged!) to use this data in your research, but please cite our paper, available on bioRxiv.

Files (151.6 MB)
Name Size
63.3 kB
151.5 MB
                  All versions  This version
Views             2,106         113
Downloads         740           22
Data volume       41.8 GB       1.7 GB
Unique views      1,615         104
Unique downloads  422           17

