There is a newer version of this record available.

Dataset Open Access

Complete Rxivist dataset of scraped bioRxiv data

Abdill, Richard J.; Blekhman, Ran allows readers to sort and filter the tens of thousands of preprints posted to bioRxiv. Rxivist uses a custom web crawler to index all papers on; this is a snapshot of Rxivist the production database. The version number indicates the date on which the snapshot was taken. See the included "" file (or on GitHub) for instructions on how to use the "rxivist.backup" file to import data into a PostgreSQL database server.

Please note this is a different repository than the one used for the Rxivist manuscript—that is in a separate Zenodo repository. You're welcome (and encouraged!) to use this data in your research, but please cite our paper, available on bioRxiv.

Files (151.6 MB)
Name Size
63.3 kB Download
151.5 MB Download
All versions This version
Views 3,376128
Downloads 98923
Data volume 65.3 GB1.8 GB
Unique views 2,598119
Unique downloads 56918


Cite as