There is a newer version of this record available.

Dataset Open Access

Complete Rxivist dataset of scraped bioRxiv data

Abdill, Richard J.; Blekhman, Ran allows readers to sort and filter the tens of thousands of preprints posted to bioRxiv. Rxivist uses a custom web crawler to index all papers on; this is a snapshot of Rxivist the production database. The version number indicates the date on which the snapshot was taken. See the included "" file (or on GitHub) for instructions on how to use the "rxivist.backup" file to import data into a PostgreSQL database server.

Please note this is a different repository than the one used for the Rxivist manuscript—that is in a separate Zenodo repository. You're welcome (and encouraged!) to use this data in your research, but please cite our paper, available on bioRxiv.

Files (151.6 MB)
Name Size
63.3 kB Download
151.5 MB Download
All versions This version
Views 24475
Downloads 4818
Data volume 3.2 GB1.5 GB
Unique views 18767
Unique downloads 3615


Cite as