Dataset Open Access

ArXiV Archive: A tidy and complete archive of metadata for papers on arxiv.org, 1993-2019

R. Stuart Geiger

This is a full archive of metadata about papers on arxiv.org from 1993-2018, including abstracts. The data is tidy and packed in TSV files, organized as two collections that each cover the total dataset: one split per year (all categories) and one split per primary category (all years). The archive also includes Jupyter notebooks for unpacking and analyzing the data in Python. See the README.md file and https://github.com/staeiou/arxiv_archive for more information.
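
As a rough illustration of working with one of the unpacked TSV files, the sketch below reads a tab-separated file with a header row using only the Python standard library and tallies the values in one column. The file path and the column name in the usage note are hypothetical placeholders, and the sketch assumes the files use the default CSV quoting rules; the README.md and the bundled notebooks are the authoritative guide to the real layout.

```python
import csv
from collections import Counter
from pathlib import Path


def read_tsv(path):
    """Yield one dict per row from a tab-separated file that has a header row."""
    with Path(path).open(newline="", encoding="utf-8") as handle:
        # Assumption: fields are tab-delimited and follow default CSV quoting.
        reader = csv.DictReader(handle, delimiter="\t")
        yield from reader


def count_by_column(path, column):
    """Count how many rows carry each distinct value of the given column."""
    return Counter(row[column] for row in read_tsv(path))
```

For example, `count_by_column("per_year/2018.tsv", "primary_cat")` would tally papers per primary category for one year, if a file and a column with those (assumed) names exist in the unpacked archive.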

This release contains exactly the same data as v1.0.0; only the notebook 4-analysis-examples.ipynb has been updated, to fix an analysis bug.

Files (1.5 GB)
Name: staeiou/arxiv_archive-v1.0.1.zip
Size: 1.5 GB
md5: 93b6f9fccdd018660b38ceb003b83ab5
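
The published md5 value can be used to check that a download completed intact. The following is a minimal sketch using Python's standard `hashlib`; the local file name in the usage note is an assumption about where the zip was saved, and the function names are illustrative.

```python
import hashlib

# Checksum as published in the file listing for arxiv_archive-v1.0.1.zip.
EXPECTED_MD5 = "93b6f9fccdd018660b38ceb003b83ab5"


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()
```

Calling `matches_published_checksum("arxiv_archive-v1.0.1.zip")` (the path is assumed) returning `False` would suggest a truncated or corrupted download.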
Statistics         All versions   This version
Views              137            87
Downloads          40             15
Data volume        56.6 GB        21.9 GB
Unique views       108            76
Unique downloads   24             13
