Dataset Open Access
This is a full archive of metadata, including abstracts, for papers posted to arxiv.org from 1993 to 2018. The data is tidy and stored in TSV files, organized as two collections that each cover the whole dataset: one split by year (all categories) and one split by primary category (all years). The archive also includes Jupyter notebooks for unpacking and analyzing the data in Python. See the README.md file and https://github.com/staeiou/arxiv_archive for more information.
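As a rough illustration of working with the TSV files outside the bundled notebooks, the sketch below reads tab-separated rows with Python's standard csv module and counts papers per primary category. The column names (arxiv_id, primary_cat, title) and the sample rows are hypothetical placeholders, not taken from the archive; check the README.md for the actual file names and columns before adapting it.

```python
import csv
import io
from collections import Counter


def read_tsv(handle):
    """Yield one dict per data row from an open tab-separated file with a header row."""
    yield from csv.DictReader(handle, delimiter="\t")


def count_by(rows, column):
    """Count how many rows share each value of the given column."""
    return Counter(row[column] for row in rows)


# Tiny in-memory stand-in for one TSV file.
# The column names and values here are assumptions made for illustration only.
SAMPLE = (
    "arxiv_id\tprimary_cat\ttitle\n"
    "0001.0001\tcs.CL\tFirst placeholder title\n"
    "0001.0002\tmath.CO\tSecond placeholder title\n"
    "0001.0003\tcs.CL\tThird placeholder title\n"
)

rows = list(read_tsv(io.StringIO(SAMPLE)))
counts = count_by(rows, "primary_cat")
print(counts.most_common())
```

To run this against a real file, replace the io.StringIO stand-in with something like open(path, newline="", encoding="utf-8"), where path points at one of the unpacked TSV files. If the abstracts contain quote characters, the csv quoting settings may need adjusting.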
This release contains exactly the same data as v1.0.0, but the notebook 4-analysis-examples.ipynb has been updated to fix an analysis bug.