Published January 7, 2019 | Version v1.0.1
Dataset Open

ArXiV Archive: A tidy and complete archive of metadata for papers on arxiv.org, 1993-2019

  • 1. UC-Berkeley @BIDS

Description

This is a full archive of metadata, including abstracts, for papers on arxiv.org from 1993 to 2018. The data are tidy and packaged as TSV files in two collections, each covering the full dataset: one split by year (all categories) and one split by primary category (all years). The archive also includes Jupyter notebooks for unpacking and analyzing the data in Python. See the README.md file and https://github.com/staeiou/arxiv_archive for more information.
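As a rough illustration of working with TSV files of this kind outside the bundled notebooks, the following is a minimal sketch using only the Python standard library. The column names (`id`, `primary_category`, `title`) and the sample rows are hypothetical placeholders, not the archive's actual schema; check README.md for the real file and column names.

```python
import csv
import io
from collections import Counter


def read_tsv(handle):
    """Read a tab-separated file with a header row into a list of dicts."""
    # QUOTE_NONE treats quote characters as ordinary text, which is a
    # common choice for TSV files (an assumption about this archive).
    return list(csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE))


def count_by(rows, column):
    """Count how many rows share each value of the given column."""
    return Counter(row[column] for row in rows)


# Tiny in-memory stand-in for one per-year file.
# The column names and values here are hypothetical.
SAMPLE = (
    "id\tprimary_category\ttitle\n"
    "0001\tcs.LG\tA first example title\n"
    "0002\tmath.CO\tA second example title\n"
    "0003\tcs.LG\tA third example title\n"
)

rows = read_tsv(io.StringIO(SAMPLE))
counts = count_by(rows, "primary_category")
print(counts.most_common())
```

To read a file extracted from the archive instead of the in-memory sample, pass `open(path, newline="", encoding="utf-8")` to `read_tsv`, where `path` is whichever per-year or per-category file you unpacked.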

This release contains exactly the same data as v1.0.0; the notebook 4-analysis-examples.ipynb has been updated to fix an analysis bug.

Files

staeiou/arxiv_archive-v1.0.1.zip (1.5 GB)
md5:93b6f9fccdd018660b38ceb003b83ab5
