Publication data for article "Scientific and technological knowledge grows linearly over time"
Creators
Description
This dataset includes metadata for academic publications used in the article Scientific and technological knowledge grows linearly over time. The dataset contains citation relationships, publication dates, and academic fields for 213,715,816 publications from 1800 to 2020. These publications cover 292 secondary subjects in 19 major disciplines, including Economics, Biology, Computer Science, Physics, and more. The data are requested from Acemap (https://www.acemap.info, Shanghai Jiao Tong University) and sourced from the last snapshot of Microsoft Academic Graph (MAG) as of December 31, 2021.
The dataset includes two gzip-compressed files, which contain all data in CSV format after decompression. Sample data is presented below:
- paper_date_refs.csv (paper_date_refs.tar.gz)
- paper_id
- date
- reference_ids (separated by comma)
- field_paper (field_paper.tar.gz)
- field_paper/Computer science.csv
- paper_id
- field_paper/Biology.csv
- paper_id
- ...
- field_paper/Computer science.csv
Files
Files
(11.7 GB)
Name | Size | Download all |
---|---|---|
md5:836af54a8b35515f790ce282eef1169e
|
2.1 GB | Download |
md5:2a9a7a39379c2566987be39025d7741a
|
9.6 GB | Download |