Published May 30, 2023 | Version v1
Dataset Open

Publication data for article "Scientific and technological knowledge grows linearly over time"

  • 1. Shanghai Jiao Tong University
  • 2. Chinese Academy of Sciences
  • 3. ROR icon University of Minnesota

Description

This dataset includes metadata for academic publications used in the article Scientific and technological knowledge grows linearly over time. The dataset contains citation relationships, publication dates, and academic fields for 213,715,816 publications from 1800 to 2020. These publications cover 292 secondary subjects in 19 major disciplines, including Economics, Biology, Computer Science, Physics, and more. The data are requested from Acemap (https://www.acemap.info, Shanghai Jiao Tong University) and sourced from the last snapshot of Microsoft Academic Graph (MAG) as of December 31, 2021.

The dataset includes two gzip-compressed files, which contain all data in CSV format after decompression. Sample data is presented below:

  1. paper_date_refs.csv (paper_date_refs.tar.gz)
    1. paper_id
    2. date
    3. reference_ids (separated by comma)
  2. field_paper (field_paper.tar.gz)
    1. field_paper/Computer science.csv
      1. paper_id
    2. field_paper/Biology.csv
      1. paper_id
    3. ...

Files

Files (11.7 GB)

Name Size Download all
md5:836af54a8b35515f790ce282eef1169e
2.1 GB Download
md5:2a9a7a39379c2566987be39025d7741a
9.6 GB Download