Dataset Open Access
This dataset contains citations from USPTO patents granted 1947-2018 to articles captured by the Microsoft Academic Graph (ID) from 1800-2018.
The main file, pcs.tsv, contains the resolved citations. Fields are tab-separated. Each match has the patent number, MAG ID, the original citation from the patent, an indicator for whether the citation was supplied by the applicant, examiner, or unknown, and a confidence score (1-10) indicating how likely this match is correct. Note that this distribution does not contain matches with confidence 2 or 1.
The remaining files are a redistribution of the 1 January 2019 release of the Microsoft Academic Graph. All of these files are compressed using ZIP compression under CentOS5. Original files, documented at https://docs.microsoft.com/en-us/academic-services/graph/reference-data-schema, can be downloaded from https://aka.ms/msracad; this redistribution carves up the original files into smaller, variable-specific files that can be loaded individually (see reliance_on_science.pdf for full details, the latest version of which is available at https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3331686).
Source code is available at https://github.com/mattmarx/reliance_on_science. Be sure to see the replication disclaimers.
Name | Size | |
---|---|---|
_reliance on science.pdf
md5:75f5bf12a703348312d6d6083d3d314d |
693.6 kB | Download |
authoridname_normalized.zip
md5:0917e7304059b52619782aa4a5f1f24a |
2.8 GB | Download |
authoridname_raw.zip
md5:9e35a6df4f3f6b0fe525eed10afae3d3 |
3.0 GB | Download |
conferenceidname.zip
md5:f8501b603ac284a7c168d72a1511ad36 |
78.9 kB | Download |
fieldidname.zip
md5:a68b721d656a7be3ca6efb677d0a39b0 |
4.2 MB | Download |
journalidname.zip
md5:47cfcec6787566c70ca8b9d93fe3762d |
1.3 MB | Download |
paperauthoridaffiliationname.zip
md5:3d7dbb590fa0f834a938e3897b71f4f5 |
4.3 GB | Download |
paperauthororder.zip
md5:9705a0dc6d517b2336ecc148ba591982 |
3.5 GB | Download |
papercitations.zip
md5:84c293aba31f57bbb85d2e6d5f65dfce |
7.8 GB | Download |
paperconferenceid.zip
md5:cfde2972be81f7db051edc37e903ac91 |
448.7 MB | Download |
paperdoi.zip
md5:ae6a01a43054910834667f6763c4b13e |
1.3 GB | Download |
paperfieldid.zip
md5:78e5e3e144a42e8b22bc1f85c2b8ed3e |
5.7 GB | Download |
paperjournalid.zip
md5:d9a425c7c183d3a12762d0bf1ced17f2 |
807.1 MB | Download |
papertitle.zip
md5:1b57c3b2a863387608461f4c5d3c928a |
6.9 GB | Download |
papervolisspages.zip
md5:43535c579a791b6f07d11b1c3c381c4f |
1.1 GB | Download |
paperyear.zip
md5:d0067ff44ce5aee7db1be8e51398f950 |
620.2 MB | Download |
pcs.tsv
md5:37bc6ba73865524927b878b635692f97 |
3.2 GB | Download |
All versions | This version | |
---|---|---|
Views | 34,791 | 1,294 |
Downloads | 51,335 | 1,677 |
Data volume | 137.4 TB | 2.4 TB |
Unique views | 28,186 | 1,174 |
Unique downloads | 18,849 | 986 |