There is a newer version of this record available.

Dataset Open Access

Reliance on Science in Patenting

Marx, Matt; Aaron Fuegi

This dataset contains citations from the front pages of worldwide patents to articles in the Microsoft Academic Graph (MAG) from 1800-2018. Send questions and feedback to support@relianceonscience.org. If you use the data, please cite these two papers:

  1. for the dataset of citations: Marx, Matt and Aaron Fuegi, "Reliance on Science: Worldwide Front-Page Patent Citations to Scientific Articles" (https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3331686). 
  2. for the articles: Sinha, A, et al. 2015. An Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). ACM, New York, NY, USA, 243-246.

The files below are described in _datadescription.pdf but here is a brief summary:

  • _pcs.tsv contains the patent citations to science. Fields are tab-separated. Each citation record has the patent number, MAG ID, applicant/examiner indicator, and a confidence score (1-10).
  • _pcs_pubmed.tsv is a PubMed-specific match, currently limited to USPTO patents.
  • _pcs_bodytextbeta.tsv is a preliminary release also containing citations from the body text of USPTO patents since 1836. This adds a field indicating whether the citation appeared on the front page, in the body text, or in both.
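
As a rough illustration of working with the tab-separated citation file, the sketch below reads records and keeps only those at or above a chosen confidence score. The column names and their order (`patent`, `magid`, `applicant_examiner`, `confscore`) are assumptions made for this example, as are the sample rows; the actual layout should be checked against the data description PDF before use.

```python
import csv
import io

# Hypothetical column names and order, chosen for illustration only;
# consult the data description PDF for the real layout of _pcs.tsv.
ASSUMED_COLUMNS = ("patent", "magid", "applicant_examiner", "confscore")


def read_citations(handle, min_confidence=1, columns=ASSUMED_COLUMNS):
    """Yield one dict per citation whose confidence score is >= min_confidence."""
    reader = csv.reader(handle, delimiter="\t")
    for row in reader:
        if len(row) != len(columns):
            continue  # skip blank or malformed lines
        record = dict(zip(columns, row))
        try:
            score = int(record["confscore"])
        except ValueError:
            continue  # non-numeric score, e.g. a header row if one is present
        if score >= min_confidence:
            record["confscore"] = score
            yield record


# Made-up rows standing in for the real file contents.
sample = "US-1234567\t2000000001\tapp\t10\nUS-7654321\t2000000002\texm\t3\n"
high_confidence = list(read_citations(io.StringIO(sample), min_confidence=8))
```

With the real file, the same function could be called on `open("_pcs.tsv", newline="", encoding="utf-8")`. Reading row by row keeps memory use low for a file of several hundred megabytes.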

The remaining files redistribute the 1/1/2019 release of the Microsoft Academic Graph, carving up the original files into smaller, variable-specific files. There are also some extensions including journal impact factor and high-level technical classifications.

Source code is available at https://github.com/mattmarx/reliance_on_science.

Files (42.4 GB)
Name                              Size      Checksum
__datadescription.pdf             343.2 kB  md5:04eb9c793014f96d78a1a755239610a8
_pcs.tsv                          621.4 MB  md5:12afea9b677e5f2a0d882504610a82eb
_pcs_bodytextbeta.tsv             1.1 GB    md5:fab932b4551a9b97b842d9b67e9b1d10
_pcs_pubmed.tsv                   174.9 MB  md5:29e6eb461cfc146eb2c50c0576bba38f
authoridname_normalized.zip       2.8 GB    md5:0917e7304059b52619782aa4a5f1f24a
authoridname_raw.zip              3.0 GB    md5:9e35a6df4f3f6b0fe525eed10afae3d3
conferenceidname.zip              78.9 kB   md5:f8501b603ac284a7c168d72a1511ad36
fieldidname.zip                   4.2 MB    md5:a68b721d656a7be3ca6efb677d0a39b0
jcif.zip                          5.2 MB    md5:c2f351238565d2216136aeaacdf55914
jif.zip                           8.1 MB    md5:7c66b0a4d51721179ce103ce9fdb35c9
journalidnameissn.zip             1.3 MB    md5:4fb35d70897e46a5b3f1ac9a723c095a
magfield_oecd_wos_crosswalk.zip   2.2 GB    md5:bbe297e3f6a71b79d3b754ab00c3eba0
paperauthoridaffiliationname.zip  4.3 GB    md5:3d7dbb590fa0f834a938e3897b71f4f5
paperauthororder.zip              3.5 GB    md5:9705a0dc6d517b2336ecc148ba591982
papercitations.zip                7.8 GB    md5:84c293aba31f57bbb85d2e6d5f65dfce
paperconferenceid.zip             448.7 MB  md5:cfde2972be81f7db051edc37e903ac91
paperdoi.zip                      1.3 GB    md5:ae6a01a43054910834667f6763c4b13e
paperfieldid.zip                  5.7 GB    md5:78e5e3e144a42e8b22bc1f85c2b8ed3e
paperjournalid.zip                807.1 MB  md5:d9a425c7c183d3a12762d0bf1ced17f2
papertitle.zip                    6.9 GB    md5:95c371e6e21169c13e1c5b3e6b7b8aab
papervolisspages.zip              1.1 GB    md5:43535c579a791b6f07d11b1c3c381c4f
paperyear.zip                     620.2 MB  md5:d0067ff44ce5aee7db1be8e51398f950
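
Because several of these files are multiple gigabytes, it may be worth checking a download against its listed MD5 checksum before using it. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_stream` and `file_matches_listing` are made up for this example, and the local path is wherever the file was saved.

```python
import hashlib
import io


def md5_of_stream(stream, chunk_size=1 << 20):
    """Return the hex MD5 digest of a binary stream, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def file_matches_listing(path, listed_checksum):
    """Compare a local file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    with open(path, "rb") as handle:
        return md5_of_stream(handle) == expected


# Intended use after downloading (the path is an assumption about where you saved it):
# file_matches_listing("_pcs.tsv", "md5:12afea9b677e5f2a0d882504610a82eb")

# Small in-memory demonstration of the digest helper.
demo_digest = md5_of_stream(io.BytesIO(b"hello"))
```

Reading in fixed-size chunks avoids loading a multi-gigabyte archive into memory at once.
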
  • Marx, Matt and Aaron Fuegi, "Reliance on Science in Patenting: USPTO Front-Page Citations to Scientific Articles" (https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3331686)

  • Sinha, Arnab, Zhihong Shen, Yang Song, Hao Ma, Darrin Eide, Bo-June (Paul) Hsu, and Kuansan Wang. 2015. An Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). ACM, New York, NY, USA, 243-246

                  All versions  This version
Views             52,472        2,868
Downloads         62,802        3,276
Data volume       154.6 TB      2.7 TB
Unique views      42,898        2,515
Unique downloads  25,496        2,265
