There is a newer version of the record available.

Published May 29, 2021 | Version v30
Dataset Open

Reliance on Science in Patenting

  • 1. Cornell University
  • 2. Boston University

Description

This dataset contains both front-page and in-text citations from patents to scientific articles.  If you use the data, please cite these two articles:

1. M. Marx, & A. Fuegi, "Reliance on Science: Worldwide Front-Page Patent Citations to Scientific Articles" (2020), Strategic Management Journal 41(9):1572-1594. (https://onlinelibrary.wiley.com/doi/full/10.1002/smj.3145

2. M. Marx & A. Fuegi, "Reliance on Science by Inventors: Hybrid Extraction of In-text Patent-to-Article Citations." NBER Working Paper 27987.(https://www.nber.org/papers/w27987)

The datafile containing the citations is _pcs_mag_doi_pmid.tsv. DOIs and PMIDs provided where available. Each citation has the applicant/examiner flag, confidence score (1-10), and whether the reference was a) only on the front page, b) only in the body text, or c) in both. _data_description.pdf has full details. bodytextknowngood.tsv contains the known-good references for calculating recall. bodytextpatrefstopatents.tsv contains references to patents in the body text of patents.

The remaining files redistribute the Microsoft Academic Graph. Please also cite Sinha, A, et al. 2015. Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW ’15 Companion). ACM, New York, NY, USA, 243-246.

These data are under an Open Data Commons Attribution license (ODC-By); use them for anything as long as you cite us! Source code for front-page matches: https://github.com/mattmarx/reliance_on_science. Questions & feedback to support@relianceonscience.org. 

(note ,this is an interim release and does not include the MAG files; those will be posted shortly)

Files

__datadescription.pdf

Files (6.1 GB)

Name Size Download all
md5:869edb358e771120b87fb44ec9f1317a
227.5 kB Preview Download
md5:465ad920f49b905973f8e5e92356386b
2.9 GB Download
md5:ea4aabd9bd31834f7ec333f0ff09bae3
82.5 kB Preview Download
md5:5bb26fd59a0f9b9e2a44a4a124d44b6c
1.0 GB Preview Download
md5:c2f351238565d2216136aeaacdf55914
5.2 MB Preview Download
md5:7c66b0a4d51721179ce103ce9fdb35c9
8.1 MB Preview Download
md5:bbe297e3f6a71b79d3b754ab00c3eba0
2.2 GB Preview Download

Additional details

References

  • Marx, Matt and Aaron Fuegi, "Reliance on Science in Patenting: USPTO Front-Page Citations to Scientific Articles" (https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3331686)
  • Sinha, Arnab, Zhihong Shen, Yang Song, Hao Ma, Darrin Eide, Bo-June (Paul) Hsu, and Kuansan Wang. 2015. An Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). ACM, New York, NY, USA, 243-246