There is a newer version of this record available.

Dataset Open Access

Reliance on Science

Marx, Matt; Aaron Fuegi

This dataset contains patent-to-paper citations through 2022 as well as patent-paper pairs (through 2021).  If you use the data, please cite these two articles:

1. M. Marx & A. Fuegi, "Reliance on Science by Inventors: Hybrid Extraction of In-text Patent-to-Article Citations."  forthcoming in Journal of Economics and Management Strategy. (http://doi.org/10.1111/jems.12455)

2. M. Marx, & A. Fuegi, "Reliance on Science: Worldwide Front-Page Patent Citations to Scientific Articles" (2020), Strategic Management Journal 41(9):1572-1594. (https://onlinelibrary.wiley.com/doi/full/10.1002/smj.3145

The datafile containing the citations is _pcs_oa.csv.  Each citation has the applicant/examiner flag, confidence score (1-10), whether the reference was a) only on the front page, b) only in the body text, or c) in both, and an indicator for a self-citation (i.e., one of the authors is an inventor on the patent). There are two "shorthand" files, _pcs_countsbypatent.csv and _pcs_countsbypaper.csv, which collapse these to the paper and patent level by citation type.

The datafile containing the patent-paper pairs (PPPs) is _patent_paper_pairs.tsv. These are USPTO only, through 2021. Each PPP has a confidence score and the count of days between the publication of the paper and the filing of the patent. (If the patent is a continuation of another patent, the filing date of the original patent is used.) Also, when a paper is paired with multiple patents, an indicator variable reports whether those patents are continuations or otherwise identical. 

The remaining files redistribute some of the end-2022 edition of OpenAlex. To retrieve additional OpenAlex files or fields, please visit openalex.com.  (This release replaces files from the Microsoft Academic Graph, which was retired on 12/20/2021.) 

The above is documented in greater detail in __reliance_on_science.pdf.

These data are provided under a Creative Commons Attribution Non-Commercial license. Please contact us regarding commercial use.  Questions & feedback to support@relianceonscience.org.

This work is sponsored by the Alfred P. Sloan Foundation grant #G-2021-16822.

Files (20.0 GB)
Name Size
__relianceonscience.pdf
md5:e4b606d64bbe02a44fc2e4736c410afe
182.8 kB Download
_patent_paper_pairs.tsv
md5:e3ee27cd33b2ec6bf1efc586ba5ee66a
2.9 MB Download
_pcs_countsbypaper.csv
md5:8946c21613d92f7342ae917844d0759a
251.7 MB Download
_pcs_countsbypatent.csv
md5:a0fa762574525d74c9de8fc3563723f6
266.0 MB Download
_pcs_oa.csv
md5:c76b3e096253dce1fd356c2a59be30a1
2.0 GB Download
affiliationidname.zip
md5:76877b7f0c82e75a9c9fc4416dd8a3c7
40.6 kB Download
authoridname.zip
md5:f6e94c90ba236f58d40684a67bac3017
3.8 GB Download
bodytextknowngood.tsv
md5:0d20284aadeb443ad48eac1d00ae503f
272.1 kB Download
journalidname.zip
md5:903bb12f21bb2d233bfccc9ea6b09cdb
2.0 MB Download
paperauthoridorderaffiliation.zip
md5:1303b74f23e25592d65c6b57872c4970
7.2 GB Download
paperdoi.zip
md5:f3f7dc7b25e1cf6f6e9c0e74d94ae554
2.2 GB Download
paperjournalid.zip
md5:51f4b871ae6c11aac982363e381238d9
705.3 MB Download
paperncitesfrompapers.zip
md5:62ad289c9348d87dd4b9d71f24c4dbae
280.1 MB Download
paperpmid.zip
md5:abcc50eb83cbae18a588024ec56aea7a
317.2 MB Download
papervolisspages.zip
md5:8d017618b926cb19d89189cc9b9ef2a2
1.3 GB Download
paperyear.zip
md5:7cc4447d5f0b3d763f6bd9a3d6cc4e43
1.5 GB Download
  • Marx, Matt and Aaron Fuegi, "Reliance on Science in Patenting: USPTO Front-Page Citations to Scientific Articles" (https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3331686)

  • Sinha, Arnab, Zhihong Shen, Yang Song, Hao Ma, Darrin Eide, Bo-June (Paul) Hsu, and Kuansan Wang. 2015. An Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). ACM, New York, NY, USA, 243-246

57,868
67,058
views
downloads
All versions This version
Views 57,8681,992
Downloads 67,0581,384
Data volume 159.0 TB1.1 TB
Unique views 47,2751,783
Unique downloads 27,600743

Share

Cite as