Dataset Open Access

Reliance on Science in Patenting

Marx, Matt; Fuegi, Aaron

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.5111261", 
  "language": "eng", 
  "title": "Reliance on Science in Patenting", 
  "issued": {
    "date-parts": [
  "abstract": "<p><em><strong>Note: If you downloaded these data between May 29 (v30) and July 16 (v31), please delete those and replace them with the current release below&nbsp;(v32, uploaded July 17). I introduced a bug with v30 that resulted in duplicate patent-paper linkages due to erroneous patent numberings.&nbsp;</strong></em></p>\n\n<p>This dataset contains both front-page and in-text citations from patents to scientific articles through 2020. &nbsp;<em>If you use the data, please cite </em>these two articles:</p>\n\n<p><strong>1. M. Marx, &amp; A.&nbsp;Fuegi, &quot;Reliance on Science: Worldwide Front-Page Patent Citations to Scientific Articles&quot; (2020),&nbsp;<em>Strategic Management Journal 41(9):1572-1594</em>. (</strong><a href=\"\"></a><strong>)&nbsp;</strong></p>\n\n<p><strong>2. M. Marx &amp; A. Fuegi, &quot;Reliance on Science by Inventors: Hybrid Extraction of In-text Patent-to-Article Citations.&quot; NBER Working Paper&nbsp;27987</strong>.(<a href=\"\"></a>)</p>\n\n<p>The datafile containing the citations is <strong>_pcs_mag_doi_pmid.tsv.&nbsp;</strong>DOIs and PMIDs provided where available. Each citation has the&nbsp;applicant/examiner flag, confidence score&nbsp;(1-10), and&nbsp;whether the reference was a) only on the front page, b) only in the body text, or c) in both.&nbsp;<strong>_data_description.pdf</strong>&nbsp;has full details.&nbsp;<strong>bodytextknowngood.tsv</strong>&nbsp;contains the known-good references for calculating recall.</p>\n\n<p>The remaining files redistribute the&nbsp;<a href=\"\">Microsoft Academic Graph</a>. Please also cite&nbsp;Sinha, A, et al. 2015. Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW &rsquo;15 Companion). ACM, New York, NY, USA, 243-246.</p>\n\n<p>These data are under an&nbsp;Open Data Commons Attribution license (ODC-By);&nbsp;use them for anything&nbsp;as long as you cite us! 
Source code for front-page matches is at&nbsp;;and for in-text is at Questions &amp; feedback to <a href=\"\"></a><em>.</em></p>\n\n<p><strong><em>This work is sponsored by the Alfred P. Sloan Foundation grant #G-2021-16822.</em></strong></p>", 
  "author": [
    {
      "family": "Marx, Matt"
    }, 
    {
      "family": "Aaron Fuegi"
    }
  ], 
  "version": "v32", 
  "type": "dataset", 
  "id": "5111261"
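
The abstract says each citation carries a confidence score (1-10) and that v30 shipped duplicate patent-paper linkages. As a minimal sketch of how one might guard against both when reading a tab-separated citation file, the snippet below drops repeated (patent, paper) pairs and keeps rows at or above a chosen confidence. The column names `patent`, `paperid`, and `confscore`, the sample rows, and the helper `load_citations` are assumptions made for illustration only; the actual headers are documented in _data_description.pdf.

```python
import csv
import io

# Hypothetical sample rows. The column names are assumptions for illustration,
# not the documented headers of _pcs_mag_doi_pmid.tsv.
SAMPLE_TSV = (
    "patent\tpaperid\tconfscore\n"
    "us-1000001\t111\t10\n"
    "us-1000001\t111\t10\n"  # duplicate linkage, as in the v30 bug
    "us-1000002\t222\t3\n"
    "us-1000003\t333\t8\n"
)


def load_citations(handle, min_conf=1):
    """Return unique (patent, paper) rows whose confscore is >= min_conf."""
    seen = set()
    rows = []
    for row in csv.DictReader(handle, delimiter="\t"):
        key = (row["patent"], row["paperid"])
        if key in seen:
            continue  # skip a repeated patent-paper linkage
        seen.add(key)
        if int(row["confscore"]) >= min_conf:
            rows.append(row)
    return rows


all_rows = load_citations(io.StringIO(SAMPLE_TSV))
high_conf = load_citations(io.StringIO(SAMPLE_TSV), min_conf=8)
print(len(all_rows), len(high_conf))
```

To run this against a downloaded file, pass an open file handle instead of the `io.StringIO` sample and adjust the column names to match the data description.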
                   All versions   This version
Views              22,371         295
Downloads          37,407         282
Data volume        113.3 TB       347.2 GB
Unique views       18,156         258
Unique downloads   13,216         192