There is a newer version of this record available.

Dataset Open Access

Reliance on Science

Marx, Matt; Aaron Fuegi


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Marx, Matt and Aaron Fuegi, "Reliance on Science in Patenting: USPTO Front-Page Citations to Scientific Articles" (https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3331686)</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Sinha, Arnab, Zhihong Shen, Yang Song, Hao Ma, Darrin Eide, Bo-June (Paul) Hsu, and Kuansan Wang. 2015. An Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). ACM, New York, NY, USA, 243-246</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">innovation, patenting, science, citation</subfield>
  </datafield>
  <controlfield tag="005">20230601192108.0</controlfield>
  <controlfield tag="001">7903131</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Boston University</subfield>
    <subfield code="a">Aaron Fuegi</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">3100517735</subfield>
    <subfield code="z">md5:3a35d65f9241074976b1083bca7fd96e</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/authoridname_normalized.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">272069</subfield>
    <subfield code="z">md5:0d20284aadeb443ad48eac1d00ae503f</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/bodytextknowngood.tsv</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">82850</subfield>
    <subfield code="z">md5:d671dffead5994cfad1fa88848a1049c</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/conferenceidname.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">218818</subfield>
    <subfield code="z">md5:4f75a37058a6a67aa463d0ea08d0df1c</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/_data_description.pdf</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">5224591</subfield>
    <subfield code="z">md5:c2f351238565d2216136aeaacdf55914</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/jcif.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">8123343</subfield>
    <subfield code="z">md5:7c66b0a4d51721179ce103ce9fdb35c9</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/jif.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1537887</subfield>
    <subfield code="z">md5:12a865c40b44735fe82557bd42ff2152</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/journalidnameissn.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2166111020</subfield>
    <subfield code="z">md5:bbe297e3f6a71b79d3b754ab00c3eba0</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/magfield_oecd_wos_crosswalk.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">9317468501</subfield>
    <subfield code="z">md5:4de658f319d6243f182fa4f34f3f2669</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/paperauthoridaffiliationname.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">4399536130</subfield>
    <subfield code="z">md5:ae79bbdfc7820c2f4841ab8f3f965449</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/paperauthororder.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">10932047112</subfield>
    <subfield code="z">md5:2c3434f1ca91478901fa79bea665370b</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/papercitations.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">550860111</subfield>
    <subfield code="z">md5:5434339c22fda4ae7b03a34ad496fd55</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/paperconferenceid.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">973368155</subfield>
    <subfield code="z">md5:6874e40f9e0f868e39501d9d8ed3fc74</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/paperjournalid.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1354359993</subfield>
    <subfield code="z">md5:f272c7ac3db9f98f7b5d757c2efd5d3d</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/papervolisspages.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">752201896</subfield>
    <subfield code="z">md5:1153ec5319607a6dff643952a5393f12</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/paperyear.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2943032</subfield>
    <subfield code="z">md5:e3ee27cd33b2ec6bf1efc586ba5ee66a</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/_patent_paper_pairs.tsv</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">3723316576</subfield>
    <subfield code="z">md5:718c1d0e9e78fe4ef2229af7189f94f0</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/_pcs_mag_doi_pmid.tsv</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2023-05-08</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:7903131</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Cornell University</subfield>
    <subfield code="0">(orcid)0000-0002-6173-4142</subfield>
    <subfield code="a">Marx, Matt</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Reliance on Science</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://opendatacommons.org/licenses/by/1.0/</subfield>
    <subfield code="a">Open Data Commons Attribution License v1.0</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This dataset contains both front-page and in-text citations from patents to scientific articles, as well as &lt;strong&gt;patent-paper&lt;/strong&gt;&amp;nbsp;&lt;strong&gt;pairs&lt;/strong&gt;, through 2021. &amp;nbsp;&lt;em&gt;If you use the data, please cite &lt;/em&gt;these two articles:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. M. Marx &amp;amp; A. Fuegi, &amp;quot;Reliance on Science by Inventors: Hybrid Extraction of In-text Patent-to-Article Citations.&amp;quot; &lt;/strong&gt;&amp;nbsp;&lt;em&gt;forthcoming in Journal of Economics and Management Strategy.&amp;nbsp;&lt;/em&gt;(&lt;a href="http://doi.org/10.1111/jems.12455"&gt;http://doi.org/10.1111/jems.12455&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. M. Marx, &amp;amp; A.&amp;nbsp;Fuegi, &amp;quot;Reliance on Science: Worldwide Front-Page Patent Citations to Scientific Articles&amp;quot; (2020),&amp;nbsp;&lt;em&gt;Strategic Management Journal 41(9):1572-1594&lt;/em&gt;. (&lt;/strong&gt;&lt;a href="https://onlinelibrary.wiley.com/doi/full/10.1002/smj.3145"&gt;https://onlinelibrary.wiley.com/doi/full/10.1002/smj.3145&lt;/a&gt;&lt;strong&gt;)&amp;nbsp;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;The datafile containing the citations is &lt;strong&gt;_pcs_mag_doi_pmid.tsv.&amp;nbsp;&lt;/strong&gt;DOIs and PMIDs are provided where available. Each citation has an&amp;nbsp;applicant/examiner flag, a confidence score&amp;nbsp;(1-10), and&amp;nbsp;an indicator of whether the reference appeared a) only on the front page, b) only in the body text, or c) in both. Each paper-patent citation also includes the temporal gap&amp;nbsp;and three related measures of self-citation (i.e., whether one or more of the inventors on the citing patent was also an author on the cited paper).&amp;nbsp;&lt;strong&gt;_reliance_on_science.pdf&lt;/strong&gt;&amp;nbsp;has full details.&amp;nbsp;&lt;strong&gt;bodytextknowngood.tsv&lt;/strong&gt;&amp;nbsp;contains the known-good references for calculating recall.&lt;/p&gt;

&lt;p&gt;The datafile containing the patent-paper pairs (PPPs) is &lt;strong&gt;_patent_paper_pairs.tsv&lt;/strong&gt;. These are USPTO only. Each PPP has a confidence score and the count of days between the publication of the paper and the filing of the patent. (If the patent is a continuation of another patent, the filing date of the original patent is used.) Also, when a paper is paired with multiple patents, an indicator variable reports whether those patents are continuations or otherwise identical.&lt;/p&gt;

&lt;p&gt;The remaining files redistribute much of the *final* edition of the&amp;nbsp;&lt;a href="http://aka.ms/msracad"&gt;Microsoft Academic Graph&lt;/a&gt;&amp;nbsp;(12/20/2021). Please also cite&amp;nbsp;Sinha, A, et al. 2015. An Overview of Microsoft Academic Service (MAS) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW &amp;rsquo;15 Companion). ACM, New York, NY, USA, 243-246. Note that jif.zip, jcif.zip, and the OECD/WoS-category crosswalks are derivatives of MAG and may not be updated through the end of 2021.&lt;/p&gt;

&lt;p&gt;These data are under an&amp;nbsp;Open Data Commons Attribution license (ODC-By);&amp;nbsp;use them for anything&amp;nbsp;as long as you cite us! Source code for front-page matches is at&amp;nbsp;https://github.com/mattmarx/reliance_on_science&amp;nbsp;and for in-text is at https://github.com/mattmarx/intextcitations. Questions &amp;amp; feedback to &lt;a href="mailto:support@relianceonscience.org"&gt;support@relianceonscience.org&lt;/a&gt;&lt;em&gt;.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;em&gt;This work is sponsored by the Alfred P. Sloan Foundation grant #G-2021-16822.&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3236339</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.7903131</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
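
The 856 fields in the record above each carry a file size (subfield `s`), an MD5 checksum (subfield `z`), and a download URL (subfield `u`). As a minimal sketch, not an official Zenodo client, the Python below shows one way to read those fields and to compute a local file's MD5 for comparison. The function names `list_files` and `md5_of` and the `SAMPLE` constant are hypothetical names introduced here for illustration; `SAMPLE` holds two 856 fields copied from the record and wrapped in a minimal `<record>` element.

```python
import hashlib
import xml.etree.ElementTree as ET

# Namespace declared on the <record> element of the MARC21 export.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Two 856 fields copied from the record above (illustrative excerpt only).
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">272069</subfield>
    <subfield code="z">md5:0d20284aadeb443ad48eac1d00ae503f</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/bodytextknowngood.tsv</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2943032</subfield>
    <subfield code="z">md5:e3ee27cd33b2ec6bf1efc586ba5ee66a</subfield>
    <subfield code="u">https://zenodo.org/record/7903131/files/_patent_paper_pairs.tsv</subfield>
  </datafield>
</record>"""


def list_files(marc_xml):
    """Return one dict per 856 field: url, size in bytes, MD5 hex digest."""
    root = ET.fromstring(marc_xml)
    files = []
    for field in root.findall("marc:datafield[@tag='856']", MARC_NS):
        # Map each subfield code ("s", "z", "u") to its text content.
        sub = {
            s.get("code"): (s.text or "")
            for s in field.findall("marc:subfield", MARC_NS)
        }
        files.append({
            "url": sub.get("u", ""),
            "size": int(sub["s"]) if sub.get("s") else None,
            # Drop the "md5:" prefix used in subfield z of this record.
            "md5": sub.get("z", "").split(":", 1)[-1],
        })
    return files


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a local file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


for entry in list_files(SAMPLE):
    print(entry["size"], entry["md5"], entry["url"])
```

After downloading a file, comparing `md5_of(local_path)` with the matching entry's `md5` value is a quick integrity check. Reading in chunks matters here because several archives in the record are multiple gigabytes in size.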