There is a newer version of this record available.

Dataset Open Access

DISCERN: Duke Innovation & SCientific Enterprises Research Network

Arora Ashish; Belenzon Sharon; Sheer Lia

This database links innovation data to Compustat firms. When using the data, please cite "Knowledge Spillovers and Corporate Investment in Scientific Research" (Arora, Belenzon and Sheer), NBER WP 23187. Special thanks and appreciation go to Bernardo Dionisi, Honggi Lee, Dror Shvadron and JK Suh for their diligent work and dedication to this effort over the past several years.

This project introduces a major extension and improvement of the historical NBER patent dataset, which should be valuable for all researchers working with patent data linked to firms. In extending the match between Compustat firms and patents to 2015, we address two major challenges: name changes and ownership changes. These challenges are central to how patents are assigned to firms over time. To be consistent over the sample period, we reconstruct the complete historical data covered in the NBER data files.

About 30% of the Compustat firms in our sample change their name at least once. Accounting for name changes improves the accuracy and scope of the matches to patents (and other assets), of the ownership structure, and of the dynamic reassignment of GVKEY codes to companies. Dynamic reassignment means that, for instance, if a sample firm merges with another firm, the patents of the merged firm are included in the stock of patents linked to the Compustat record from that point onward, but not before.
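The dynamic reassignment rule described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Ownership` record, the `patents_linked_to` helper, the GVKEY values, and the toy patents are hypothetical and do not reflect the actual DISCERN file layout or matching procedure. The sketch only shows the idea that an assignee's patents count toward a Compustat record in the years when that assignee is linked to it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Ownership:
    """One spell linking an assignee to a GVKEY (hypothetical schema)."""
    assignee: str
    gvkey: str
    start_year: int
    end_year: Optional[int] = None  # None: the link is still in force


def patents_linked_to(gvkey, year, ownership, patents):
    """Ids of patents granted up to `year` whose assignee is linked to `gvkey` in `year`."""
    owned = {
        o.assignee
        for o in ownership
        if o.gvkey == gvkey
        and o.start_year <= year
        and (o.end_year is None or year <= o.end_year)
    }
    return sorted(
        pid for pid, assignee, grant_year in patents
        if assignee in owned and grant_year <= year
    )


# Toy data (invented): firm B is independent until 1999 and merges into firm A in 2000.
ownership = [
    Ownership("Firm A", "0001", 1980),
    Ownership("Firm B", "0002", 1980, 1999),
    Ownership("Firm B", "0001", 2000),
]
# (patent id, assignee, grant year)
patents = [
    ("P1", "Firm A", 1990),
    ("P2", "Firm B", 1995),
    ("P3", "Firm B", 2003),
]

for y in (1999, 2000, 2003):
    print(y, patents_linked_to("0001", y, ownership, patents))
```

In this toy example, firm B's patents enter the stock linked to GVKEY "0001" only from the merger year 2000 onward, while in 1999 they remain linked to firm B's own record.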

For ownership and subsidiary data, we rely on a wide range of M&A data, including SDC, historical snapshots of ORBIS files for 2002-2015, 10-K SEC filings, and NBER2006, and we perform extensive manual checks that help us uncover firms’ structure and ownership changes before proceeding to the patent match. Thus, we have extended and improved the NBER patent data. In the enclosed "Data Appendix", we document our data construction work, present several examples (“case studies”), and outline the improvements we made to existing NBER historical patent data.

Files (92.7 MB)
Name (size, checksum)
_DISCERN Data Appendix MARCH 2020.pdf (1.4 MB, md5:a5a0ea3fbdfd2a1f79bce981fb834932)
_DISCERN_DATA_MARCH_2020.zip (91.3 MB, md5:ad0112e88aff5869582abc002232648f)
_README.docx (21.1 kB, md5:3d06faf42e6fedda50427eea77d1c3fc)
  • "Why do firms invest in scientific research?", Ashish Arora, Sharon Belenzon and Lia Sheer, NBER WP 23187.

Usage statistics (all versions / this version)
Views: 17,151 / 1,416
Downloads: 16,229 / 1,743
Data volume: 360.4 GB / 25.9 GB
Unique views: 13,579 / 1,284
Unique downloads: 9,499 / 1,273
