There is a newer version of this record available.

Dataset Open Access

DISCERN: Duke Innovation & SCientific Enterprises Research Network

Arora, Ashish; Belenzon, Sharon; Sheer, Lia

This database links patent data to Compustat firms. When using the data, please cite "WHY DO FIRMS INVEST IN RESEARCH?" (Arora, Belenzon and Sheer), NBER WP 23187.

Please follow the Stata DO files to merge the data into Compustat (using the field "gvkey"). “main_do_file.do” is the main program and runs all the other DO files. See the Readme file for more detail.
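The distributed programs are Stata DO files, and the Readme remains the authoritative guide. Purely to illustrate what merging on "gvkey" means, here is a minimal Python sketch of a left join keyed on that field. Only the key name "gvkey" comes from this record; every other field name, every value, and the helper `left_join_on_gvkey` are hypothetical.

```python
# Hypothetical records: only the key name "gvkey" comes from the dataset
# description. All other fields and values are invented for illustration.
compustat_rows = [
    {"gvkey": "001", "year": 2000, "sales": 10.0},
    {"gvkey": "002", "year": 2000, "sales": 5.0},
]
patent_rows = [
    {"gvkey": "001", "patent_count": 3},
]


def left_join_on_gvkey(left, right):
    """Attach fields from `right` to each row of `left` that shares its gvkey.

    Every row of `left` is kept; rows without a match gain no extra fields.
    """
    right_by_key = {row["gvkey"]: row for row in right}
    merged = []
    for row in left:
        match = right_by_key.get(row["gvkey"], {})
        # Left-hand values win if both sides define the same field name.
        merged.append({**match, **row})
    return merged


merged = left_join_on_gvkey(compustat_rows, patent_rows)
```

In this toy input the first firm gains a `patent_count` field and the second firm is kept without one, which is the behavior of a left join from the Compustat side.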

This project introduces a major extension of and improvement to the historical NBER patent dataset, which should be valuable to all researchers working with patent data linked to firms. In extending the match between Compustat and patents to 2015, we address two major challenges: name changes and ownership changes. These challenges are central to how patents are assigned to firms over time. To be consistent over the sample period, we reconstruct the complete historical data covered in the NBER data files.

About 30% of the Compustat firms in our sample change their name at least once. Accounting for name changes improves the accuracy and scope of matches to patents (and other assets), ownership structure, and dynamic reassignments of GVKEY codes to companies. Dynamic reassignment means that, for instance, if a sample firm merges with another firm, the patents of the merged firm are included in the stock of patents linked to the Compustat record from that point onward, but not before.
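The idea of dynamic reassignment can be sketched in a few lines of Python. This is one possible reading of the description above, not the authors' implementation: the class `OwnershipSpell`, the functions `gvkey_for` and `patent_stock`, and all firm names, identifiers, and years are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class OwnershipSpell:
    """A period during which a patent assignee is linked to one GVKEY (hypothetical layout)."""
    assignee: str
    gvkey: str
    start_year: int
    end_year: Optional[int] = None  # None means the link is still in force


def gvkey_for(assignee, year, spells):
    """Return the GVKEY linked to `assignee` in `year`, or None if there is no link."""
    for s in spells:
        still_open = s.end_year is None or year <= s.end_year
        if s.assignee == assignee and s.start_year <= year and still_open:
            return s.gvkey
    return None


def patent_stock(gvkey, year, patents, spells):
    """Count patents granted up to `year` whose assignee is linked to `gvkey` in `year`."""
    return sum(
        1
        for assignee, grant_year in patents
        if grant_year <= year and gvkey_for(assignee, year, spells) == gvkey
    )


# Invented example: TargetCo is independent through 1999, then merges into AcquirerCo.
spells = [
    OwnershipSpell("TargetCo", "T100", 1980, 1999),
    OwnershipSpell("TargetCo", "A200", 2000),
    OwnershipSpell("AcquirerCo", "A200", 1980),
]
patents = [("AcquirerCo", 1995), ("TargetCo", 1996), ("TargetCo", 2001)]

stock_before = patent_stock("A200", 1999, patents, spells)  # before the merger
stock_after = patent_stock("A200", 2001, patents, spells)   # after the merger
```

Before the merger year, only AcquirerCo's own patent counts toward the acquirer's record. From the merger year onward, TargetCo's patents, including those granted earlier, count as well.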

For ownership and subsidiary data, we rely on a wide range of M&A data, including SDC, historical snapshots of ORBIS files for 2002-2015, 10-K SEC filings, and NBER2006. We also perform extensive manual checks that help us uncover firms’ structure and ownership changes before proceeding to the patent match. In this way, we extend and improve the NBER patent data. In the enclosed "Data Appendix", we document our data construction work, present several examples (“case studies”), and outline the improvements we made to the existing NBER historical patent data.

Files (397.7 MB)
Name (Size) MD5
_Data description.pdf (1.4 MB) md5:886964b5952f9fac1eae7a8ad79dca5a
DISCERN_Panal_Data_1980_2015.dta.zip (2.1 MB) md5:998953d7d6f5cdc96285866a4df8a32d
DISCERN_patent_database_1980_2015_final1.dta.zip (29.3 MB) md5:afe7710c1eb248bf0073378907e8651e
DISCERN_SUB_name_list.dta.zip (1.4 MB) md5:03b4400e893eddb3f7e084b3888fee1c
DISCERN_UO_name_list.dta.zip (357.6 kB) md5:4e8eb46833f40f876ed08afba9f83530
dyn_match_All.dta (5.3 MB) md5:147e4c933fb8ef89ec45eee2726af817
fillin_gap_years.dta (25.3 kB) md5:3171dbb9e09281c49fe97db1a87dbb82
pat_per_year_permno_adj.dta (2.4 MB) md5:6ef29b710f5bab33efe587d4a8aa941c
pat_stock_permno_adj.dta (1.0 MB) md5:9b892b80cda3889c24ae64b1284d17e7
patent_1980_2015.dta (327.1 MB) md5:3a7cedcb59254f369136dee9d83ddac9
patent_firms.dta (37.5 kB) md5:eefd71676779a6d67e0c71e70bf69b71
patent_match_id_name.dta.zip (26.5 MB) md5:620d9fa9705367ae0ba7c21c917edc50
permno_gvkey.dta (646.3 kB) md5:3bf394cc11200aebab3f4afb85c26180
permno_min_max_year_adj_80_15.dta (88.9 kB) md5:5a0686b6ed0d67273c496c3ec65c353f
programs.zip (8.1 kB) md5:f2e0fe46c9a76275860dcdddc23d2806
README.docx (20.0 kB) md5:6c8613ee3fc1678a759aae55aba809e9
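The md5 values listed for each file can be used to check that a download arrived intact. The following is a minimal sketch using Python's standard `hashlib` module. The function names `md5_of_file` and `matches_listed_md5` are hypothetical helpers, and the sketch assumes the file has already been saved locally.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Reading in chunks keeps memory use low for the larger .dta files.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a local file against a listed value such as 'md5:6c86...'."""
    # Accept the value with or without the leading "md5:" label.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, assuming README.docx was saved to the working directory, `matches_listed_md5("README.docx", "md5:6c8613ee3fc1678a759aae55aba809e9")` should return `True` if the download is complete and unmodified.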
  • "Why do firms invest in scientific research?", Ashish Arora, Sharon Belenzon and Lia Sheer, NBER WP 23187.

