Dataset Open Access

Patent text: code, data, and new measures

Arts; Hou; Gomez

This Zenodo page describes data collection, processing, and different open access data files related to the text of USPTO patent documents. The document "Data Description Zenodo.pdf" provides more details. If you use the code or data, please cite the following paper:

Arts S, Hou J, Gomez JC. (2020). Natural language processing to identify the creation and impact of new technologies in patent text: code, data, and new measures. Forthcoming Research Policy. (https://doi.org/10.1016/j.respol.2020.104144)

Files (65.6 GB)
Name Size
0_Data_Description_Zenodo.pdf
md5:ce0332320560f80efa6a86fcdbbae986
535.3 kB Download
1000_most_similar_patents.zip
md5:0660f13ff52576b824432ff8c6fbe628
46.0 GB Download
100_most_similar_patents.zip
md5:cbef0725269ac2185034a30b365066a9
5.1 GB Download
cosine_similarity.zip
md5:025c03d1b7f32acc75e93bc4f6d5aa38
80.9 MB Download
keywords.zip
md5:b1fe1e41a8da1c7ed8948487c7a1089f
903.6 MB Download
new_bigrams.zip
md5:1a0268bc4a8ca3d83deb072e558990e1
68.5 MB Download
new_keyword_comb_1980_1989.zip
md5:ee65ce71ae3c02319db065685420f056
492.7 MB Download
new_keyword_comb_1990_1994.zip
md5:166d0b81fc60b8714ae77c230b648295
351.3 MB Download
new_keyword_comb_1995_1999.zip
md5:4621d1f6feaaca64f3adec600a6c624f
866.1 MB Download
new_keyword_comb_2000_2004.zip
md5:fc14065819616aee644b01fa2971b9e7
774.7 MB Download
new_keyword_comb_2005_2009.zip
md5:e7167dc5b23816fbfc1ade0e3e047566
748.1 MB Download
new_keyword_comb_2010_2018.zip
md5:f34e22ad7a57fe8aff646c5ebb08fe12
557.7 MB Download
new_keyword_comb_all.zip
md5:764c38f0e64d0bcc20d8f7d709b4cfd1
3.1 GB Download
new_keywords.zip
md5:ad9ee88d67e61888fae961fa29894148
10.0 MB Download
new_trigrams.zip
md5:ae488e2e3460afdb5df0d7ae12b5a409
113.5 MB Download
patent txt raw.zip
md5:5ebdfe48395eec11e4f1a2de9490132e
6.3 GB Download
patent_text_measures.zip
md5:675a63980d9deb10eb2062da80a045ce
100.3 MB Download
357
262
views
downloads
All versions This version
Views 357357
Downloads 262262
Data volume 775.6 GB775.6 GB
Unique views 295295
Unique downloads 134134

Share

Cite as