Dataset Open Access

US Patent Similarity Data

Whalen, Ryan; Lungeanu, Alina; DeChurch, Leslie; Contractor, Noshir

Researcher(s)
Contractor, Noshir; Lungeanu, Alina; DeChurch, Leslie

Pairwise semantic similarity measures for US utility patents. Includes measures for citing/cited patent pairs, 100 most-similar patents for each patent, and doc2vec vectors for each patent.

Research Supported by NSF Award number 1856090
Files (27.0 GB)
Name Size
cite_sims.zip
md5:d51cef7d5582bd9f6877d3a9088fb1c0
1.3 GB Download
most_sim.zip
md5:6b3fa0e3e4229d25b5c3a15b4be47712
8.1 GB Download
patent_doc2v_model.zip
md5:a9d9a7f74eb5ceb9cfdab8fc953213d8
200.7 MB Download
vectors.zip
md5:95f735af0af3b098c7cdffb8b63c2884
17.4 GB Download
151
173
views
downloads
All versions This version
Views 151151
Downloads 173173
Data volume 1.4 TB1.4 TB
Unique views 129129
Unique downloads 9393

Share

Cite as