Dataset Open Access

US Patent Similarity Data

Whalen, Ryan; Lungeanu, Alina; DeChurch, Leslie; Contractor, Noshir

Researcher(s)
Contractor, Noshir; Lungeanu, Alina; DeChurch, Leslie

Pairwise semantic similarity measures for US utility patents. Includes measures for citing/cited patent pairs, 100 most-similar patents for each patent, and doc2vec vectors for each patent.

Research Supported by NSF Award number 1856090
Files (27.0 GB)
Name Size
cite_sims.zip
md5:d51cef7d5582bd9f6877d3a9088fb1c0
1.3 GB Download
most_sim.zip
md5:6b3fa0e3e4229d25b5c3a15b4be47712
8.1 GB Download
patent_doc2v_model.zip
md5:a9d9a7f74eb5ceb9cfdab8fc953213d8
200.7 MB Download
vectors.zip
md5:95f735af0af3b098c7cdffb8b63c2884
17.4 GB Download
376
340
views
downloads
All versions This version
Views 376376
Downloads 340340
Data volume 2.6 TB2.6 TB
Unique views 333333
Unique downloads 201201

Share

Cite as