Published November 25, 2019 | Version 2
Dataset Open

US Patent Similarity Data

  • 1. University of Hong Kong
  • 2. Northwestern University
  • 1. Northwestern University

Description

Pairwise semantic similarity measures for US utility patents. The dataset includes similarity measures for citing/cited patent pairs, the 100 most similar patents for each patent, and a doc2vec vector for each patent. The second edition adds the .npy file needed to generate new text embeddings with the pre-trained model.
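As a rough illustration of how a "most similar patents" list can be derived from document vectors, the sketch below ranks a handful of toy vectors by cosine similarity. It is an assumption that cosine similarity is the measure involved (it is a common choice for doc2vec vectors, but the description does not say which measure the dataset uses). The identifiers `A` to `D`, the three-dimensional vectors, and the helper names `cosine_similarity` and `most_similar` are hypothetical and do not come from the dataset. The code uses only the standard library to stay self-contained; with the real vectors, an array library would be the practical choice.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as unrelated.
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(vectors, query_id, k):
    """Return the k (id, similarity) pairs closest to query_id, best first."""
    query = vectors[query_id]
    scored = [
        (other_id, cosine_similarity(query, vec))
        for other_id, vec in vectors.items()
        if other_id != query_id  # never report a document as its own neighbour
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy stand-ins for per-patent vectors (hypothetical ids and values).
toy_vectors = {
    "A": [1.0, 0.0, 0.0],
    "B": [0.9, 0.1, 0.0],
    "C": [0.0, 1.0, 0.0],
    "D": [-1.0, 0.0, 0.0],
}

top_two = most_similar(toy_vectors, "A", k=2)
print(top_two)
```

Here `B` ranks first for `A` because it points in nearly the same direction, `C` is orthogonal, and `D` points the opposite way and is cut off by `k=2`. The dataset's 100-most-similar lists correspond to the same idea with a much larger `k` and vector collection.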

Notes

Research supported by NSF award number 1856090.

Files

cite_sims.zip

Files (32.2 GB)

MD5 checksum                           Size
md5:d51cef7d5582bd9f6877d3a9088fb1c0   1.3 GB
md5:6b3fa0e3e4229d25b5c3a15b4be47712   8.1 GB
md5:9824990e0cb0b605b7f434f4581ace0d   5.2 GB
md5:2f6020b8196cac2d8fb922eb89a232c8   200.1 MB
md5:95f735af0af3b098c7cdffb8b63c2884   17.4 GB
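Because the archives are large, it can be worth checking a download against the MD5 value listed for it. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_listing` and the 1 MiB chunk size are illustrative choices, and the local file path in the usage note is a placeholder.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat even for multi-gigabyte archives.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing("path/to/downloaded_file.zip", "md5:...")` returns `True` only if the local file's digest equals the value copied from the listing for that file. A mismatch usually points to an incomplete or corrupted download.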