Dataset Open Access

Patent text: code, data, and new measures

Arts; Hou; Gomez


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/100_most_similar_patents.zip"
      }, 
      "checksum": "md5:cbef0725269ac2185034a30b365066a9", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "100_most_similar_patents.zip", 
      "type": "zip", 
      "size": 5103406708
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/cosine_similarity.zip"
      }, 
      "checksum": "md5:025c03d1b7f32acc75e93bc4f6d5aa38", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "cosine_similarity.zip", 
      "type": "zip", 
      "size": 80884754
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/keywords.zip"
      }, 
      "checksum": "md5:b1fe1e41a8da1c7ed8948487c7a1089f", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "keywords.zip", 
      "type": "zip", 
      "size": 903581447
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_bigrams.zip"
      }, 
      "checksum": "md5:1a0268bc4a8ca3d83deb072e558990e1", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_bigrams.zip", 
      "type": "zip", 
      "size": 68494502
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keyword_comb_1980_1989.zip"
      }, 
      "checksum": "md5:ee65ce71ae3c02319db065685420f056", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keyword_comb_1980_1989.zip", 
      "type": "zip", 
      "size": 492689117
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keyword_comb_1990_1994.zip"
      }, 
      "checksum": "md5:166d0b81fc60b8714ae77c230b648295", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keyword_comb_1990_1994.zip", 
      "type": "zip", 
      "size": 351278937
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keyword_comb_1995_1999.zip"
      }, 
      "checksum": "md5:4621d1f6feaaca64f3adec600a6c624f", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keyword_comb_1995_1999.zip", 
      "type": "zip", 
      "size": 866116127
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keyword_comb_2000_2004.zip"
      }, 
      "checksum": "md5:fc14065819616aee644b01fa2971b9e7", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keyword_comb_2000_2004.zip", 
      "type": "zip", 
      "size": 774701905
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keyword_comb_2005_2009.zip"
      }, 
      "checksum": "md5:e7167dc5b23816fbfc1ade0e3e047566", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keyword_comb_2005_2009.zip", 
      "type": "zip", 
      "size": 748119596
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keyword_comb_2010_2018.zip"
      }, 
      "checksum": "md5:f34e22ad7a57fe8aff646c5ebb08fe12", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keyword_comb_2010_2018.zip", 
      "type": "zip", 
      "size": 557666537
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keyword_comb_all.zip"
      }, 
      "checksum": "md5:764c38f0e64d0bcc20d8f7d709b4cfd1", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keyword_comb_all.zip", 
      "type": "zip", 
      "size": 3089558181
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_keywords.zip"
      }, 
      "checksum": "md5:ad9ee88d67e61888fae961fa29894148", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_keywords.zip", 
      "type": "zip", 
      "size": 10008180
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/new_trigrams.zip"
      }, 
      "checksum": "md5:ae488e2e3460afdb5df0d7ae12b5a409", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "new_trigrams.zip", 
      "type": "zip", 
      "size": 113464480
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/patent_text_measures.zip"
      }, 
      "checksum": "md5:675a63980d9deb10eb2062da80a045ce", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "patent_text_measures.zip", 
      "type": "zip", 
      "size": 100294986
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/patent%20txt%20raw.zip"
      }, 
      "checksum": "md5:5ebdfe48395eec11e4f1a2de9490132e", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "patent txt raw.zip", 
      "type": "zip", 
      "size": 6306854242
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/1000_most_similar_patents.zip"
      }, 
      "checksum": "md5:0660f13ff52576b824432ff8c6fbe628", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "1000_most_similar_patents.zip", 
      "type": "zip", 
      "size": 46024090902
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/0_Data_Description_Zenodo.pdf"
      }, 
      "checksum": "md5:ce0332320560f80efa6a86fcdbbae986", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "0_Data_Description_Zenodo.pdf", 
      "type": "pdf", 
      "size": 535293
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/greek.txt"
      }, 
      "checksum": "md5:aea2752c5e38c3ed96976e9264d88a1a", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "greek.txt", 
      "type": "txt", 
      "size": 685
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/symbols.txt"
      }, 
      "checksum": "md5:5d4d932a407310fabea7e80531a9b467", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "symbols.txt", 
      "type": "txt", 
      "size": 167
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2/stopwords.txt"
      }, 
      "checksum": "md5:d42922204201e14c015aecd0f0762bd2", 
      "bucket": "13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
      "key": "stopwords.txt", 
      "type": "txt", 
      "size": 394962
    }
  ], 
  "owners": [
    80673
  ], 
  "doi": "10.5281/zenodo.3515985", 
  "stats": {
    "version_unique_downloads": 1739.0, 
    "unique_views": 2400.0, 
    "views": 2669.0, 
    "version_views": 2671.0, 
    "unique_downloads": 1739.0, 
    "version_unique_views": 2402.0, 
    "volume": 6445003507180.0, 
    "version_downloads": 2946.0, 
    "downloads": 2946.0, 
    "version_volume": 6445003507180.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.3515985", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.3515984", 
    "bucket": "https://zenodo.org/api/files/13d44161-3ae1-4bca-bd34-dea27a7cc0a2", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.3515984.svg", 
    "html": "https://zenodo.org/record/3515985", 
    "latest_html": "https://zenodo.org/record/3515985", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.3515985.svg", 
    "latest": "https://zenodo.org/api/records/3515985"
  }, 
  "conceptdoi": "10.5281/zenodo.3515984", 
  "created": "2020-11-13T13:49:45.231431+00:00", 
  "updated": "2021-01-27T13:25:15.191212+00:00", 
  "conceptrecid": "3515984", 
  "revision": 10, 
  "id": 3515985, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.3515985", 
    "description": "<p>This Zenodo page describes data collection, processing, and different open access data files related to the text of USPTO patent documents. The document &quot;Data Description Zenodo.pdf&quot;&nbsp;provides more details.&nbsp;If you use the code or data, please cite the following paper:</p>\n\n<p>Arts S, Hou J, Gomez JC. (2020). Natural language processing to identify the creation and impact of new technologies in patent text: code, data, and new measures. Forthcoming&nbsp;<em>Research Policy</em>. (<a href=\"https://doi.org/10.1016/j.respol.2020.104144\">https://doi.org/10.1016/j.respol.2020.104144</a>)</p>", 
    "license": {
      "id": "ODC-By-1.0"
    }, 
    "title": "Patent text: code, data, and new measures", 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3515984"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "3515985"
          }
        }
      ]
    }, 
    "version": "version 1 (11/2020)", 
    "keywords": [
      "patent measures", 
      "natural language processing", 
      "novelty", 
      "impact", 
      "USPTO", 
      "technological progress", 
      "innovation"
    ], 
    "publication_date": "2020-11-13", 
    "creators": [
      {
        "orcid": "0000-0003-3214-7325", 
        "affiliation": "Sam", 
        "name": "Arts"
      }, 
      {
        "affiliation": "Jianan", 
        "name": "Hou"
      }, 
      {
        "affiliation": "Juan-Carlos", 
        "name": "Gomez"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.3515984", 
        "relation": "isVersionOf"
      }
    ]
  }
}
2,671
2,946
views
downloads
All versions This version
Views 2,6712,669
Downloads 2,9462,946
Data volume 6.4 TB6.4 TB
Unique views 2,4022,400
Unique downloads 1,7391,739

Share

Cite as