Dataset Open Access

Word Embedding of Amazon Product Review Corpus

Marc Schulder; Wiegand, Michael


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>A word embedding of the <a href=\"https://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html#datasets\">Amazon Product Review Corpus</a> (<a href=\"https://www.doi.org/10.1145/1341531.1341560\">Jindal and Liu, 2008</a>).</p>\n\n<p>Created using <a href=\"https://code.google.com/archive/p/word2vec/\">Word2Vec</a> in CBOW mode, 500 dimensions and window size 5.</p>\n\n<p>Words have been lemmatised and particle verbs have been merged into a single token (e.g. <code>calm_down</code>).</p>\n\n<ul>\n</ul>\n\n<p>&nbsp;</p>\n\n<p><strong>Attribution</strong></p>\n\n<p>This dataset was created as part of the following publication:</p>\n\n<p>Marc Schulder,&nbsp;Michael Wiegand,&nbsp;Josef Ruppenhofer&nbsp;and&nbsp;Benjamin Roth&nbsp;(2017).&nbsp;<strong>&quot;Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features&quot;</strong>. Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP). Taipei, Taiwan, November 27 - December 3, 2017.&nbsp;<a href=\"https://doi.org/10.5281/zenodo.3365609\">DOI: 10.5281/zenodo.3365609</a>.</p>\n\n<p>If you use the data in your research or work, please cite the publication.</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Spoken Language Systems, Saarland University", 
      "@id": "https://orcid.org/0000-0002-4183-8489", 
      "@type": "Person", 
      "name": "Marc Schulder"
    }, 
    {
      "affiliation": "Spoken Language Systems, Saarland University", 
      "@type": "Person", 
      "name": "Wiegand, Michael"
    }
  ], 
  "url": "https://zenodo.org/record/3370051", 
  "datePublished": "2017-11-27", 
  "version": "1.0.0", 
  "keywords": [
    "Word Embedding", 
    "Product Reviews"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/55a96a3f-7229-4d99-9a70-b7ee6d0c5195/amazon_product_review_corpus.particle_verbs.cbow.w5.d500.txt", 
      "encodingFormat": "txt", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/55a96a3f-7229-4d99-9a70-b7ee6d0c5195/amazon_product_review_corpus.particle_verbs.cbow.w5.d500.voc", 
      "encodingFormat": "voc", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.3370051", 
  "@id": "https://doi.org/10.5281/zenodo.3370051", 
  "@type": "Dataset", 
  "name": "Word Embedding of Amazon Product Review Corpus"
}
190
67
views
downloads
All versions This version
Views 190190
Downloads 6767
Data volume 131.6 GB131.6 GB
Unique views 164164
Unique downloads 4242

Share

Cite as