Dataset Open Access
Marc Schulder;
Wiegand, Michael
{ "inLanguage": { "alternateName": "eng", "@type": "Language", "name": "English" }, "description": "<p>A word embedding of the <a href=\"https://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html#datasets\">Amazon Product Review Corpus</a> (<a href=\"https://www.doi.org/10.1145/1341531.1341560\">Jindal and Liu, 2008</a>).</p>\n\n<p>Created using <a href=\"https://code.google.com/archive/p/word2vec/\">Word2Vec</a> in CBOW mode, 500 dimensions and window size 5.</p>\n\n<p>Words have been lemmatised and particle verbs have been merged into a single token (e.g. <code>calm_down</code>).</p>\n\n<ul>\n</ul>\n\n<p> </p>\n\n<p><strong>Attribution</strong></p>\n\n<p>This dataset was created as part of the following publication:</p>\n\n<p>Marc Schulder, Michael Wiegand, Josef Ruppenhofer and Benjamin Roth (2017). <strong>"Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features"</strong>. Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP). Taipei, Taiwan, November 27 - December 3, 2017. <a href=\"https://doi.org/10.5281/zenodo.3365609\">DOI: 10.5281/zenodo.3365609</a>.</p>\n\n<p>If you use the data in your research or work, please cite the publication.</p>", "license": "https://creativecommons.org/licenses/by/4.0/legalcode", "creator": [ { "affiliation": "Spoken Language Systems, Saarland University", "@id": "https://orcid.org/0000-0002-4183-8489", "@type": "Person", "name": "Marc Schulder" }, { "affiliation": "Spoken Language Systems, Saarland University", "@type": "Person", "name": "Wiegand, Michael" } ], "url": "https://zenodo.org/record/3370051", "datePublished": "2017-11-27", "version": "1.0.0", "keywords": [ "Word Embedding", "Product Reviews" ], "@context": "https://schema.org/", "distribution": [ { "contentUrl": "https://zenodo.org/api/files/55a96a3f-7229-4d99-9a70-b7ee6d0c5195/amazon_product_review_corpus.particle_verbs.cbow.w5.d500.txt", "encodingFormat": "txt", "@type": "DataDownload" }, { "contentUrl": "https://zenodo.org/api/files/55a96a3f-7229-4d99-9a70-b7ee6d0c5195/amazon_product_review_corpus.particle_verbs.cbow.w5.d500.voc", "encodingFormat": "voc", "@type": "DataDownload" } ], "identifier": "https://doi.org/10.5281/zenodo.3370051", "@id": "https://doi.org/10.5281/zenodo.3370051", "@type": "Dataset", "name": "Word Embedding of Amazon Product Review Corpus" }
All versions | This version | |
---|---|---|
Views | 190 | 190 |
Downloads | 67 | 67 |
Data volume | 131.6 GB | 131.6 GB |
Unique views | 164 | 164 |
Unique downloads | 42 | 42 |