Dataset Open Access
Marc Schulder;
Wiegand, Michael
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd"> <identifier identifierType="DOI">10.5281/zenodo.3370051</identifier> <creators> <creator> <creatorName>Marc Schulder</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-4183-8489</nameIdentifier> <affiliation>Spoken Language Systems, Saarland University</affiliation> </creator> <creator> <creatorName>Wiegand, Michael</creatorName> <givenName>Michael</givenName> <familyName>Wiegand</familyName> <affiliation>Spoken Language Systems, Saarland University</affiliation> </creator> </creators> <titles> <title>Word Embedding of Amazon Product Review Corpus</title> </titles> <publisher>Zenodo</publisher> <publicationYear>2017</publicationYear> <subjects> <subject>Word Embedding</subject> <subject>Product Reviews</subject> </subjects> <dates> <date dateType="Issued">2017-11-27</date> </dates> <language>en</language> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/3370051</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementTo" resourceTypeGeneral="ConferencePaper">10.5281/zenodo.3365609</relatedIdentifier> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.3370050</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/natural-language-processing</relatedIdentifier> </relatedIdentifiers> <version>1.0.0</version> <rightsList> <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights> <rights 
rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p>A word embedding of the <a href="https://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html#datasets">Amazon Product Review Corpus</a> (<a href="https://www.doi.org/10.1145/1341531.1341560">Jindal and Liu, 2008</a>).</p> <p>Created using <a href="https://code.google.com/archive/p/word2vec/">Word2Vec</a> in CBOW mode with 500 dimensions and a window size of 5.</p> <p>Words have been lemmatised and particle verbs have been merged into a single token (e.g. <code>calm_down</code>).</p> <p><strong>Attribution</strong></p> <p>This dataset was created as part of the following publication:</p> <p>Marc Schulder,&nbsp;Michael Wiegand,&nbsp;Josef Ruppenhofer&nbsp;and&nbsp;Benjamin Roth&nbsp;(2017).&nbsp;<strong>&quot;Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features&quot;</strong>. Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP). Taipei, Taiwan, November 27 - December 3, 2017.&nbsp;<a href="https://doi.org/10.5281/zenodo.3365609">DOI: 10.5281/zenodo.3365609</a>.</p> <p>If you use the data in your research or work, please cite the publication.</p></description> <description descriptionType="Other">{"references": ["Jindal, Nitin and Bing Liu (2008). \"Opinion Spam and Analysis.\" In: Proceedings of the International Conference on Web Search and Data Mining (WSDM). Palo Alto, California, USA: Association for Computing Machinery, pp. 219\u2013230. isbn: 978-1-59593-927-2. doi: 10.1145/1341531.1341560"]}</description> </descriptions> </resource>
| | All versions | This version |
|---|---|---|
| Views | 190 | 190 |
| Downloads | 67 | 67 |
| Data volume | 131.6 GB | 131.6 GB |
| Unique views | 164 | 164 |
| Unique downloads | 42 | 42 |