Dataset Open Access

Word Embedding of Amazon Product Review Corpus

Marc Schulder; Wiegand, Michael
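
The description of this dataset states that words were lemmatised and that particle verbs were merged into a single token (e.g. calm_down). The record does not say how particle verbs were detected, so the following is only a minimal sketch of the merging step. It assumes the input is already a list of lemmas and that a small lookup set of verb and particle pairs is available. The function name merge_particle_verbs and the PARTICLE_VERBS set are hypothetical and are not taken from the dataset or its publication.

```python
# Hypothetical inventory of (verb lemma, particle) pairs. The dataset's
# actual inventory and detection method are not given in this record.
PARTICLE_VERBS = {("calm", "down"), ("give", "up"), ("wear", "out")}


def merge_particle_verbs(lemmas, particle_verbs=PARTICLE_VERBS):
    """Join an adjacent verb lemma and particle into one token, e.g. calm_down."""
    merged = []
    i = 0
    while i < len(lemmas):
        pair = tuple(lemmas[i:i + 2])
        if len(pair) == 2 and pair in particle_verbs:
            merged.append("_".join(pair))
            i += 2  # skip the particle that was just merged
        else:
            merged.append(lemmas[i])
            i += 1
    return merged


print(merge_particle_verbs(["the", "music", "calm", "down", "the", "baby"]))
# ['the', 'music', 'calm_down', 'the', 'baby']
```

Token sequences preprocessed along these lines would then serve as training input for Word2Vec with the settings named in the description: CBOW mode, 500 dimensions and window size 5. This sketch only handles adjacent pairs. A particle separated from its verb (as in "calm the baby down") would need syntactic information that a simple lookup does not provide.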

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Marc Schulder</dc:creator>
  <dc:creator>Wiegand, Michael</dc:creator>
  <dc:description>A word embedding of the Amazon Product Review Corpus (Jindal and Liu, 2008).

The embedding was created using Word2Vec in CBOW mode, with 500 dimensions and a window size of 5.

Words have been lemmatised and particle verbs have been merged into a single token (e.g. calm_down).

This dataset was created as part of the following publication:

Marc Schulder, Michael Wiegand, Josef Ruppenhofer and Benjamin Roth (2017). "Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features". Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP). Taipei, Taiwan, November 27 - December 3, 2017. DOI: 10.5281/zenodo.3365609.

If you use the data in your research or work, please cite the publication.</dc:description>
  <dc:subject>Word Embedding</dc:subject>
  <dc:subject>Product Reviews</dc:subject>
  <dc:title>Word Embedding of Amazon Product Review Corpus</dc:title>