Published November 27, 2017 | Version v1.0.0
Dataset Open

Bootstrapped Lexicon of English Verbal Polarity Shifters

  • 1. Spoken Language Systems, Saarland University
  • 2. Institute for German Language, Mannheim
  • 3. Center for Information and Language Processing, LMU Munich

Description

An extended version of this dataset that also covers nominal and adjectival polarity shifters can be found at doi:10.5281/zenodo.3365601.

 

We provide a bootstrapped lexicon of English verbal polarity shifters. Our lexicon covers 3043 verbs of WordNet v3.1 (Miller et al., 1990) that are single-word or particle verbs. A polarity shifter label is given for each verb lemma.

Data

The data consists of:

  1. Two lists of WordNet verbs (Miller et al., 1990), annotated for whether they cause shifting.
    1. The initial gold standard (§2) of 2000 randomly chosen verbs.
    2. The bootstrapped 1043 verbs (§5.3) that were labelled as shifters by our best classifier and then manually annotated.
  2. A data set of verb phrases from the Amazon Product Review Data corpus (Jindal & Liu, 2008), annotated for the polarity of the phrase and of its polar noun.
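
The two verb lists add up to the full lexicon (2000 + 1043 = 3043 verbs). A minimal sketch of combining them, assuming each list has already been read into a dict from verb to label; the function name `merge_lexicons` and the sample labels are hypothetical, and the check that the lists do not overlap is inferred from the counts rather than stated in this description:

```python
def merge_lexicons(gold, bootstrapped):
    """Combine two verb -> label dicts, refusing silently overwritten entries."""
    overlap = gold.keys() & bootstrapped.keys()
    if overlap:
        # Assumption: the two lists are disjoint, since 2000 + 1043 = 3043.
        raise ValueError(f"verbs present in both lists: {sorted(overlap)}")
    merged = dict(gold)
    merged.update(bootstrapped)
    return merged


# Invented sample entries; "LABEL_A" and "LABEL_B" are placeholders,
# not the label strings used in the actual files.
gold_sample = {"abandon": "LABEL_A", "walk": "LABEL_B"}
bootstrapped_sample = {"break_down": "LABEL_A"}
merged_sample = merge_lexicons(gold_sample, bootstrapped_sample)
```

With the real files, `len(merged)` would be expected to equal 3043 if the lists are indeed disjoint.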

 

1. Verbal Shifters

Files

  • The initial gold standard: verbal_shifters.gold_standard.txt
  • The bootstrapped verbs: verbal_shifters.bootstrapping.txt

Format

  • Each line contains a verb and its label, separated by whitespace.
  • The words of a multiword expression are joined by an underscore (WORD_WORD).
  • All labels were assigned by an expert annotator.
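
The line format described above could be read roughly as follows. This is a sketch under assumptions: the function names are hypothetical, the label is treated as an opaque string because its exact spelling is not given here, and the sample lines are invented for illustration.

```python
def parse_shifter_lexicon(lines):
    """Parse 'verb label' lines into a dict mapping verb -> label."""
    lexicon = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        parts = line.split()  # split on any run of whitespace
        if len(parts) != 2:
            raise ValueError(f"expected 'verb label', got: {raw!r}")
        verb, label = parts
        lexicon[verb] = label
    return lexicon


def display_form(verb):
    """Turn the underscore-joined form (WORD_WORD) into a spaced phrase."""
    return verb.replace("_", " ")


# Invented sample lines; "LABEL_A" and "LABEL_B" are placeholders,
# not the label strings used in the actual files.
sample_lines = ["abandon LABEL_A", "walk LABEL_B", "", "break_down LABEL_A"]
lexicon = parse_shifter_lexicon(sample_lines)
```

To read one of the files, the same function can be applied to an open file handle, for example `parse_shifter_lexicon(open("verbal_shifters.gold_standard.txt", encoding="utf-8"))`; the UTF-8 encoding is an assumption.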

 

2. Sentiment Verb Phrases

Files

  • All annotated verb phrases: sentiment_phrases.txt

Content

The file starts with 400 phrases containing shifter verbs, followed by 2231 phrases containing non-shifter verbs.
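
Because the two groups are distinguished only by their position in the file, they can be recovered by slicing. A minimal sketch, assuming the items have already been read into a list in file order; the function and constant names are hypothetical:

```python
N_SHIFTER = 400        # phrases containing shifter verbs (come first)
N_NON_SHIFTER = 2231   # phrases containing non-shifter verbs (come after)


def split_by_verb_type(items):
    """Return (shifter_items, non_shifter_items) based on file order."""
    expected = N_SHIFTER + N_NON_SHIFTER  # 2631 items in total
    if len(items) != expected:
        raise ValueError(f"expected {expected} items, got {len(items)}")
    return items[:N_SHIFTER], items[N_SHIFTER:]


# Stand-in data: integers in place of real items, just to show the split.
dummy_items = list(range(N_SHIFTER + N_NON_SHIFTER))
shifter_items, non_shifter_items = split_by_verb_type(dummy_items)
```

The length check guards against an item reader that merges or drops entries, which would silently shift the boundary between the two groups.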

Format

Every item consists of:

  • The sentence from which the VP and the polar noun were extracted.
  • The VP, polar noun and the verb heading the VP.
  • Constituency parse for the VP.
  • Gold labels for VP and polar noun by a human annotator.
  • Predicted labels for VP and polar noun by the RNTN tagger (Socher et al., 2013) and the LEX_gold approach.
  • Items are separated by a line of asterisks (*).
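
Splitting the file into items could look roughly like this. The sketch assumes that a separator line consists of asterisks only; the layout of the fields inside an item is not specified above, so each item is returned as a plain list of lines. The function name and the sample text are hypothetical.

```python
def split_items(text):
    """Split file content into items at lines made up solely of asterisks."""
    items, current = [], []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped and set(stripped) == {"*"}:
            # Separator line: close the current item, if any.
            if current:
                items.append(current)
                current = []
        elif stripped:
            current.append(line)  # blank lines are dropped
    if current:
        items.append(current)  # last item when no trailing separator exists
    return items


# Invented sample text; real items contain the fields listed above.
sample_text = "sentence one\nVP one\n*****\nsentence two\nVP two\n*****\n"
sample_items = split_items(sample_text)
```

For the real data, `split_items(open("sentiment_phrases.txt", encoding="utf-8").read())` would be the corresponding call, with the UTF-8 encoding again an assumption.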

Related Resources

Attribution

This dataset was created as part of the following publication:

Marc Schulder, Michael Wiegand, Josef Ruppenhofer and Benjamin Roth (2017). "Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features". Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP). Taipei, Taiwan, November 27 - December 3, 2017. DOI: 10.5281/zenodo.3365609.

If you use the data in your research or work, please cite the publication.

 

Notes

This work was partially supported by the German Research Foundation (DFG) under grants RU 1873/2-1 and WI 4204/2-1.

Files

README.md

Files (1.3 MB)

Checksum and size of each file:

  • md5:4a17ffc27c9f3b240fbf4fe17783c89c (18.6 kB)
  • md5:fac976b3e967d1faee07c9328c3d44c1 (2.5 kB)
  • md5:e5524ce70e76b7c33a9e0580a2c7023b (1.2 MB)
  • md5:ce285a6b0fe9a75737ec065269855f3a (18.3 kB)
  • md5:50e016ea1d18825da59b49f387bd6681 (37.4 kB)

Additional details

Related works

Is part of
  • Dataset: 10.5281/zenodo.3365605 (DOI)
Is supplement to
  • Dataset: https://github.com/uds-lsv/bootstrapped-lexicon-of-english-verbal-polarity-shifters (URL)
  • Conference paper: 10.5281/zenodo.3365609 (DOI)
References
  • Dataset: 10.5281/zenodo.3370051 (DOI)