Bootstrapped Lexicon of English Verbal Polarity Shifters
- 1. Spoken Language Systems, Saarland University
- 2. Institute for German Language, Mannheim
- 3. Center for Information and Language Processing, LMU Munich
Description
An extended version of this dataset that also covers nominal and adjectival polarity shifters can be found at doi:10.5281/zenodo.3365601.
We provide a bootstrapped lexicon of English verbal polarity shifters. Our lexicon covers 3043 verbs of WordNet v3.1 (Miller et al., 1990) that are single-word or particle verbs. A polarity shifter label is given for each verb lemma.
Data
The data consists of:
- Two lists of WordNet verbs (Miller et al., 1990), annotated for whether they cause shifting.
  - The initial gold standard (§2) of 2000 randomly chosen verbs.
  - The bootstrapped 1043 verbs (§5.3) that were labelled as shifters by our best classifier and then manually annotated.
- Data set of verb phrases from the Amazon Product Review Data corpus (Jindal & Liu, 2008), annotated for polarity of phrase and polar noun.
1. Verbal Shifters
Files
- The initial gold standard: verbal_shifters.gold_standard.txt
- The bootstrapped verbs: verbal_shifters.bootstrapping.txt
Format
- Each line contains a verb and its label, separated by whitespace.
- Multiword expressions are separated by an underscore (WORD_WORD).
- All labels were assigned by an expert annotator.
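The format described above can be read with a few lines of Python. The following is a minimal sketch, not part of the original distribution: the function names `parse_lexicon`, `load_lexicon` and `display_form` are hypothetical, and the label strings are passed through unchanged because this README does not list the label values used in the files.

```python
def parse_lexicon(lines):
    """Map each verb to its label, given lines of the form 'VERB LABEL'."""
    lexicon = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # tolerate blank lines (an assumption, not documented)
        parts = line.split()
        if len(parts) != 2:
            raise ValueError(f"expected 'VERB LABEL', got: {line!r}")
        verb, label = parts
        lexicon[verb] = label  # label is kept verbatim as found in the file
    return lexicon


def load_lexicon(path):
    """Read a lexicon file such as verbal_shifters.gold_standard.txt."""
    with open(path, encoding="utf-8") as handle:
        return parse_lexicon(handle)


def display_form(verb):
    """Turn the WORD_WORD notation of multiword expressions into 'WORD WORD'."""
    return verb.replace("_", " ")
```

For example, `load_lexicon("verbal_shifters.gold_standard.txt")` should return a dictionary keyed by verb lemma, assuming the file sits in the working directory, and `display_form` can be applied to the keys when particle verbs need to be matched against running text.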
2. Sentiment Verb Phrases
Files
- All annotated verb phrases: sentiment_phrases.txt
Content
The file starts with 400 phrases containing shifter verbs, followed by 2231 phrases containing non-shifter verbs.
Format
Every item consists of:
- The sentence from which the VP and the polar noun were extracted.
- The VP, polar noun and the verb heading the VP.
- Constituency parse for the VP.
- Gold labels for VP and polar noun by a human annotator.
- Predicted labels for VP and polar noun by RNTN tagger (Socher et al., 2013) and LEX_gold approach.
- Items are separated by a line of asterisks (*).
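Since items are delimited by lines of asterisks and the file lists the phrases with shifter verbs first, the file can be split into items and then into the two groups. The sketch below is illustrative only: the names `split_items` and `split_by_verb_type` are hypothetical, a separator is assumed to be a line consisting solely of asterisks, and the fields inside each item are left as raw lines because their exact layout is not specified here.

```python
import re

# Assumption: a separator line contains nothing but one or more asterisks.
SEPARATOR = re.compile(r"^\*+$")


def split_items(lines):
    """Group the lines of sentiment_phrases.txt into items (lists of raw lines)."""
    items, current = [], []
    for raw in lines:
        line = raw.rstrip("\n")
        if SEPARATOR.match(line.strip()):
            if current:  # close the item collected so far
                items.append(current)
                current = []
        elif line.strip():
            current.append(line)
    if current:  # last item may not be followed by a separator
        items.append(current)
    return items


def split_by_verb_type(items, n_shifter=400):
    """Return (shifter items, non-shifter items), relying on the file order."""
    return items[:n_shifter], items[n_shifter:]
```

With the full file, `split_by_verb_type(split_items(open("sentiment_phrases.txt", encoding="utf-8")))` should yield 400 and 2231 items respectively if the separator assumption holds; a different count would indicate that the item boundaries are encoded differently.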
Related Resources
- Paper: ACL Anthology or DOI: 10.5281/zenodo.3365609
- Presentation: ACL Anthology
- Word Embedding: DOI: 10.5281/zenodo.3370051
Attribution
This dataset was created as part of the following publication:
Marc Schulder, Michael Wiegand, Josef Ruppenhofer and Benjamin Roth (2017). "Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features". Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP). Taipei, Taiwan, November 27 - December 3, 2017. DOI: 10.5281/zenodo.3365609.
If you use the data in your research or work, please cite the publication.
Additional details
Related works
- Is part of
- Dataset: 10.5281/zenodo.3365605 (DOI)
- Is supplement to
- Dataset: https://github.com/uds-lsv/bootstrapped-lexicon-of-english-verbal-polarity-shifters (URL)
- Conference paper: 10.5281/zenodo.3365609 (DOI)
- References
- Dataset: 10.5281/zenodo.3370051 (DOI)