Towards Olfactory Information Extraction from Text: A Case Study on Detecting Smell Experiences in Novels
Creators
- 1. University of Amsterdam
- 2. KNAW Humanities Cluster
Description
Dataset accompanying "Ryan Brate, Paul Groth and Marieke van Erp (2020) Towards Olfactory Information Extraction from Text: A Case Study on Detecting Smell Experiences in Novels. LaTeCH-CLfL 2020. Barcelona, December 2020."
Abstract:
Environmental factors determine the smells we perceive, but societal factors factors shape the importance, sentiment and biases we give to them. Descriptions of smells in text, or as we call them `smell experiences', offer a window into these factors, but they must first be identified. To the best of our knowledge, no tool exists to extract references to smell experiences from text. In this paper, we present two variations on a semi-supervised approach to identify smell experiences in English literature. The combined set of patterns from both implementations offer significantly better performance than a keyword-based baseline.
Files
datasets.zip
Files
(152.7 MB)
Name | Size | Download all |
---|---|---|
md5:2c43999b78c16c35d14bbe1b88869081
|
152.7 MB | Preview Download |