Planned intervention: On Wednesday June 26th 05:30 UTC Zenodo will be unavailable for 10-20 minutes to perform a storage cluster upgrade.
Published November 2, 2020 | Version 1
Dataset Open

Towards Olfactory Information Extraction from Text: A Case Study on Detecting Smell Experiences in Novels

  • 1. University of Amsterdam
  • 2. KNAW Humanities Cluster

Description

Dataset accompanying "Ryan Brate, Paul Groth and Marieke van Erp (2020) Towards Olfactory Information Extraction from Text: A Case Study on Detecting Smell Experiences in Novels. LaTeCH-CLfL 2020. Barcelona, December 2020."

Abstract:

Environmental factors determine the smells we perceive, but societal factors factors shape the importance, sentiment and biases we give to them. Descriptions of smells in text, or as we call them `smell experiences', offer a window into these factors, but they must first be identified. To the best of our knowledge, no tool exists to extract references to smell experiences from text. In this paper, we present two variations on a semi-supervised approach to identify smell experiences in English literature. The combined set of patterns from both implementations offer significantly better performance than a keyword-based baseline.

Files

datasets.zip

Files (152.7 MB)

Name Size Download all
md5:2c43999b78c16c35d14bbe1b88869081
152.7 MB Preview Download