Published September 6, 2021 | Version v1
Conference paper Open

Shallow Context Analysis for German Idiom Detection

  • 1. Leipzig University
  • 2. Leibniz Institute for the German Language

Description

In order to differentiate between figurative and literal usage of verb-noun combinations for the shared task on the disambiguation of German Verbal Idioms issued for KONVENS 2021, we apply and extend an approach originally developed for detecting idioms in a dataset consisting of random ngram samples. The classification is done by implementing a rather shallow, statistics-based pipeline without intensive preprocessing and examinations on the morphosyntactic and semantic level. We describe the overall approach, the differences between the original dataset and the dataset of the KONVENS task, provide experimental classification results, and analyse the individual contributions of our feature sets.

Files

KONVENS_2021_Disambiguation_ST-Shallow_Context_Analysis_for_German_Idiom_Detection.pdf

Additional details