Conference paper Open Access

Shallow Context Analysis for German Idiom Detection

Amin, Miriam; Fankhauser, Peter; Kupietz, Marc; Schneider, Roman

In order to differentiate between figurative and literal usage of verb-noun combinations for the shared task on the disambiguation of German Verbal Idioms issued for KONVENS 2021, we apply and extend an approach originally developed for detecting idioms in a dataset consisting of random ngram samples. The classification is done by implementing a rather shallow, statistics-based pipeline without intensive preprocessing and examinations on the morphosyntactic and semantic level. We describe the overall approach, the differences between the original dataset and the dataset of the KONVENS task, provide experimental classification results, and analyse the individual contributions of our feature sets.

Files (120.3 kB)
Name Size
120.3 kB Download
All versions This version
Views 6767
Downloads 5353
Data volume 6.4 MB6.4 MB
Unique views 4747
Unique downloads 3737


Cite as