SURel: Synchronic Usage Relatedness
Description
This data collection contains synchronic semantic relatedness judgments for German word usage pairs drawn from general language and the domain of cooking. Find a description of the data format, code to process the data and further datasets on the WUGsite.
We provide additional data under misc/
:
- testset: a semantic meaning shift test set with 22 German lexemes exhibiting different degrees of meaning shifts from general language to the domain of cooking. The 'mean relatedness score' denotes the annotation-based measure of semantic shift described in the paper. 'frequency GEN' and 'frequency SPEC' list the frequencies of the target words in the general-language corpus (GEN) and the domain-specific cooking corpus (SPEC). 'translations' provides English translations across senses, illustrating possible meaning shifts. Note that further senses might exist.
- tables: the annotated table of each annotator.
- plots: data visualization plots.
Please find more information on the provided data in the paper referenced below.
Version: 2.0.0, 30.9.2021.
Reference
Anna Hätty, Dominik Schlechtweg, Sabine Schulte im Walde. 2019. SURel: A Gold Standard for Incorporating Meaning Shifts into Term Extraction. In Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM). Minneapolis, Minnesota, USA, 2019.
Notes
Files
surel.zip
Files
(5.5 MB)
Name | Size | Download all |
---|---|---|
md5:e8567751665a8cec5a1255605fef9c80
|
5.5 MB | Preview Download |
Additional details
Related works
- Continues
- Dataset: 10.5281/zenodo.5541274 (DOI)
- Is published in
- Conference paper: 10.18653/v1/S19-1001 (DOI)