There is a newer version of the record available.

Published September 30, 2021 | Version 2.0.0
Dataset Open

SURel: Synchronic Usage Relatedness

  • 1. University of Stuttgart

Description

This data collection contains synchronic semantic relatedness judgments for German word usage pairs drawn from general language and the domain of cooking. Find a description of the data format, code to process the data and further datasets on the WUGsite.

We provide additional data under misc/:

  • testset: a semantic meaning shift test set with 22 German lexemes exhibiting different degrees of meaning shifts from general language to the domain of cooking. The 'mean relatedness score' denotes the annotation-based measure of semantic shift described in the paper. 'frequency GEN' and 'frequency SPEC' list the frequencies of the target words in the general-language corpus (GEN) and the domain-specific cooking corpus (SPEC). 'translations' provides English translations across senses, illustrating possible meaning shifts. Note that further senses might exist.
  • tables: the annotated table of each annotator.
  • plots: data visualization plots.

Please find more information on the provided data in the paper referenced below.

Version: 2.0.0, 30.9.2021.

Reference

Anna Hätty, Dominik Schlechtweg, Sabine Schulte im Walde. 2019. SURel: A Gold Standard for Incorporating Meaning Shifts into Term Extraction. In Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM). Minneapolis, Minnesota, USA, 2019.

Notes

WUG version

Files

surel.zip

Files (5.5 MB)

Name Size Download all
md5:e8567751665a8cec5a1255605fef9c80
5.5 MB Preview Download

Additional details

Related works

Continues
Dataset: 10.5281/zenodo.5541274 (DOI)
Is published in
Conference paper: 10.18653/v1/S19-1001 (DOI)