Published April 28, 2020 | Version 2.0.0
Dataset Open

DUPS: Diachronic Usage Pair Similarity

  • 1. University of Amsterdam

Description

The DUPS (Diachronic Usage Pair Similarity) dataset contains similarity judgements of English word usage pairs from different time periods, as described in the paper below. 

The WUG version of the DUPS dataset (version 2.0.0) contains diachronic Word Usage Graphs constructed from the similarity judgements of English word usage pairs contained in DUPS. In a word usage graph, the usages of a word are represented as nodes connected by edges weighted according to (human-annotated) semantic proximity. A description of the data format as well as the code used to generate the graphs from DUPS can be found at https://www.ims.uni-stuttgart.de/data/wugs.

Both versions of the DUPS dataset can be downloaded from the Files section of this web page.

Please cite this paper if you use any version of the dataset in your work:

Mario Giulianelli, Marco Del Tredici, and Raquel Fernández. 2020. Analysing Lexical Semantic Change with Contextualised Word Representations. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020). Association for Computational Linguistics.

 

Files

DUPS-WUG.zip

Files (7.0 MB)

Name Size Download all
md5:e91f6e4cd5212449e9490deaad7d8ef6
4.7 MB Preview Download
md5:b0b2b68ff01881c56a4a39e39e753339
2.3 MB Preview Download

Additional details

Funding

DREAM – Distributed dynamic REpresentations for diAlogue Management 819455
European Commission