Dataset Open Access

DUPS: Diachronic Usage Pair Similarity

Giulianelli, Mario; Del Tredici, Marco; Fernández, Raquel

The DUPS (Diachronic Usage Pair Similarity) dataset contains similarity judgements of English word usage pairs from different time periods, as described in the paper below. 

The WUG version of the DUPS dataset (version 2.0.0) contains diachronic Word Usage Graphs constructed from the similarity judgements of English word usage pairs contained in DUPS. In a word usage graph, the usages of a word are represented as nodes connected by edges weighted according to (human-annotated) semantic proximity. A description of the data format as well as the code used to generate the graphs from DUPS can be found at

Both versions of the DUPS dataset can be downloaded from the Files section of this web page.

Please cite this paper if you use any version of the dataset in your work:

Mario Giulianelli, Marco Del Tredici, and Raquel Fernández. 2020. Analysing Lexical Semantic Change with Contextualised Word Representations. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020). Association for Computational Linguistics.


Files (7.0 MB)
Name Size
4.7 MB Download
2.3 MB Download
All versions This version
Views 867189
Downloads 16551
Data volume 475.7 MB214.8 MB
Unique views 776165
Unique downloads 13932


Cite as