DUPS: Diachronic Usage Pair Similarity

doi:10.5281/zenodo.5500223

Published April 28, 2020 | Version 2.0.0

Dataset Open

Affiliation: University of Amsterdam

The DUPS (Diachronic Usage Pair Similarity) dataset contains human similarity judgements for pairs of English word usages drawn from different time periods, as described in the paper cited below.

The WUG version of the DUPS dataset (version 2.0.0) contains diachronic Word Usage Graphs constructed from the similarity judgements of English word usage pairs contained in DUPS. In a word usage graph, the usages of a word are represented as nodes connected by edges weighted according to (human-annotated) semantic proximity. A description of the data format as well as the code used to generate the graphs from DUPS can be found at https://www.ims.uni-stuttgart.de/data/wugs.
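
As a rough illustration of the graph structure described above, the following Python sketch builds a small weighted, undirected usage graph from pairwise judgements. The usage identifiers, the scores, the `build_usage_graph` helper, and the choice to average repeated judgements are all assumptions made for this example; they do not reflect the actual DUPS-WUG file format or the code used to generate the released graphs.

```python
from collections import defaultdict

# Hypothetical judgements: (usage_id_1, usage_id_2, proximity score).
# Identifiers and scores are invented for illustration only.
judgements = [
    ("u1", "u2", 4.0),
    ("u1", "u3", 1.0),
    ("u2", "u3", 2.0),
    ("u1", "u2", 3.0),  # a second judgement of the same pair
]


def build_usage_graph(judgements):
    """Return {node: {neighbour: weight}} for an undirected usage graph.

    Repeated judgements of the same pair are averaged; this aggregation
    is an assumption of the sketch, not the dataset's documented method.
    """
    scores = defaultdict(list)
    for a, b, score in judgements:
        if a == b:
            continue  # ignore self-pairs in this sketch
        scores[tuple(sorted((a, b)))].append(score)

    graph = defaultdict(dict)
    for (a, b), values in scores.items():
        weight = sum(values) / len(values)
        graph[a][b] = weight  # store the edge in both directions
        graph[b][a] = weight
    return dict(graph)


graph = build_usage_graph(judgements)
print(graph["u1"])
```

Each usage becomes a node, and each judged pair becomes an edge whose weight is the aggregated proximity score.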

Both versions of the DUPS dataset can be downloaded from the Files section of this web page.

Please cite this paper if you use any version of the dataset in your work:

Mario Giulianelli, Marco Del Tredici, and Raquel Fernández. 2020. Analysing Lexical Semantic Change with Contextualised Word Representations. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020). Association for Computational Linguistics.

Files

Files (7.0 MB)

Name	MD5	Size
DUPS-WUG.zip	e91f6e4cd5212449e9490deaad7d8ef6	4.7 MB
DUPS.zip	b0b2b68ff01881c56a4a39e39e753339	2.3 MB
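
The MD5 checksums listed for the two archives can be used to confirm that a download is intact. The sketch below is one possible way to do this with Python's standard `hashlib` module; the helper names `md5_of_file` and `verify`, and the assumption that the archives sit in the local working directory, are illustrative only.

```python
import hashlib

# MD5 checksums as listed in the Files table.
EXPECTED_MD5 = {
    "DUPS-WUG.zip": "e91f6e4cd5212449e9490deaad7d8ef6",
    "DUPS.zip": "b0b2b68ff01881c56a4a39e39e753339",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, name):
    """Return True if the file at `path` matches the listed checksum for `name`."""
    return md5_of_file(path) == EXPECTED_MD5[name]
```

For example, `verify("DUPS.zip", "DUPS.zip")` would return `True` for an uncorrupted copy saved under that name in the current directory.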

Additional details

Funding: European Commission, grant 819455: DREAM – Distributed dynamic REpresentations for diAlogue Management

	All versions	This version
Views	1,815	646
Downloads	291	149
Data volume	1.1 GB	760.7 MB
