There is a newer version of the record available.

Published December 15, 2021 | Version 2.0.0
Dataset Open

DWUG EN: Diachronic Word Usage Graphs for English

  • 1. University of Stuttgart
  • 2. University of Cambridge
  • 3. University of Gothenburg
  • 4. King's College London, The Alan Turing Institute

Description

This data collection contains diachronic Word Usage Graphs (WUGs) for English. Find a description of the data format, code to process the data and further datasets on the WUGsite.

See previous versions for additional test sets.

Please find more information on the provided data in the paper referenced below.

Version: 2.0.0, 15.12.2021. Important: extends previous versions with one more annotation round and new clusterings.

Reference

Dominik Schlechtweg, Nina Tahmasebi, Simon Hengchen, Haim Dubossarsky, Barbara McGillivray. 2021. DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages.

Notes

additional annotation round; new clusterings

Files

dwug_en.zip (19.7 MB)
md5:85a4a607c6d7ac94a828bc950d3b4c22
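
The MD5 checksum listed for dwug_en.zip can be used to check that a download is complete and unmodified. The following is a minimal sketch, assuming the archive has been saved as dwug_en.zip in the current directory; the helper names md5_of_file and verify_download are illustrative and not part of the dataset or its accompanying code.

```python
import hashlib
from pathlib import Path

# Checksum as listed on the record page for dwug_en.zip.
EXPECTED_MD5 = "85a4a607c6d7ac94a828bc950d3b4c22"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was saved.
    archive = Path("dwug_en.zip")
    if archive.exists():
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually indicates an interrupted or corrupted download, in which case fetching the archive again is the simplest remedy.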

Additional details

Related works

Continues
Dataset: 10.5281/zenodo.5541274 (DOI)
Is published in
Conference paper: arXiv:2104.08540 (arXiv)
Is supplement to
Dataset: 10.5281/zenodo.5255227 (DOI)
Dataset: 10.5281/zenodo.5090647 (DOI)
Dataset: 10.5281/zenodo.5544198 (DOI)