DWUG EN: Diachronic Word Usage Graphs for English
Creators
- 1. University of Stuttgart
- 2. University of Cambridge
- 3. University of Gothenburg
- 4. King's College London, The Alan Turing Institute
Description
This data collection contains diachronic Word Usage Graphs (WUGs) for English. Find a description of the data format, code to process the data and further datasets on the WUGsite.
See previous versions for additional testsets.
Please find more information on the provided data in the paper referenced below.
Version: 2.0.1, 30.11.2022. Assigns noise uses the cluster label '-1' instead of removing them. Important: Version 2.0.0 extends previous versions with one more annotation round and new clusterings.
Reference
Dominik Schlechtweg, Nina Tahmasebi, Simon Hengchen, Haim Dubossarsky, Barbara McGillivray. 2021. DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.
Notes
Files
dwug_en.zip
Files
(11.5 MB)
Name | Size | Download all |
---|---|---|
md5:80c3f00f3d13e5ad396e8464ebe67bbc
|
11.5 MB | Preview Download |
Additional details
Related works
- Continues
- Dataset: 10.5281/zenodo.5541274 (DOI)
- Is published in
- Conference paper: arXiv:2104.08540 (arXiv)
- Is supplement to
- Dataset: 10.5281/zenodo.5255227 (DOI)
- Dataset: 10.5281/zenodo.5090647 (DOI)
- Dataset: 10.5281/zenodo.5544198 (DOI)