2024-03-28T21:49:39Z
https://zenodo.org/oai2d
oai:zenodo.org:3773250
2021-09-10T15:41:12Z
openaire_data
user-dream
user-illc
user-eu
Giulianelli, Mario
Del Tredici, Marco
Fernández, Raquel
2020-04-28
<p>The DUPS (Diachronic Usage Pair Similarity) dataset contains similarity judgements of English word usage pairs from different time periods, as described in the following paper. Please cite this paper if you use the dataset in your work.</p>
<p>Mario Giulianelli, Marco Del Tredici, and Raquel Fernández. 2020. Analysing Lexical Semantic Change with Contextualised Word Representations. In <em>Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020)</em>. Association for Computational Linguistics.</p>
https://doi.org/10.5281/zenodo.3773250
oai:zenodo.org:3773250
eng
Zenodo
https://zenodo.org/communities/dream
https://zenodo.org/communities/illc
https://zenodo.org/communities/eu
https://doi.org/10.5281/zenodo.3773249
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
linguistics
natural language processing
word similarity judgements
lexical semantic change
DUPS: Diachronic Usage Pair Similarity
info:eu-repo/semantics/other
oai:zenodo.org:10025272
2023-10-20T10:16:20Z
openaire_data
user-dream
Giulianelli, Mario
Baan, Joris
Aziz, Wilker
Fernández, Raquel
Plank, Barbara
2023-10-20
<p>This dataset contains production variability scores as described in the following paper:</p><blockquote><p>Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, and Barbara Plank. 2023. <a href="https://arxiv.org/pdf/2305.11707.pdf"><strong>What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability</strong></a>. In <i>Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. A</i>ssociation for Computational Linguistics.</p></blockquote><p>We make these scores available to facilitate the reproducibility of our experiments, which can be found at <a href="https://github.com/dmg-illc/nlg-uncertainty-probes">https://github.com/dmg-illc/nlg-uncertainty-probes</a>.</p>
https://doi.org/10.5281/zenodo.10025272
oai:zenodo.org:10025272
Zenodo
https://zenodo.org/communities/dream
https://doi.org/10.5281/zenodo.10025271
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
whatsnext-scores
info:eu-repo/semantics/other
oai:zenodo.org:10006413
2023-10-20T10:16:54Z
openaire_data
user-dream
user-eu
Giulianelli, Mario
Wallbridge, Sarenne
Fernández, Raquel
2023-10-20
<h2>AltGen: 1.3M Plausible Alternatives From Neural Text Generators</h2><p>The AltGen dataset contains 1.3 million English texts generated by neural language generators conditioned on contexts from three corpora of acceptability judgements and two corpora of reading times. </p><p>For each corpus, each text generator, and each sampling algorithm,100 generations are sampled—for a total of 1,257,300 generations. Details about the language generators and the corpora are presented in a paper published at EMNLP 2023 (in particular, Section 4). Please cite this paper if you use any version of the dataset in your work:</p><blockquote><p>Mario Giulianelli, Sarenne Wallbridge, and Raquel Fernández. 2023. <strong>Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives</strong>. In <i>Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing</i>. Association for Computational Linguistics.</p></blockquote><p>The files are in jsonl format and include a <i>context_id</i> field, which allows retrieving the relevant entry from the original corpus, and the <i>alternatives</i> field, which contains the language model generations. Please note that the alternatives are not post-processed (see code and footnote 2 in the paper for further details). Filenames are built as follows: <i>DecodingAlgorithm</i>_<i>DecodingParameter</i>-n<i>NumAlternatives</i>-maxlen_<i>MaxGenerationLength</i>-sep_<i>Separator.</i>jsonl.</p>
https://doi.org/10.5281/zenodo.10006413
oai:zenodo.org:10006413
eng
Zenodo
https://zenodo.org/communities/dream
https://zenodo.org/communities/eu
https://doi.org/10.5281/zenodo.10006412
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
AltGen: 1.3M Plausible Alternatives From Neural Text Generators
info:eu-repo/semantics/other
oai:zenodo.org:5500223
2021-09-10T15:41:13Z
openaire_data
user-dream
user-illc
user-eu
Giulianelli, Mario
Del Tredici, Marco
Fernández, Raquel
2020-04-28
<p>The DUPS (Diachronic Usage Pair Similarity) dataset contains similarity judgements of English word usage pairs from different time periods, as described in the paper below. </p>
<p>The WUG version of the DUPS dataset (version 2.0.0) contains diachronic Word Usage Graphs constructed from the similarity judgements of English word usage pairs contained in DUPS. In a word usage graph, the usages of a word are represented as nodes connected by edges weighted according to (human-annotated) semantic proximity. A description of the data format as well as the code used to generate the graphs from DUPS can be found at <a href="https://www.ims.uni-stuttgart.de/data/wugs">https://www.ims.uni-stuttgart.de/data/wugs</a>.</p>
<p>Both versions of the DUPS dataset can be downloaded from the Files section of this web page.</p>
<p>Please cite this paper if you use any version of the dataset in your work:</p>
<blockquote>
<p>Mario Giulianelli, Marco Del Tredici, and Raquel Fernández. 2020. <a href="https://aclanthology.org/2020.acl-main.365/">Analysing Lexical Semantic Change with Contextualised Word Representations</a>. In <em>Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020)</em>. Association for Computational Linguistics.</p>
</blockquote>
<p> </p>
https://doi.org/10.5281/zenodo.5500223
oai:zenodo.org:5500223
eng
Zenodo
https://zenodo.org/communities/dream
https://zenodo.org/communities/illc
https://zenodo.org/communities/eu
https://doi.org/10.5281/zenodo.3773249
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
linguistics
natural language processing
word similarity judgements
lexical semantic change
DUPS: Diachronic Usage Pair Similarity
info:eu-repo/semantics/other