Studying Taxonomy Enrichment on Diachronic WordNet Versions

10.5281/zenodo.4279821 https://zenodo.org/records/4279821 oai:zenodo.org:4279821 Irina Nikishina Irina Nikishina 0000-0003-4910-8568 Skolkovo Institute of Science and Technology, Moscow, Russia Alexander Panchenko Alexander Panchenko Skolkovo Institute of Science and Technology, Moscow, Russia Varvara Logacheva Varvara Logacheva Skolkovo Institute of Science and Technology, Moscow, Russia Natalia Loukachevitch Natalia Loukachevitch Research Computing Center, Lomonosov Moscow State University, Moscow, Russia Studying Taxonomy Enrichment on Diachronic WordNet Versions Zenodo 2020 RuWordNet, wordnets 2020-11-12 2020-11-19 rus 10.5281/zenodo.4270477 Creative Commons Attribution 4.0 International We choose two versions of WordNet and then select words which appear only in a newer version. For each word, we get its hypernyms from the newer WordNet version and consider them as gold standard hypernyms. We add words to the dataset if only their hypernyms appear in both snippets. We do not consider adjectives and adverbs, because they often introduce abstract concepts and are difficult to interpret by context. Previous dataset (RUSSE'2020) does not include short words (<4 symbols), diminutives, named entities and other constraints described in the shared task paper. We remove those constraints and present a non-restricted Russian dataset and a symmetrical English dataset from WordNet database.