Dataset Open Access

Studying Taxonomy Enrichment on Diachronic WordNet Versions

Irina Nikishina; Alexander Panchenko; Varvara Logacheva; Natalia Loukachevitch


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">rus</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">RuWordNet, wordnets</subfield>
  </datafield>
  <controlfield tag="005">20201119002705.0</controlfield>
  <controlfield tag="001">4279821</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Skolkovo Institute of Science and Technology, Moscow, Russia</subfield>
    <subfield code="a">Alexander Panchenko</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Skolkovo Institute of Science and Technology, Moscow, Russia</subfield>
    <subfield code="a">Varvara Logacheva</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Research Computing Center, Lomonosov Moscow State University, Moscow, Russia</subfield>
    <subfield code="a">Natalia Loukachevitch</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1248958</subfield>
    <subfield code="z">md5:cc053dc6fd255044c0085c0c52ed8086</subfield>
    <subfield code="u">https://zenodo.org/record/4279821/files/datasets.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2020-11-12</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:4279821</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Skolkovo Institute of Science and Technology, Moscow, Russia</subfield>
    <subfield code="0">(orcid)0000-0003-4910-8568</subfield>
    <subfield code="a">Irina Nikishina</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Studying Taxonomy Enrichment on Diachronic WordNet Versions</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;We choose two versions of WordNet and then select words which appear only in a newer version. For each word, we get its hypernyms from the newer WordNet version and consider them as gold standard hypernyms. We add words to the dataset if only their hypernyms appear in both snippets. We do not consider adjectives and adverbs, because they often introduce abstract concepts and are difficult to interpret by context.&lt;/p&gt;

&lt;p&gt;Previous dataset (RUSSE&amp;#39;2020) does not include short words (&amp;lt;4&amp;nbsp;symbols), diminutives, named entities and other constraints described in the shared task paper. We remove those constraints and present a non-restricted Russian dataset and a symmetrical English dataset from WordNet database.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.4270477</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4279821</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
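
The selection procedure described in the record's abstract (field 520) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the `Taxonomy` layout, the function name `build_enrichment_dataset`, the part-of-speech labels, and the toy entries are all hypothetical, and "hypernyms appear in both versions" is read here as "every hypernym is present in both the older and the newer version".

```python
from typing import Dict, FrozenSet, Set, Tuple

# Hypothetical layout: word -> (part of speech, set of direct hypernyms).
Taxonomy = Dict[str, Tuple[str, Set[str]]]


def build_enrichment_dataset(
    old: Taxonomy,
    new: Taxonomy,
    skipped_pos: FrozenSet[str] = frozenset({"adj", "adv"}),
) -> Dict[str, Set[str]]:
    """Map each newly added word to its gold-standard hypernyms."""
    dataset: Dict[str, Set[str]] = {}
    for word, (pos, hypernyms) in new.items():
        if word in old:
            continue  # keep only words that appear in the newer version alone
        if pos in skipped_pos:
            continue  # adjectives and adverbs are not considered
        # Assumption: a word qualifies only if all of its hypernyms
        # exist in both the older and the newer version.
        if hypernyms and all(h in old and h in new for h in hypernyms):
            dataset[word] = set(hypernyms)
    return dataset


# Toy data, invented purely for illustration.
old_version: Taxonomy = {
    "animal": ("noun", set()),
    "dog": ("noun", {"animal"}),
}
new_version: Taxonomy = {
    "animal": ("noun", set()),
    "dog": ("noun", {"animal"}),
    "puppy": ("noun", {"dog"}),        # new word, hypernym known to both versions
    "device": ("noun", {"artifact"}),  # new word, hypernym missing from both
    "gadget": ("noun", {"device"}),    # new word, hypernym absent from the older version
    "quick": ("adj", {"animal"}),      # adjective, skipped
}

result = build_enrichment_dataset(old_version, new_version)
```

On the toy input only `puppy` is kept, with `dog` as its gold-standard hypernym; the other new entries are dropped either because a hypernym is missing from the older version or because of their part of speech.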