163857
doi
10.5281/zenodo.163857
oai:zenodo.org:163857
Ustalov, Dmitry
Arefyev, Nikolay
Paperno, Denis
Konstantinova, Natalia
Loukachevitch, Natalia
Biemann, Chris
Human and Machine Judgements for Russian Semantic Relatedness
Panchenko, Alexander
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
semantic relatedness
semantic similarity
distributional semantics
word2vec
russian language
evaluation
<p>Semantic relatedness of terms represents similarity of meaning by a numerical score. On the one hand, humans easily make judgements about semantic relatedness. On the other hand, this kind of information is useful in language processing systems. While semantic relatedness has been extensively studied for English using numerous language resources, such as associative norms, human judgements and datasets generated from lexical databases, no evaluation resources of this kind have been available for Russian to date. Our contribution addresses this problem. We present five language resources of different scale and purpose for Russian semantic relatedness, each being a list of triples (word<sub>i</sub>, word<sub>j</sub>, similarity<sub>ij</sub>). Four of them are designed for evaluation of systems for computing semantic relatedness, complementing each other in terms of the semantic relation type they represent. These benchmarks were used to organise a shared task on Russian semantic relatedness, which attracted 19 teams. We use one of the best approaches identified in this competition to generate the fifth high-coverage resource, the first open distributional thesaurus of Russian. Multiple evaluations of this thesaurus, including a large-scale crowdsourcing study involving native speakers, indicate its high accuracy.</p>
<p>For more details, see:</p>
<ul>
<li>The web page of the RUSSE evaluation campaign: http://russe.nlpub.ru/downloads</li>
<li>The original publication "Panchenko A., Ustalov D., Arefyev N., Paperno D., Konstantinova N., Loukachevitch N. and Biemann C. Human and Machine Judgements about Russian Semantic Relatedness. In Proceedings of the 5th Conference on Analysis of Images, Social Networks and Texts (AIST'2016). Communications in Computer and Information Science (CCIS). Springer-Verlag Berlin Heidelberg": https://www.lt.informatik.tu-darmstadt.de/fileadmin/user_upload/Group_LangTech/publications/aist_2016_hmj.pdf</li>
</ul>
Zenodo
2016-11-26
info:eu-repo/semantics/other
657370
1579893910.494994
4109127
md5:826bc67f4a774a9f9ef167b784894ba6
https://zenodo.org/records/163857/files/rt.csv
350485
md5:d065c9dae0e6ee06f74f0e4e582b7550
https://zenodo.org/records/163857/files/rt-test.csv
9532
md5:c6d3437cd205fbd53c622bfcf99ee4cf
https://zenodo.org/records/163857/files/hj-wordsim353-relatedness.csv
1091
md5:1ed4b7d36b485b3e8bbc151140e6b50e
https://zenodo.org/records/163857/files/hj-mc.csv
15205
md5:2ccd92a09182b581423bb86186e9b394
https://zenodo.org/records/163857/files/hj.csv
1884528636
md5:530d03982f35552c762a64a0a7f8417b
https://zenodo.org/records/163857/files/all.norm-sz500-w10-cb0-it3-min5.w2v.vocab_1100000_similar250.gz
7391
md5:365c7be0528a28328f252a9c57ef3ed1
https://zenodo.org/records/163857/files/hj-wordsim353-similarity.csv
594231
md5:7cf92fd365d18105cbdb3226957e7239
https://zenodo.org/records/163857/files/mj.csv
2379
md5:fed9990e1e4b67cd415ae4862dde4e95
https://zenodo.org/records/163857/files/hj-rg.csv
public
isVersionOf
doi