Dataset Open Access
Hill, Nathan W.; Garrett, Edward
{
  "publisher": "Zenodo",
  "DOI": "10.5281/zenodo.574876",
  "title": "A part-of-speech (POS) lexicon of Classical Tibetan for NLP",
  "issued": { "date-parts": [ [ 2017, 5, 11 ] ] },
  "abstract": "<p>This part-of-speech (POS) lexicon of Classical Tibetan was prepared in the course of the research project 'Tibetan in Digital Communication' (2012-2015) hosted at SOAS, University of London and funded by the UK's Arts and Humanities Research Council (grant code: AH/J00152X/1). The data for verbs comes from a digitized version of <em>A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition</em> (Munich: Bayerische Akademie der Wissenschaften, 2010) by Nathan W. Hill. Otherwise data comes from the manually part-of-speech tagged training data produced by the corpus and a few lexical items specifically added by hand to improve rule based tagging.</p>",
  "author": [
    { "family": "Hill, Nathan W." },
    { "family": "Garrett, Edward" }
  ],
  "note": "funded by the UK's Arts and Humanities Research Council (grant code: AH/J00152X/1)",
  "type": "dataset",
  "id": "574876"
}
Metric | All versions | This version |
---|---|---|
Views | 278 | 278 |
Downloads | 93 | 93 |
Data volume | 8.2 MB | 8.2 MB |
Unique views | 258 | 258 |
Unique downloads | 92 | 92 |