Published May 11, 2017
| Version v1
Dataset
Open
A part-of-speech (POS) lexicon of Classical Tibetan for NLP
Creators
- 1. SOAS, Univeristy of London
- 2. SOAS, University of London
Description
This part-of-speech (POS) lexicon of Classical Tibetan was prepared in the course of the research project 'Tibetan in Digital Communication' (2012-2015) hosted at SOAS, University of London and funded by the UK's Arts and Humanities Research Council (grant code: AH/J00152X/1). The data for verbs comes from a digitized version of A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition (Munich: Bayerische Akademie der Wissenschaften, 2010) by Nathan W. Hill. Otherwise data comes from the manually part-of-speech tagged training data produced by the corpus and a few lexical items specifically added by hand to improve rule based tagging.
Notes
Files
Lexicons.zip
Files
(88.1 kB)
Name | Size | Download all |
---|---|---|
md5:021d0e1089f91ef7cc65d42dbf21518c
|
88.1 kB | Preview Download |