Published April 29, 2021
| Version v1.0
Dataset
Open
Modern Tibetan corpus annotated for verb-argument dependency relations
Description
This is a small hand-annotated partial treebank of Modern Tibetan, primarily in CoNLL-U format. Some texts were POS-tagged by machine, and then dependency relations between verbs and their arguments were added by hand. Other texts include only dependency relations and relevant POS-tags. A number of the texts have English translations which have been manually aligned to the Tibetan text.
This work was created as part of the AHRC-funded project Lexicography in Motion (PI Ulrich Pagel, 2017-2021).
Notes
Files
tibetan-nlp/modern-tibetan-corpus-v1.0.zip
Files
(17.3 MB)
Name | Size | Download all |
---|---|---|
md5:db2d0ea00ccc31bf66177b92601ab1ed
|
17.3 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/tibetan-nlp/modern-tibetan-corpus/tree/v1.0 (URL)