Published April 29, 2021 | Version v1.0
Dataset Open

Modern Tibetan corpus annotated for verb-argument dependency relations

Description

This is a small, hand-annotated partial treebank of Modern Tibetan, primarily in CoNLL-U format. Some texts were POS-tagged automatically, after which dependency relations between verbs and their arguments were added by hand. Other texts contain only the dependency relations and the relevant POS tags. Several of the texts have English translations that have been manually aligned to the Tibetan text.
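As a rough illustration of how verb-argument relations might be read out of a CoNLL-U file, here is a minimal Python sketch. It assumes the standard ten tab-separated CoNLL-U columns and assumes that verbs carry the UPOS tag VERB; the tag set and relation labels actually used in this corpus may differ. The function names and the toy sample (placeholder forms N1, N2, V1, P1 and the labels nsubj/obj) are invented for illustration and are not taken from the corpus. Because the treebank is partial, tokens whose HEAD column is "_" are simply skipped.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

# Standard CoNLL-U column names, in order.
CONLLU_FIELDS = ("id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc")

Token = Dict[str, str]


def read_sentences(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield each sentence as a list of token dicts; blank lines separate sentences."""
    sentence: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment or metadata line
        cols = line.split("\t")
        if len(cols) != len(CONLLU_FIELDS):
            continue  # skip lines that are not well-formed token rows
        sentence.append(dict(zip(CONLLU_FIELDS, cols)))
    if sentence:
        yield sentence


def verb_arguments(sentence: List[Token],
                   verb_tag: str = "VERB") -> List[Tuple[str, str, str]]:
    """Return (verb form, relation, dependent form) for tokens headed by a verb.

    verb_tag is an assumption; adjust it to the tag set used in the files.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    triples = []
    for tok in sentence:
        head = by_id.get(tok["head"])  # None for "0" (root) or "_" (unannotated)
        if head is not None and head["upos"] == verb_tag:
            triples.append((head["form"], tok["deprel"], tok["form"]))
    return triples


# Toy sample with placeholder forms (hypothetical, not corpus data).
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "N1", "_", "NOUN", "_", "_", "3", "nsubj", "_", "_"),
    ("2", "N2", "_", "NOUN", "_", "_", "3", "obj", "_", "_"),
    ("3", "V1", "_", "VERB", "_", "_", "0", "root", "_", "_"),
    ("4", "P1", "_", "PUNCT", "_", "_", "_", "_", "_", "_"),
])

if __name__ == "__main__":
    for sent in read_sentences(SAMPLE.splitlines()):
        print(verb_arguments(sent))
```

To apply this to the corpus, pass an open file handle (opened with encoding="utf-8") to read_sentences in place of the toy sample.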

This work was created as part of the AHRC-funded project Lexicography in Motion (PI Ulrich Pagel, 2017-2021).

Notes

Funded by the UK's Arts and Humanities Research Council (grant code: AH/P004644/1)

Files

tibetan-nlp/modern-tibetan-corpus-v1.0.zip

Size: 17.3 MB
md5:db2d0ea00ccc31bf66177b92601ab1ed
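
The MD5 checksum listed above can be used to check that a download is intact. The following is a minimal sketch using Python's standard hashlib module; the local file path is an assumption about where the archive was saved, and the helper name md5_of_file is hypothetical.

```python
import hashlib

# Checksum as published on this record.
EXPECTED_MD5 = "db2d0ea00ccc31bf66177b92601ab1ed"


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumed local path; change it to wherever the archive was saved.
    local_path = "modern-tibetan-corpus-v1.0.zip"
    if md5_of_file(local_path) == EXPECTED_MD5:
        print("checksum matches")
    else:
        print("checksum mismatch: the download may be incomplete or corrupted")
```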

Additional details