MRK Alignment Scoring Guidelines
Description
This document is part of the supplementary material for Felbur, Meelen & Vierthaler (2022). It describes manually aligned passages in Classical Tibetan and Classical Chinese texts, with examples, and gives instructions for identifying, entering, and scoring alignments. The alignment engine developed for the Open Philology project will automatically detect bilingual aligned passages within the MRK or other text collections, using training data created with this manual alignment input and scoring tool. Creating such training data is important for increasing the accuracy of the automated system. The training data consist of manually aligned passages from the MRK, each with a score indicating the strength of the alignment.
Felbur, Rafal, Marieke Meelen & Paul Vierthaler (2022), 'Crosslinguistic Semantic Textual Similarity of Buddhist Chinese and Classical Tibetan' in Journal of Open Humanities Data.
We gratefully acknowledge funding through the Open Philology project. This project (running 2018–2022) is funded by the European Research Council (ERC) under the Horizon 2020 program (Advanced Grant agreement No 741884). It is based at the Leiden University Institute for Area Studies.
Files
Name | Size | MD5 |
---|---|---|
AlignmentScoringManual_v2May2020.pdf | 17.4 MB | f168f476d0e3dfc4e5389a41ddee1c7a |
Additional details
References
- Felbur, Rafal, Marieke Meelen & Paul Vierthaler (2022), 'Crosslinguistic Semantic Textual Similarity of Buddhist Chinese and Classical Tibetan' in Journal of Open Humanities Data.