Validation and Correction Guidelines for the Multilingual Annotation Projection of Gold Standard Corpora
Authors/Creators
Description
These guidelines explain how to correct and validate multilingual corpora created via annotation projection techniques. They outline the goal of the task, the framework used, the necessary actions, and the specific rules to be considered. The ultimate goal is to leverage high-quality datasets written in one language to train artificial intelligence systems in as many languages as needed.
They have been used for multiple datasets, including but not limited to:
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Contact
If you have any questions or suggestions, please contact us at:
- Salvador Lima-López (<salvador [dot] limalopez [at] gmail [dot] com>)
- Martin Krallinger (<krallinger [dot] martin [at] gmail [dot] com>)
Files
| Name | Size | Checksum |
|---|---|---|
| Annotation_Transfer_Validation_Guidelines.pdf | 1.2 MB | md5:a7dd05a0e61e55013a92c6fdb93e09dc |