Published August 1, 2024 | Version v1
Other Open

Validation and Correction Guidelines for the Multilingual Annotation Projection of Gold Standard Corpora

  • 1. Barcelona Supercomputing Center

Description

These guidelines explain how to correct and validate multilingual corpora created via annotation projection techniques. They outline the goal of the task, the framework used, the necessary actions, and the specific rules to follow. The ultimate goal is to leverage high-quality datasets written in one language to train artificial intelligence systems in as many languages as needed.

They have been used for multiple datasets, including but not limited to:

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Contact

If you have any questions or suggestions, please contact us at:

- Salvador Lima-López (<salvador [dot] limalopez [at] gmail [dot] com>)
- Martin Krallinger (<krallinger [dot] martin [at] gmail [dot] com>)

Files

Annotation_Transfer_Validation_Guidelines.pdf

Size: 1.2 MB
md5:a7dd05a0e61e55013a92c6fdb93e09dc