MEDDOPROF guidelines
Creators
- 1. Barcelona Supercomputing Center
- 2. D-REAL
Description
The MEDDOPROF Shared Task tackles the detection of occupations and employment statuses in clinical cases in Spanish from different specialties. Systems capable of automatically processing clinical texts are of interest to the medical community, social workers, researchers, the pharmaceutical industry, computer engineers, AI developers, policy makers, citizen’s associations and patients. Additionally, other NLP tasks (such as anonymization) can also benefit from this type of data.
These guidelines describe the process followed by the clinical and linguist experts who manually annotated the MEDDOPROF corpus, and a series of rules for annotating occupations in clinical texts.
Annotation quality:
We have performed a consistency analysis of the corpus. ~10% of the documents have been annotated by an internal annotator as well as by the linguist experts following these annotation guidelines. The average Inter-Annotator Agreement (pairwise agreement) after multiple rounds is around 0.9.
MEDDOPROF is part of the IberLEF 2021 workshop, which is co-located with the SEPLN 2021 conference. For further information, please visit https://temu.bsc.es/meddoprof/ or email us at encargo-pln-life@bsc.es
MEDDOPROF is promoted by the Plan de Impulso de las Tecnologías del Lenguaje de la Agenda Digital (Plan TL).
Resources:
- Web
Files
MEDDOPROF_guias.pdf
Files
(557.1 kB)
Name | Size | Download all |
---|---|---|
md5:c601ed509ee93abd456b012e7a51d0e5
|
557.1 kB | Preview Download |