There is a newer version of the record available.

Published April 15, 2021 | Version v1
Other Open

MEDDOPROF guidelines

Description

The MEDDOPROF Shared Task tackles the detection of occupations and employment statuses in clinical cases in Spanish from different specialties. Systems capable of automatically processing clinical texts are of interest to the medical community, social workers, researchers, the pharmaceutical industry, computer engineers, AI developers, policy makers, citizen’s associations and patients. Additionally, other NLP tasks (such as anonymization) can also benefit from this type of data.

These guidelines describe the process followed by the clinical and linguist experts who manually annotated the MEDDOPROF corpus, and a series of rules for annotating occupations in clinical texts.

 

Annotation quality:

We have performed a consistency analysis of the corpus. ~10% of the documents have been annotated by an internal annotator as well as by the linguist experts following these annotation guidelines. The average Inter-Annotator Agreement (pairwise agreement) after multiple rounds is around 0.9.

 

MEDDOPROF is part of the IberLEF 2021 workshop, which is co-located with the SEPLN 2021 conference. For further information, please visit https://temu.bsc.es/meddoprof/ or email us at encargo-pln-life@bsc.es

MEDDOPROF is promoted by the Plan de Impulso de las Tecnologías del Lenguaje de la Agenda Digital (Plan TL).

 

Resources:

- Web

- Training Data

Files

MEDDOPROF_guias.pdf

Files (557.1 kB)

Name Size Download all
md5:c601ed509ee93abd456b012e7a51d0e5
557.1 kB Preview Download