Published February 9, 2021 | Version 1.0
Dataset Open

AI4PROFHEALTH Profession and occupations gazetteer (ProfNER & MEDDOPROF - occupations, professions and working status terms with their associated codes)

  • 1. Barcelona Supercomputing Center

Description

Gazetteer of occupations extracted from a set of terminologies (DeCS, ESCO, SNOMED CT and WordNet). The gazetteer also includes a list of occupations detected by Stanford CoreNLP in a large collection of social media profiles. Every term is accompanied by its lemma, which was extracted using the PlanTL Med_Tagger: https://github.com/PlanTL-SANIDAD/SPACCC_POS-TAGGER
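Because each term is paired with its lemma, a text can be matched against either the surface form or the lemma. The sketch below illustrates that idea with a small in-memory list. The entries in `ENTRIES` and the function names `build_index` and `find_occupations` are hypothetical, and the actual file layout inside the archive is not described here, so loading the real gazetteer would need to be adapted to its columns.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical (term, lemma) pairs for illustration only; these are NOT
# taken from the gazetteer, whose real entries and file layout may differ.
ENTRIES: List[Tuple[str, str]] = [
    ("enfermeras", "enfermero"),
    ("médico", "médico"),
    ("profesores", "profesor"),
]


def build_index(entries: Iterable[Tuple[str, str]]) -> Dict[str, str]:
    """Map every lowercased term and lemma to its lemma."""
    index: Dict[str, str] = {}
    for term, lemma in entries:
        index.setdefault(term.lower(), lemma)
        index.setdefault(lemma.lower(), lemma)
    return index


def find_occupations(text: str, index: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (token, lemma) pairs for single-token matches in the text."""
    hits: List[Tuple[str, str]] = []
    for raw in text.lower().split():
        token = raw.strip(".,;:!?¿¡()\"'")  # drop surrounding punctuation
        if token in index:
            hits.append((token, index[token]))
    return hits


if __name__ == "__main__":
    idx = build_index(ENTRIES)
    print(find_occupations("Las enfermeras y los profesores.", idx))
```

This sketch only matches single whitespace-separated tokens. Multi-word occupation terms would need n-gram or phrase matching on top of the same index.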

 

Please cite:

Miranda-Escalada, A., Farré-Maduell, E., Lima-López, S., Gascó, L., Briva-Iglesias, V., Agüero-Torales, M., & Krallinger, M. (2021, June). The ProfNER shared task on automatic recognition of occupation mentions in social media: systems, evaluation, guidelines, embeddings and corpora. In Proceedings of the Sixth Social Media Mining for Health (#SMM4H) Workshop and Shared Task (pp. 13-20).

@inproceedings{miranda2021profner,
  title={The profner shared task on automatic recognition of occupation mentions in social media: systems, evaluation, guidelines, embeddings and corpora},
  author={Miranda-Escalada, Antonio and Farr{\'e}-Maduell, Eul{\`a}lia and Lima-L{\'o}pez, Salvador and Gasc{\'o}, Luis and Briva-Iglesias, Vicent and Ag{\"u}ero-Torales, Marvin and Krallinger, Martin},
  booktitle={Proceedings of the Sixth Social Media Mining for Health (\# SMM4H) Workshop and Shared Task},
  pages={13--20},
  year={2021}
}

 

For further information, please visit https://temu.bsc.es/smm4h-spanish/ or email us at encargo-pln-life@bsc.es
 

Resources:

Notes

Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).

Files

occupations-gazetteer.zip

Files (269.8 kB)

Name: occupations-gazetteer.zip
Size: 269.8 kB
md5: 8b05571017df897155e6715dbce1085e
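The md5 value listed for the archive can be used to check that a download is intact. The following is a minimal sketch using Python's standard `hashlib`. The helper names `md5_of_file` and `verify_download` are illustrative, and the local path is an assumption about where the file was saved.

```python
import hashlib

# Checksum as listed on this record for occupations-gazetteer.zip.
EXPECTED_MD5 = "8b05571017df897155e6715dbce1085e"


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path: str, expected: str = EXPECTED_MD5) -> bool:
    """Return True when the file's md5 matches the expected checksum."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path; adjust to wherever the archive was saved.
    archive = "occupations-gazetteer.zip"
    try:
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    except FileNotFoundError:
        print(f"{archive} not found")
```

A mismatch usually indicates a truncated or corrupted download, in which case fetching the archive again is the simplest remedy.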