Dataset Open Access


Ander Intxaurrondo

Martin Krallinger

[Plan TL/medicine/lexical/terminological resource] The Spanish Medical Abbreviation DataBase.

The database is created automatically by detecting abbreviations and their potential definitions explicitly mentioned in the same sentence. These abbreviations are extracted from the metadata of different biomedical publications written in Spanish, which contain the titles and abstracts. The sources of these publications are SciELO, IBECS and Pubmed. The chosen schema is Dublin Core. We use the official ones from SciELO, and customized adaptations of the XML files to Dublin Core from IBECS and Pubmed metadata. The objective is to create a semantic inventory of interest for the resolution of definitions of abbreviations.


Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).
Files (22.2 MB)
Name Size
22.2 MB Download
  • Intxaurrondo A, Krallinger M. CNIO at BARR IberEval 2017: Exploring Three Biomedical Abbreviation Identifiers for Spanish Biomedical Publications. In IberEval@ SEPLN 2017 (pp. 278-285).

All versions This version
Views 494434
Downloads 132128
Data volume 2.9 GB2.8 GB
Unique views 394373
Unique downloads 128125


Cite as