Dataset Open Access


Ander Intxaurrondo

Martin Krallinger

[Plan TL/medicine/lexical/terminological resource] The Spanish Medical Abbreviation DataBase.

The database is built automatically by detecting abbreviations and their candidate definitions when both are explicitly mentioned in the same sentence. The abbreviations are extracted from the metadata (titles and abstracts) of biomedical publications written in Spanish. The publications come from SciELO, IBECS and PubMed. The metadata follow the Dublin Core schema: we use the official Dublin Core files from SciELO, and custom conversions of the IBECS and PubMed XML files to Dublin Core. The objective is to create a semantic inventory that helps resolve abbreviations to their definitions.
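
To illustrate what same-sentence detection of an abbreviation and its candidate definition can look like, here is a minimal Python sketch. It is not the method used to build this database. It is a simplified initial-letter heuristic, and the names `SHORT_FORM`, `STOPWORDS`, `candidate_definition` and `find_pairs`, as well as the stopword list, are assumptions made for illustration only.

```python
import re

# Assumed pattern: a parenthesised token of 2 to 10 characters that starts
# with a letter, for example "(TAC)".
SHORT_FORM = re.compile(r"\(([^\W\d_][\w-]{1,9})\)")

# Small illustrative stopword list; a real system would need a fuller one.
STOPWORDS = {"de", "del", "la", "el", "los", "las", "y", "en"}


def candidate_definition(words, short):
    """Walk backwards over the words preceding the abbreviation, matching
    each abbreviation letter against a word initial. Stopwords whose initial
    does not match are skipped. Returns the matched phrase or None."""
    letters = short.lower()
    i = len(letters) - 1
    j = len(words) - 1
    while i >= 0 and j >= 0:
        word = words[j].lower()
        if word[0] == letters[i]:
            i -= 1
        elif word not in STOPWORDS:
            return None
        j -= 1
    if i >= 0:
        return None  # ran out of words before matching every letter
    return " ".join(words[j + 1:])


def find_pairs(sentence):
    """Return (abbreviation, candidate definition) pairs found in a sentence."""
    pairs = []
    for match in SHORT_FORM.finditer(sentence):
        short = match.group(1)
        words = sentence[:match.start()].split()
        definition = candidate_definition(words, short)
        if definition is not None:
            pairs.append((short, definition))
    return pairs


if __name__ == "__main__":
    text = "El virus de la inmunodeficiencia humana (VIH) afecta al sistema inmune."
    print(find_pairs(text))
```

The sketch only handles abbreviations written in parentheses directly after their definition and formed from word initials. Abbreviations built from inner letters, or definitions that follow the abbreviation, would need additional rules.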


Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).
Files (22.2 MB)
  • Intxaurrondo A, Krallinger M. CNIO at BARR IberEval 2017: Exploring Three Biomedical Abbreviation Identifiers for Spanish Biomedical Publications. In IberEval@SEPLN 2017 (pp. 278-285).

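Views: 634 (all versions), 571 (this version)
Downloads: 158 (all versions), 154 (this version)
Data volume: 3.5 GB (all versions), 3.4 GB (this version)
Unique views: 510 (all versions), 487 (this version)
Unique downloads: 154 (all versions), 151 (this version)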

