Published November 20, 2018 | Version 2018-12-01
Dataset Open


  • 1. Barcelona Supercomputing Center



[Plan TL/medicine/lexical/terminological resource] The Spanish Medical Abbreviation DataBase.

The database is created automatically by detecting abbreviations and their potential definitions explicitly mentioned in the same sentence. These abbreviations are extracted from the metadata of different biomedical publications written in Spanish, which contain the titles and abstracts. The sources of these publications are SciELO, IBECS and Pubmed. The chosen schema is Dublin Core. We use the official ones from SciELO, and customized adaptations of the XML files to Dublin Core from IBECS and Pubmed metadata. The objective is to create a semantic inventory of interest for the resolution of definitions of abbreviations.


Copyright (c) 2018 Secretaría de Estado para el Avance Digital (SEAD)


Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).


Files (22.2 MB)

Name Size Download all
22.2 MB Preview Download

Additional details

Related works


  • Intxaurrondo A, Krallinger M. CNIO at BARR IberEval 2017: Exploring Three Biomedical Abbreviation Identifiers for Spanish Biomedical Publications. In IberEval@ SEPLN 2017 (pp. 278-285).