Dataset Open Access

CodiEsp codes: list of valid CIE10 codes for the CodiEsp task

Miranda-Escalada, Antonio; Krallinger, Martin

Please cite if you use this dataset:

Antonio Miranda-Escalada, Aitor Gonzalez-Agirre, Jordi Armengol-Estapé and Martin Krallinger. Overview of automatic clinical coding: annotations, guidelines, and solutions for non-English clinical cases at CodiEsp track of CLEF eHealth 2020. In CLEF (Working Notes). 2020

@inproceedings{miranda2020overview,
  title={Overview of automatic clinical coding: annotations, guidelines, and solutions for non-english clinical cases at codiesp track of CLEF eHealth 2020},
  author={Miranda-Escalada, Antonio and Gonzalez-Agirre, Aitor and Armengol-Estap{\'e}, Jordi and Krallinger, Martin},
  booktitle={Working Notes of Conference and Labs of the Evaluation (CLEF) Forum. CEUR Workshop Proceedings},
  year={2020}
}

 

This compressed folder contains two files:

 + codiesp-D_codes.tsv: list of CIE10-Diagnósticos terms (2018 version) with their description in Spanish and in English.
 + codiesp-P_codes.tsv: list of CIE10-Procedimiento terms (2018 version) with their description in Spanish and in English. In addition, the list also contains the codes until the 4th axis, which are also used in the CodiEsp-P track due to annotation reasons.

A limited number of codes do not have an English description because they were removed from the English version but maintained in the Spanish version of the terminology.

 

Format: 
Tab-separated files with 3 columns
code    es-description    en-description

 

Spanish to English description mapping:
For CodiEsp-D, the mapping to the English description was done through the files in the National Center for Health Statistics webpage: https://www.cdc.gov/nchs/icd/icd10cm.htm
Specifically, the file used was: ftp://ftp.cdc.gov/pub/Health_Statistics/NCHS/Publications/ICD10CM/2018/2018-ICD-10-CM-Codes-File.zip/icd10cm_codes_2018.txt

For CodiEsp-P, the mapping to the English description was done through the files in the Centers for Medicare Services webpage: https://www.cms.gov/Medicare/Coding/ICD10/2018-ICD-10-PCS-and-GEMs
Specifically, the file used was: 2018_icd10pcs_codes_file.zip/icd10pcs_codes_2018.txt

Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).
Files (2.0 MB)
Name Size
codiesp_codes.zip
md5:531d29c58447e86820517a5a0c437a2d
2.0 MB Download
722
173
views
downloads
All versions This version
Views 722311
Downloads 17371
Data volume 329.1 MB142.2 MB
Unique views 546256
Unique downloads 15466

Share

Cite as