Published January 31, 2020 | Version 1.1
Dataset Open

CodiEsp codes: list of valid CIE10 codes for the CodiEsp task

  • 1. Barcelona Supercomputing Center

Description

Please cite if you use this dataset:

Antonio Miranda-Escalada, Aitor Gonzalez-Agirre, Jordi Armengol-Estapé and Martin Krallinger. Overview of automatic clinical coding: annotations, guidelines, and solutions for non-English clinical cases at CodiEsp track of CLEF eHealth 2020. In CLEF (Working Notes). 2020

@inproceedings{miranda2020overview,
  title={Overview of automatic clinical coding: annotations, guidelines, and solutions for non-english clinical cases at codiesp track of CLEF eHealth 2020},
  author={Miranda-Escalada, Antonio and Gonzalez-Agirre, Aitor and Armengol-Estap{\'e}, Jordi and Krallinger, Martin},
  booktitle={Working Notes of Conference and Labs of the Evaluation (CLEF) Forum. CEUR Workshop Proceedings},
  year={2020}
}

 

This compressed folder contains two files:

 + codiesp-D_codes.tsv: list of CIE10-Diagnósticos terms (2018 version) with their description in Spanish and in English.
 + codiesp-P_codes.tsv: list of CIE10-Procedimiento terms (2018 version) with their description in Spanish and in English. In addition, the list also contains the codes until the 4th axis, which are also used in the CodiEsp-P track due to annotation reasons.

A limited number of codes do not have an English description because they were removed from the English version but maintained in the Spanish version of the terminology.

 

Format: 
Tab-separated files with 3 columns
code    es-description    en-description

 

Spanish to English description mapping:
For CodiEsp-D, the mapping to the English description was done through the files in the National Center for Health Statistics webpage: https://www.cdc.gov/nchs/icd/icd10cm.htm
Specifically, the file used was: ftp://ftp.cdc.gov/pub/Health_Statistics/NCHS/Publications/ICD10CM/2018/2018-ICD-10-CM-Codes-File.zip/icd10cm_codes_2018.txt

For CodiEsp-P, the mapping to the English description was done through the files in the Centers for Medicare Services webpage: https://www.cms.gov/Medicare/Coding/ICD10/2018-ICD-10-PCS-and-GEMs
Specifically, the file used was: 2018_icd10pcs_codes_file.zip/icd10pcs_codes_2018.txt

Notes

Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).

Files

codiesp_codes.zip

Files (2.0 MB)

Name Size Download all
md5:531d29c58447e86820517a5a0c437a2d
2.0 MB Preview Download