Published November 23, 2018 | Version 2019-02-01
Dataset
Open
SPACCC_TOKEN
Description
[PlanTL/medicine/annotated corpus/guidelines/tokenization] First version of the tokenization annotations for the Spanish Clinical Case Corpus. The annotations were produced with the Spanish Clinical Case Corpus Part-of-Speech Tagger, which is based on FreeLing 3.1 (SPACCC_POS-TAGGER, https://github.com/PlanTL/SPACCC_POS-TAGGER).
Copyright (c) 2018 Secretaría de Estado para el Avance Digital
Notes
Files
Name | Size |
---|---|
SPACCC_TOKEN.zip (md5:0c5696260771cdb2ff37963009e5be3c) | 13.0 MB |
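Since the record lists an MD5 checksum for the archive, a downloaded copy can be checked against it before use. The following is a minimal sketch, not part of the dataset itself: the helper names `md5_of_file` and `verify_archive` are hypothetical, and it assumes the archive was saved as `SPACCC_TOKEN.zip` in the current working directory.

```python
import hashlib
from pathlib import Path

# Checksum as listed in the Files table of this record.
EXPECTED_MD5 = "0c5696260771cdb2ff37963009e5be3c"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 matches the expected digest (case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was downloaded.
    archive = Path("SPACCC_TOKEN.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use constant regardless of archive size. A mismatch usually indicates an incomplete or corrupted download.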
Additional details
References
- Villegas M, de la Peña S, Intxaurrondo A, Santamaria J, Krallinger M. Esfuerzos para fomentar la minería de textos en biomedicina más allá del inglés: el plan estratégico nacional español para las tecnologías del lenguaje. Procesamiento del Lenguaje Natural. 2017(59):141-4.
Subjects
- Natural language processing
- http://id.loc.gov/authorities/subjects/sh88002425
- Medicine
- http://id.loc.gov/authorities/subjects/sh85083064