SPACCC_TOKEN

Montserrat Marimon; Martin Krallinger; Aitor Gonzalez Agirre; Marta Villegas; Ander Intxaurrondo

doi:10.5281/zenodo.2560338

Published November 23, 2018 | Version 2019-02-01

Dataset Open

SPACCC_TOKEN

[PlanTL/medicine/annotated corpus/guidelines/tokenization] First version of the tokenization annotations in the Spanish Clinical Case Corpus that have been carried out by means of the Spanish Clinical Case Corpus Part-of-Speech Tagger based on FreeLing3.1 (SPACCC_POS-TAGGER, https://github.com/PlanTL/SPACCC_POS-TAGGER).

Notes

Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).

Files

SPACCC_TOKEN.zip

Files (13.0 MB)

Name	Size	Download all
SPACCC_TOKEN.zip md5:0c5696260771cdb2ff37963009e5be3c	13.0 MB	Preview Download

Additional details

Villegas M, de la Peña S, Intxaurrondo A, Santamaria J, Krallinger M. Esfuerzos para fomentar la minería de textos en biomedicina más allá del inglés: el plan estratégico nacional español para las tecnologías del lenguaje. Procesamiento del Lenguaje Natural. 2017(59):141-4.

Natural language processing: http://id.loc.gov/authorities/subjects/sh88002425
Medicine: http://id.loc.gov/authorities/subjects/sh85083064

	All versions	This version
Views	794	520
Downloads	143	120
Data volume	2.1 GB	1.7 GB

SPACCC_TOKEN

Notes

Files

SPACCC_TOKEN.zip

Files (13.0 MB)

Additional details

References

Subjects

SPACCC_TOKEN

Creators

Description

Notes

Files

SPACCC_TOKEN.zip

Files (13.0 MB)

Additional details

References

Subjects