Published February 23, 2021 | Version v1.0
Dataset Open

Spanish Biomedical Sub-word Embeddings in FastText

Description

Spanish Biomedical Sub-word Embeddings in FastText

These embeddings have been generated from the largest corpus ever made from Spanish Biomedical resources till the date.

Citation

@misc{temu2021spanish,
      title={Spanish Biomedical and Clinical Language Embeddings}, 
      author={Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Casimiro Pio Carrino and Ona De Gibert and Aitor Gonzalez-Agirre and Marta Villegas},
      year={2021},
      eprint={2102.12843},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

 

Copyright (c) 2021 Secretaría de Estado de Digitalización e Inteligencia Artificial

Notes

Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).

Files

cased.zip

Files (12.9 GB)

Name Size Download all
md5:5b1365aa7a5f42aba05f5e8c73a61811
6.8 GB Preview Download
md5:2ab724713fdaf49e4523c4503bfd068d
18.7 kB Preview Download
md5:fa42858ee36cfb53a7d3e06e8d161091
1.0 kB Preview Download
md5:e636b4d7c0298b192b0cc892a98aed47
6.1 GB Preview Download

Additional details