Published February 16, 2021
| Version 3.0
Dataset
Open
Spanish Biomedical Word Embeddings in FastText
Authors/Creators
- 1. Barcelona Supercomputing Center
Description
Spanish Biomedical Word Embeddings in FastText
These word embeddings have been generated from the largest corpus ever made from Spanish Biomedicine resources till the date.
The corpus has more than 6Gb of curated high quality text.
For previous version (v2) see: https://zenodo.org/record/3744326#.YCu3fGj0mUk
Citation
@misc{temu2021spanish,
title={Spanish Biomedical and Clinical Language Embeddings},
author={Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Casimiro Pio Carrino and Ona De Gibert and Aitor Gonzalez-Agirre and Marta Villegas},
year={2021},
eprint={2102.12843},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Copyright (c) 2021 Text Mining Unit - Barcelona Supercomputing Center
Notes
Files
cased.zip
Additional details
Related works
- Is supplement to
- https://github.com/PlanTL-SANIDAD/Biomedical-Word-Embeddings-for-Spanish (URL)