Published February 25, 2021
| Version v1.0
Dataset
Open
Catalan Sub-word Embeddings in FastText
Authors/Creators
- 1. Barcelona Supercomputing Center
Description
These Catalan sub-word embeddings in FastText using BPE have been generated from the largest corpus ever made in Catalan till the date. The corpus has more than 10Gb of curated high quality text.
If this material is useful, please cite it.
Copyright (c) 2021 Text Mining Unit - Barcelona Supercomputing Center