Speech corpus for Catalan. The final version, v2.0 corpus can be downloaded alternatively in parts:
Preparation of this corpus was partly supported by the Department of Culture of the Catalan autonomous government, and the v2.0 was supported by the Barcelona Supercomputing Center, within the framework of the project AINA of the Departament de Polítiques Digitals.
As of v2.0 the corpus is separated into 211 hours of clean and 400 hours of other quality segments. The stat details are as follows:
For more information go to https://collectivat.cat/asr or mail email@example.com.