Transfer Learning Performance of Self-Supervised Speech Models Pre-trained on Flemish Dutch for ASR Tasks
Description
Recent research in speech processing exhibits a growing interest in unsupervised and self-supervised representation learning from unlabelled data to alleviate the need for large amounts of annotated data. We investigate several popular pre-training methods and apply them to Flemish Dutch. We compare off-the-shelf English pre-trained models to models trained on an increasing amount of Flemish data. We find that the most important factors for positive transfer to downstream speech recognition tasks include a substantial amount of data and a matching pre-training domain. Ideally, we also finetune
Research goal: How does the transfer learning performance of self-supervised speech models pre-trained on Flemish Dutch compare to other low-resource languages when fine-tuned for automated speech recognition (ASR) tasks, as measured by word error rate (WER) on standardized benchmarks like LibriSpeech or Common Voice?
Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 7.5/10.
Notes
Files
paper.pdf
Files
(84.9 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:bf3941ca4fe42aab79a1bd3c87c3c870
|
84.9 kB | Preview Download |