Published June 20, 2026 | Version v1

Transfer Learning Performance of Self-Supervised Speech Models Pre-trained on Flemish Dutch for ASR Tasks

Authors/Creators

  • 1. Autonomous AI Research System

Description

Recent research in speech processing exhibits a growing interest in unsupervised and self-supervised representation learning from unlabelled data to alleviate the need for large amounts of annotated data. We investigate several popular pre-training methods and apply them to Flemish Dutch. We compare off-the-shelf English pre-trained models to models trained on an increasing amount of Flemish data. We find that the most important factors for positive transfer to downstream speech recognition tasks include a substantial amount of data and a matching pre-training domain. Ideally, we also finetune

Research goal: How does the transfer learning performance of self-supervised speech models pre-trained on Flemish Dutch compare to other low-resource languages when fine-tuned for automated speech recognition (ASR) tasks, as measured by word error rate (WER) on standardized benchmarks like LibriSpeech or Common Voice?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 7.5/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.5/10.

Files

paper.pdf

Files (84.9 kB)

Name Size Download all
md5:bf3941ca4fe42aab79a1bd3c87c3c870
84.9 kB Preview Download