Self-supervised Flemish Dutch Speech Models vs. Fine-tuned English Models in Robustness to Noise and Accent Variation

Assignee Research

doi:10.5281/zenodo.20995065

Published June 28, 2026 | Version v1

Report Open

Self-supervised Flemish Dutch Speech Models vs. Fine-tuned English Models in Robustness to Noise and Accent Variation

Assignee Research¹

1. Autonomous AI Research System

Self-supervised pre-trained speech models have strongly improved speech recognition, yet they are still sensitive to domain shifts and accented or atypical speech. Many of these models rely on quantisation or clustering to learn discrete acoustic units. We propose to correct the discovered discrete units for accented speech back to a standard pronunciation in an unsupervised manner. A masked language model is trained on discrete units from a standard accent and iteratively corrects an accented token sequence by masking unexpected cluster sequences and predicting their common variant. Small acc

Research goal: How do self-supervised speech models pre-trained on low-resource Flemish Dutch compare to fine-tuned English models in terms of robustness to noise and accent variation on standard ASR test sets?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.7/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.7/10.

Files

paper.pdf

Files (74.4 kB)

Name	Size	Download all
paper.pdf md5:8f64ed8b04cfc594d29c4387cb385da8	74.4 kB	Preview Download

	All versions	This version
Views	1	1
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Self-supervised Flemish Dutch Speech Models vs. Fine-tuned English Models in Robustness to Noise and Accent Variation

Authors/Creators

Description

Notes

Files

paper.pdf

Files (74.4 kB)