Pronunciation modeling for dialectal arabic speech recognition

Al Haj, Hassan; Hsiao, Roger; Lane, Ian; Black, Alan W.; Waibel, Alex

doi:10.1109/asru.2009.5373245

Published January 17, 2010 | Version v1

Conference paper Open

Pronunciation modeling for dialectal arabic speech recognition

Short vowels in Arabic are normally omitted in written text which leads to ambiguity in the pronunciation. This is even more pronounced for dialectal Arabic where a single word can be pronounced quite differently based on the speaker's nationality, level of education, social class and religion. In this paper we focus on pronunciation modeling for Iraqi-Arabic speech. We introduce multiple pronunciations into the Iraqi speech recognition lexicon, and compare the performance, when weights computed via forced alignment are assigned to the different pronunciations of a word. Incorporating multiple pronunciations improved recognition accuracy compared to a single pronunciation baseline and introducing pronunciation weights further improved performance. Using these techniques an absolute reduction in word-error-rate of 2.4% was obtained compared to the baseline system.

Files

article.pdf

Files (112.4 kB)

Name	Size	Download all
article.pdf md5:6fd57d0f128a9cad1be7ffd31991632b	112.4 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	43	43
Downloads	84	84
Data volume	9.4 MB	9.4 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Creative Commons Zero v1.0 Universal

CC0 waives copyright interest in a work you've created and dedicates it to the world-wide public domain. Use CC0 to opt out of copyright entirely and ensure your work has the widest reach. Read more

Technical metadata

Created: June 4, 2018
Modified: August 2, 2024

Pronunciation modeling for dialectal arabic speech recognition

Creators

Description

Files

article.pdf

Files (112.4 kB)