Dialect classification using acoustic and linguistic features in Arabic speech

doi:10.11591/ijai.v12.i2.pp739-746

Published June 1, 2023 | Version v1

Journal article Open

Dialect classification using acoustic and linguistic features in Arabic speech

1. Universiti Brunei Darussalam

Speech dialects refer to linguistic and pronunciation variations in the speech of the same language. Automatic dialect classification requires considerable acoustic and linguistic differences between different dialect categories of speech. This paper proposes a classification model composed of a combination of classifiers for the Arabic dialects by utilizing both the acoustic and linguistic features of spontaneous speech. The acoustic classification comprises of an ensemble of classifiers focusing on different frequency ranges within the short-term spectral features, as well as a classifier utilizing the ‘i-vector’, whilst the linguistic classifiers use features extracted by transformer models pre-trained on large Arabic text datasets. It has been shown that the proposed fusion of multiple classifiers achieves a classification accuracy of 82.44% for the identification task of five Arabic dialects. This represents the highest accuracy reported on the dataset, despite the relative simplicity of the proposed model, and has shown its applicability and relevance for dialect identification tasks.

Files

26 22490 1570766525.pdf

Files (276.4 kB)

Name	Size	Download all
26 22490 1570766525.pdf md5:3142215f31cbca8d4d4edc097ad3d0af	276.4 kB	Preview Download

	All versions	This version
Views	22	22
Downloads	19	19
Data volume	5.5 MB	5.5 MB

Dialect classification using acoustic and linguistic features in Arabic speech

Creators

Description

Files

26 22490 1570766525.pdf

Files (276.4 kB)