The convolutional neural networks for Amazigh speech recognition system

Meryam Telmem; Youssef Ghanou

doi:10.12928/TELKOMNIKA.v19i2.16793

Published April 1, 2021 | Version v1

Journal article Open

The convolutional neural networks for Amazigh speech recognition system

1. Moulay Ismail University, Meknes, Morocco

In this paper, we present an approach based on convolutional neural networks to build an automatic speech recognition system for the Amazigh language. This system is built with TensorFlow and uses mel frequency cepstral coefficient (MFCC) to extract features. In order to test the effect of the speaker's gender and age on the accuracy of the model, the system was trained and tested on several datasets. The first experiment the dataset consists of 9240 audio files. The second experiment the dataset consists of 9240 audio files distributed between females and males’ speakers. The last experiment 3 the dataset consists of 13860 audio files distributed between age 9-15, age 16-30, and age 30+. The result shows that the model trained on a dataset of adult speaker’s age +30 categories generates the best accuracy with 93.9%.

Files

20 16793.pdf

Files (808.0 kB)

Name	Size	Download all
20 16793.pdf md5:d3b7f95671cff702679d23b3a3496c87	808.0 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	41	41
Downloads	84	84
Data volume	70.3 MB	70.3 MB

More info on how stats are collected....

DOI

Resource type

Journal article

Publisher

Zenodo

Published in

TELKOMNIKA Telecommunication, Computing, Electronics and Control, 19(2), pp. 515~522, 2021.

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 20, 2021
Modified: July 19, 2024

The convolutional neural networks for Amazigh speech recognition system

Authors/Creators

Description

Files

20 16793.pdf

Files (808.0 kB)