Speaker ethnic identification for continuous speech in Malay language using pitch and MFCC

Rafizah Mohd Hanifa; Khalid Isa; Shamsul Mohamad

doi:10.11591/ijeecs.v19.i1.pp207-214

Published July 1, 2020 | Version v1

Journal article Open

Speaker ethnic identification for continuous speech in Malay language using pitch and MFCC

1. Universiti Tun Hussein Onn Malaysia

Voice recognition has evolved exponentially over the years. The purpose of voice recognition or sometimes called speaker identification, is to identify the person who is speaking. This can be done by extracting features of speech that differ between individuals due to physiology (shape and size of the mouth and throat) and also behavioral patterns (pitch, accent and style of speaking). This paper explains an approach of voice recognition to identify the ethnicity of Malaysian people. Pitch and 13 Mel-Frequency Cepstrum Coefficients (MFCCs) are extracted from 52 recorded continuous speech in Malay for use as features to train the classifiers using Tree, Naïve Bayes, Nearest Neighbors and Support Vector Machine (SVM) and another 10 recorded speeches are used for testing. The results reveal that the use of a combination of pitch and 13 coefficients for features extraction and training the data using SVM provide better accuracy (57.7%) than the use of only 13 coefficients (53.8%).

Files

25 21483-41731-1-PB.pdf

Files (450.7 kB)

Name	Size	Download all
25 21483-41731-1-PB.pdf md5:1e7cc95219c8427b8de69cd36c337b50	450.7 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	21	21
Downloads	28	28
Data volume	12.6 MB	12.6 MB

More info on how stats are collected....

DOI

Resource type

Journal article

Publisher

Zenodo

Published in

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS), 19(1), 207-214, 2020.

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 18, 2021
Modified: July 17, 2024

Speaker ethnic identification for continuous speech in Malay language using pitch and MFCC

Creators

Description

Files

25 21483-41731-1-PB.pdf

Files (450.7 kB)