An ensemble technique for speech recognition in noisy environments

Imad Qasim Habeeb; Tamara Z. Fadhil; Yaseen Naser Jurn; Zeyad Qasim Habeeb; Hanan Najm Abdulkhudhur

doi:10.11591/ijeecs.v18.i2.pp835-842

Published May 1, 2020 | Version v1

Journal article Open

1. University of Information Technology and Communications
2. University of Technology
3. Ministry of Higher Education and Scientific Research

Automatic speech recognition (ASR) is a technology that allows computers and mobile devices to recognize spoken language and convert it into text. ASR systems often produce poor accuracy on noisy speech signals. This research therefore proposes an ensemble technique that does not rely on a single filter for perfect noise reduction but instead combines information from multiple noise reduction filters to improve the final ASR accuracy. The core of the technique is the generation of K copies of the speech signal using three noise reduction filters. The speech features of these copies differ slightly, so the ASR system extracts different texts from them, and the best of these texts is elected as the final ASR output. The ensemble technique was compared with three related current noise reduction techniques in terms of character error rate (CER) and word error rate (WER). The test results were encouraging, showing relative decreases of 16.61% in CER and 11.54% in WER compared with the best current technique. The ASR field will benefit from this contribution, which increases the recognition accuracy of human speech in the presence of background noise.
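The pipeline described in the abstract (filter the signal several ways, transcribe each copy, elect one text, then score with CER and WER) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the abstract does not say how the best text is elected, so the sketch picks the candidate that agrees most with the others (smallest total word edit distance), and `filters` and `recognize` are hypothetical callables standing in for real noise reduction filters and a real ASR system. It is not the authors' implementation.

```python
from typing import Callable, List, Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance over reference word count."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edit distance over reference length."""
    return edit_distance(reference, hypothesis) / len(reference)


def relative_reduction(baseline: float, new: float) -> float:
    """Relative decrease of an error rate with respect to a baseline."""
    return (baseline - new) / baseline


def elect_transcript(candidates: List[str]) -> str:
    """ASSUMED election rule: choose the candidate closest to all the others."""
    def total_distance(i: int) -> int:
        words = candidates[i].split()
        return sum(edit_distance(words, other.split())
                   for j, other in enumerate(candidates) if j != i)
    return candidates[min(range(len(candidates)), key=total_distance)]


def ensemble_recognize(signal, filters: List[Callable], recognize: Callable) -> str:
    """Make one filtered copy per filter, transcribe each, elect one text."""
    copies = [f(signal) for f in filters]      # hypothetical noise filters
    texts = [recognize(c) for c in copies]     # hypothetical ASR system
    return elect_transcript(texts)
```

With real components, `filters` would wrap the chosen noise reduction filters and `recognize` would call the ASR engine; `wer`, `cer`, and `relative_reduction` then compare the elected text against a reference transcript and against a baseline technique.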

Files

33 20428.pdf (638.3 kB), md5:1ea0c589377e88500e9eafbea0a653e4

