Published January 27, 2009 | Version 4571
Journal article Open

An Advanced Method for Speech Recognition

Description

In this paper, in view of the deficiencies of the available speech recognition techniques, an advanced method is presented that classifies speech signals with high accuracy (98%) in minimal time. In the presented method, the recorded signal is first preprocessed; this stage includes denoising with Mel-frequency cepstral analysis and feature extraction using discrete wavelet transform (DWT) coefficients. These features are then fed to a multilayer perceptron (MLP) network for classification. Finally, after the neural network has been trained, the effective features are selected with the UTA algorithm.
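The feature extraction and feature selection steps can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not state the wavelet family, the decomposition depth, or how the DWT coefficients are summarised, so the Haar wavelet and per-band energies used here are assumptions. The UTA step is shown in its commonly described mean-substitution form (replace one input by its average and measure the rise in error of the already trained network), and `predict` is a hypothetical stand-in for the trained MLP.

```python
import math
from typing import Callable, List, Sequence, Tuple


def haar_dwt(signal: Sequence[float]) -> Tuple[List[float], List[float]]:
    """One level of the orthonormal Haar DWT: (approximation, detail)."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail


def dwt_energy_features(signal: Sequence[float], levels: int) -> List[float]:
    """Energy of each detail band, followed by the final approximation band.

    Assumption: band energies stand in for whatever DWT-based features
    the paper actually uses.
    """
    features = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        features.append(sum(d * d for d in detail))
    features.append(sum(a * a for a in approx))
    return features


def uta_scores(
    predict: Callable[[List[float]], int],
    X: Sequence[Sequence[float]],
    y: Sequence[int],
) -> List[float]:
    """Mean-substitution saliency for each input feature.

    `predict` is a hypothetical callable wrapping an already trained
    classifier. A score near zero (or below) suggests the feature can be
    dropped; a large positive score marks an effective feature.
    """
    def error(rows):
        return sum(predict(list(r)) != t for r, t in zip(rows, y)) / len(y)

    baseline = error(X)
    scores = []
    for j in range(len(X[0])):
        mean_j = sum(r[j] for r in X) / len(X)
        replaced = [list(r[:j]) + [mean_j] + list(r[j + 1:]) for r in X]
        scores.append(error(replaced) - baseline)
    return scores
```

In this sketch, the feature vector of each recorded utterance would come from `dwt_energy_features`, the MLP would be trained on those vectors, and `uta_scores` would then rank the inputs so that low-scoring ones can be removed.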

Files

4571.pdf (493.0 kB)
md5:5423363575431d4112cfcab36ab317f0
