Effect of Singular Value Decomposition Based Processing on Speech Perception
Authors/Creators
Description
Speech is an important biological signal for primary mode of communication among human being and also the most natural and efficient form of exchanging information among human in speech. Speech processing is the most important aspect in signal processing. In this paper the theory of linear algebra called singular value decomposition (SVD) is applied to the speech signal. SVD is a technique for deriving important parameters of a signal. The parameters derived using SVD may further be reduced by perceptual evaluation of the synthesized speech using only perceptually important parameters, where the speech signal can be compressed so that the information can be transformed into compressed form without losing its quality. This technique finds wide applications in speech compression, speech recognition, and speech synthesis. The objective of this paper is to investigate the effect of SVD based feature selection of the input speech on the perception of the processed speech signal. The speech signal which is in the form of vowels \a\, \e\, \u\ were recorded from each of the six speakers (3 males and 3 females). The vowels for the six speakers were analyzed using SVD based processing and the effect of the reduction in singular values was investigated on the perception of the resynthesized vowels using reduced singular values. Investigations have shown that the number of singular values can be drastically reduced without significantly affecting the perception of the vowels.
Files
2113ijnlc02.pdf
Files
(431.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:6e682d08d788795733f2c8e5344cee8c
|
431.0 kB | Preview Download |
Additional details
Identifiers
Related works
- Is cited by
- https://scholar.google.co.in/citations?user=A5tqIdoAAAAJ&hl=en (URL)
Dates
- Copyrighted
-
2013
References
- [1] Ziolko B (2009) "Speech Recognition of Highly Inflective Languages", Ph.D. Thesis, University of York, Artificial Intelligence Group Pattern Recognition and Computer Vision Group, Department of Computer Science, United Kingdom, pp 1-122. [2] Pantazis I (2006) "Detection of discontinuities in concatenative speech synthesis", Msc thesis ,University of Crete, Department of Computer Science, Heraklion, Fall. [3] Gaikwad S K, Marathwada B A , Gawali B W, & Yannawar P (2010) "A review on speech recognition technique", Research Student Department of CS& IT, International Journal of Computer Applications, Nov., Vol. 10, No.3, pp 1-24. [4] Plannerer B (2005) "An introduction to speech recognition", March, pp 1-68. [5] Rabiner L R (1989) "A tutorial on hidden markov models and selected applications in speech recognition", in Proc. of the IEEE, Vol.77, No. 2. [6] Elaydi H, Jaber M I, Tanboura M B "Speech compression using wavelets", Electrical & Computer Engineering Department, Islamic University of Gaza,Gaza, Palestine. [7] Jayant N ,fellow, Johnston J, & Safranek R (1993) " Signal compression based on models of human perception", in proc. of IEEE signal processing research department , oct,Vol. 81, No. 10 , pp 1385- 1422. [8] ZHOU Z (2001) " Data compression for radar signals an SVD based approach" , Msc thesis, state university of new york at binghamton, may, pp 1-57. [9] Wall M E., Rechtsteiner A, & Rocha L M (2003) "Singular value decomposition and principal component analysis". [10] Lilly B T & Paliwal K K (1997)" Robust speech recognition using singular value decomposition based speech enhancement" , IEEE Tencon Speech and Image Technologies for Computing and Telecommunications, Signal Processing Laboratory School of Microelectronic Engineering Griffith University. [11] Akritas A G & Malaschonok G I (2002) "Applications of singular value decomposition (SVD)", Department of Computer and Communication Engineering, University of Thessaly, Greece, pp 1-15. [12] Jolliffe, I. T (1986) " Principal Component Analysis" ,Springer-Verlag ,pp. 487. [13] Zou H, Hastiey T and Tibshiraniz R (2004), " Sparse Principal Component Analysis", Department of Health Research & Policy and Department of Statistics, Stanford University, Stanford, April 26, pp 1- 30. [14] Dendrinos et al,& Loizou P C (2003) "A generalized subspace approach for enhancing speech corrupted by colored noise" ,Yi Hu, Student Member, IEEE. [15] Kamm C, Walker M, & Rabiner L "The role of speech processing in human-computer intelligent communication", Speech and Image Processing Services Research Laboratory AT&T Labs Research, Florham Park, pp 1-26. [16] Chakroborty S, & Saha G (2010) "Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification", Speech Communication, pp 693–709. [17] Zehtabian A, & Hassanpour H "Optimized Singular vector denoising approach for speech enhancement", Iranica Journal of Energy & Environment, IJEE an Official Peer Reviewed Journal , Babol Noshirvani University of Technology, pp 166-180. [18] Cao L (2007) "Singular Value Decomposition Applied to Digital Image Processing", Division of computing studies, Arizona state university polytechnic campus mesa, May, pp 1-16.