Effect of Singular Value Decomposition Based Processing on Speech Perception

International Journal on Natural Language Computing (IJNLC)

doi:10.5121/ijnlc.2013.2102

Published December 13, 2023 | Version v1

Journal article Open

Effect of Singular Value Decomposition Based Processing on Speech Perception

International Journal on Natural Language Computing (IJNLC)

Speech is an important biological signal for primary mode of communication among human being and also the most natural and efficient form of exchanging information among human in speech. Speech processing is the most important aspect in signal processing. In this paper the theory of linear algebra called singular value decomposition (SVD) is applied to the speech signal. SVD is a technique for deriving important parameters of a signal. The parameters derived using SVD may further be reduced by perceptual evaluation of the synthesized speech using only perceptually important parameters, where the speech signal can be compressed so that the information can be transformed into compressed form without losing its quality. This technique finds wide applications in speech compression, speech recognition, and speech synthesis. The objective of this paper is to investigate the effect of SVD based feature selection of the input speech on the perception of the processed speech signal. The speech signal which is in the form of vowels \a\, \e\, \u\ were recorded from each of the six speakers (3 males and 3 females). The vowels for the six speakers were analyzed using SVD based processing and the effect of the reduction in singular values was investigated on the perception of the resynthesized vowels using reduced singular values. Investigations have shown that the number of singular values can be drastically reduced without significantly affecting the perception of the vowels.

Files

2113ijnlc02.pdf

Files (431.0 kB)

Name	Size	Download all
2113ijnlc02.pdf md5:6e682d08d788795733f2c8e5344cee8c	431.0 kB	Preview Download

Additional details

URL: https://airccse.org/journal/ijnlc/papers/2113ijnlc02.pdf

Is cited by: https://scholar.google.co.in/citations?user=A5tqIdoAAAAJ&hl=en (URL)

Copyrighted: 2013

[1] Ziolko B (2009) "Speech Recognition of Highly Inflective Languages", Ph.D. Thesis, University of York, Artificial Intelligence Group Pattern Recognition and Computer Vision Group, Department of Computer Science, United Kingdom, pp 1-122. [2] Pantazis I (2006) "Detection of discontinuities in concatenative speech synthesis", Msc thesis ,University of Crete, Department of Computer Science, Heraklion, Fall. [3] Gaikwad S K, Marathwada B A , Gawali B W, & Yannawar P (2010) "A review on speech recognition technique", Research Student Department of CS& IT, International Journal of Computer Applications, Nov., Vol. 10, No.3, pp 1-24. [4] Plannerer B (2005) "An introduction to speech recognition", March, pp 1-68. [5] Rabiner L R (1989) "A tutorial on hidden markov models and selected applications in speech recognition", in Proc. of the IEEE, Vol.77, No. 2. [6] Elaydi H, Jaber M I, Tanboura M B "Speech compression using wavelets", Electrical & Computer Engineering Department, Islamic University of Gaza,Gaza, Palestine. [7] Jayant N ,fellow, Johnston J, & Safranek R (1993) " Signal compression based on models of human perception", in proc. of IEEE signal processing research department , oct,Vol. 81, No. 10 , pp 1385- 1422. [8] ZHOU Z (2001) " Data compression for radar signals an SVD based approach" , Msc thesis, state university of new york at binghamton, may, pp 1-57. [9] Wall M E., Rechtsteiner A, & Rocha L M (2003) "Singular value decomposition and principal component analysis". [10] Lilly B T & Paliwal K K (1997)" Robust speech recognition using singular value decomposition based speech enhancement" , IEEE Tencon Speech and Image Technologies for Computing and Telecommunications, Signal Processing Laboratory School of Microelectronic Engineering Griffith University. [11] Akritas A G & Malaschonok G I (2002) "Applications of singular value decomposition (SVD)", Department of Computer and Communication Engineering, University of Thessaly, Greece, pp 1-15. [12] Jolliffe, I. T (1986) " Principal Component Analysis" ,Springer-Verlag ,pp. 487. [13] Zou H, Hastiey T and Tibshiraniz R (2004), " Sparse Principal Component Analysis", Department of Health Research & Policy and Department of Statistics, Stanford University, Stanford, April 26, pp 1- 30. [14] Dendrinos et al,& Loizou P C (2003) "A generalized subspace approach for enhancing speech corrupted by colored noise" ,Yi Hu, Student Member, IEEE. [15] Kamm C, Walker M, & Rabiner L "The role of speech processing in human-computer intelligent communication", Speech and Image Processing Services Research Laboratory AT&T Labs Research, Florham Park, pp 1-26. [16] Chakroborty S, & Saha G (2010) "Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification", Speech Communication, pp 693–709. [17] Zehtabian A, & Hassanpour H "Optimized Singular vector denoising approach for speech enhancement", Iranica Journal of Energy & Environment, IJEE an Official Peer Reviewed Journal , Babol Noshirvani University of Technology, pp 166-180. [18] Cao L (2007) "Singular Value Decomposition Applied to Digital Image Processing", Division of computing studies, Arizona state university polytechnic campus mesa, May, pp 1-16.

	All versions	This version
Views	19	19
Downloads	14	14
Data volume	6.0 MB	6.0 MB

2113ijnlc02.pdf

Files (431.0 kB)

Identifiers

Related works

Dates

References

Effect of Singular Value Decomposition Based Processing on Speech Perception

Authors/Creators

Description

Files

2113ijnlc02.pdf

Files (431.0 kB)

Additional details

Identifiers

Related works

Dates

References