Planned intervention: On Thursday March 28th 07:00 UTC Zenodo will be unavailable for up to 5 minutes to perform a database upgrade.
Published November 25, 2009 | Version 5166
Journal article Open

A High Quality Speech Coder at 600 bps

Description

This paper presents a vocoder to obtain high quality synthetic speech at 600 bps. To reduce the bit rate, the algorithm is based on a sinusoidally excited linear prediction model which extracts few coding parameters, and three consecutive frames are grouped into a superframe and jointly vector quantization is used to obtain high coding efficiency. The inter-frame redundancy is exploited with distinct quantization schemes for different unvoiced/voiced frame combinations in the superframe. Experimental results show that the quality of the proposed coder is better than that of 2.4kbps LPC10e and achieves approximately the same as that of 2.4kbps MELP and with high robustness.

Files

5166.pdf

Files (117.4 kB)

Name Size Download all
md5:7c87223c971072b58053408bb8b30ecd
117.4 kB Preview Download

Additional details

References

  • Ovens, M.J..Ponting, Turner.M.E, "Ultra low bit rate voice coding," IEE Seminar, Vol.4, pp 911 - 920, 2000
  • Gwenael guilmin, Francois Capman, and et.al, "New NATO STANAG narrow band voice coder at 600 bit/s", IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.3, pp.689-692, 2006
  • T.Wang, K.Koishida, V.Cuperman, and et.al, "A 1200/2400 bps coding suite based on MELP," Proc of IEEE Workshop on Speech Coding, Vol.1, pp. 122-126, 2002
  • O.Gottesman, A.Gersho, "Enhanced Waveform Interpolative Coding at Low Bit-rate", IEEE Trans.Speech Audio Processing, vol.9, No.8, pp.242-250, 2001
  • Minoru Kohata, "A New 1.2kbit/s speech coding method based on a sinusoidal harmonic vocoder," Systems and Computers in Japan, vol.31, No.14, pp.64-73, 2000
  • Jian Cong, Suo Cong, "New speech encoding algorithm for ultra low bit rate at 600/300," IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.2, pp.709-712, 2006
  • Ehsan Jahangiri, Shahrokh Ghaemmaghami, "Scalable speech coding at rates below 900 bps", IEEE International Conference on Multimedia & Expo, Vol.1, pp.85-88, 2008
  • A.D.Subramaniam, B.D.Rao, "PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies," IEEE Trans. Speech Audio Processing, Vol. 11, No. 2, pp. 130-142, Mar. 2003.
  • L.M. Supplee, R.P.Cohn, J.S.Collura, A.V.McCree, "MELP: The new federal standard at 2400 bits/s," IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.4, pp.1591-1954, 1997 [10] Thomas E.Tremain, "The Government Standard Linear Predictive Coding Algorithm: LPC-10," Speech Technology, No.2, pp.40-49, 1982