Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published June 1, 2011 | Version v1
Figure Open

Figure 2. MFCC Block Diagram Step 1-Development an Automatic Speech to Facial Animation Conversion for Improve Deaf Lives

  • 1. S. Alireza Kasaei Young Researchers Club, Isfahan Branch (Khurasgan), Islamic Azad University,Isfahan,

Description

Mel Frequency Cepstral Coefficients (MFCC) are coefficients that represent audio, based on
perception. It is derived from the Fourier Transform (FFT) or the Discrete Cosine Transform (DCT)
of the audio clip. The basic difference between the FFT/DCT and the MFCC is that in the MFCC,
the frequency bands are positioned logarithmically (on the Mel scale) which approximates the
human auditory system's response more closely than the linearly spaced frequency bands of FFT or
DCT. This allows for better processing of data. The main purpose of the MFCC processor is to
mimic the behavior of the human ears. Overall the MFCC process has 5 steps that show in figure 2.

Notes

https://www.edusoft.ro/brain/index.php/brain/issue/view/14

Files

Figure 2. MFCC Block Diagram.png

Files (93.7 kB)

Name Size Download all
md5:db17f54599cae4ecc673672e78fb3b83
93.7 kB Preview Download