Figure 2. MFCC Block Diagram Step 1-Development an Automatic Speech to Facial Animation Conversion for Improve Deaf Lives
- 1. S. Alireza Kasaei Young Researchers Club, Isfahan Branch (Khurasgan), Islamic Azad University,Isfahan,
Description
Mel Frequency Cepstral Coefficients (MFCCs) are coefficients that represent audio based on
human auditory perception. They are derived from the Fourier transform (computed with the FFT)
and the Discrete Cosine Transform (DCT) of the audio clip. The basic difference between a plain
FFT/DCT representation and the MFCCs is that in the MFCCs the frequency bands are positioned
logarithmically (on the Mel scale), which approximates the human auditory system's response
more closely than the linearly spaced frequency bands of the FFT or DCT. This perceptual
spacing makes the features better suited to speech processing. The main purpose of the MFCC
processor is to mimic the behavior of the human ear. Overall, the MFCC process has five steps,
which are shown in Figure 2.
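The difference between Mel-spaced and linearly spaced frequency bands can be illustrated with a minimal sketch. It is not taken from the figure or the paper. It assumes the commonly used conversion mel = 2595 * log10(1 + f / 700), and the function names, the band count, and the 0 to 8000 Hz range are all hypothetical choices made for illustration.

```python
import math


def hz_to_mel(f_hz):
    """Convert a frequency in Hz to mels (assumed formula: 2595 * log10(1 + f/700))."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel: convert mels back to Hz."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_band_centers(n_bands, f_min_hz, f_max_hz):
    """Center frequencies (Hz) of n_bands spaced equally on the Mel scale."""
    m_min, m_max = hz_to_mel(f_min_hz), hz_to_mel(f_max_hz)
    step = (m_max - m_min) / (n_bands + 1)
    return [mel_to_hz(m_min + step * (i + 1)) for i in range(n_bands)]


def linear_band_centers(n_bands, f_min_hz, f_max_hz):
    """Center frequencies (Hz) of n_bands spaced equally in Hz, for comparison."""
    step = (f_max_hz - f_min_hz) / (n_bands + 1)
    return [f_min_hz + step * (i + 1) for i in range(n_bands)]


if __name__ == "__main__":
    # Illustrative values only: 10 bands between 0 Hz and 8000 Hz.
    mel_centers = mel_band_centers(10, 0.0, 8000.0)
    lin_centers = linear_band_centers(10, 0.0, 8000.0)
    for m, l in zip(mel_centers, lin_centers):
        print(f"mel-spaced: {m:8.1f} Hz    linear: {l:8.1f} Hz")
```

Run as a script, this prints the two sets of band centers side by side. The Mel-spaced centers crowd together at low frequencies and spread out at high frequencies, whereas the linear centers are evenly spaced in Hz.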
Files
Figure 2. MFCC Block Diagram.png
Name | Size |
---|---|
Figure 2. MFCC Block Diagram.png (md5:db17f54599cae4ecc673672e78fb3b83) | 93.7 kB |