Publication Type : Conference Paper
Publisher : International Conference on Intelligent Computing (ICIC) 2018, Amrita School of Engineering,
Source : International Conference on Intelligent Computing (ICIC) 2018, Amrita School of Engineering, Bengaluru, 2018.
Campus : Bengaluru
School : School of Engineering
Department : Electronics and Communication
Year : 2018
Abstract : Mel Frequency Cepstral Coefficients (MFCCs) and Perceptual linear prediction coefficients (PLPCs) are widely casted nonlinear vocal parameters in majority of the speaker identification, speaker and speech recognition techniques as well in the field of emotion recognition. Post 1980s, significant exertions are put forth on for the progress of these features. Considerations like the usage of appropriate frequency estimation approaches, proposal of appropriate filter banks, and selection of preferred features perform a vital part for the strength of models employing these features. This article projects an overview of MFCC and PLPC features for different speech applications. The insights such as performance metrics of accuracy, background environment, type of data, and size of features are inspected and concise with the corresponding key references. Adding more to this, the advantages and shortcomings of these features have been discussed. This background work will hopefully contribute to floating a heading step in the direction of the enhancement of MFCC and PLPC with respect to novelty, raised levels of accuracy, and lesser complexity.
Cite this Research Publication : S. Lalitha and Dr. Deepa Gupta, “An Encapsulation of Vital Non-Linear Frequency Features for Speech Applications”, in International Conference on Intelligent Computing (ICIC) 2018, Amrita School of Engineering, Bengaluru, 2018.