Publication Type : Conference Paper
Publisher : Procedia Computer Science, Elsevier.
Source : Procedia Computer Science, Elsevier, Volume 89, p.804-811 (2016)
Keywords : Data mining, extraction, Feature extraction, Filter banks, filtration, Image processing, IMFCC, Loudspeakers, Mel-filter banks, MFCC, Short-time energy, Signal processing, Speech, Speech analysis, Speech communication, Speech recognition, Unvoiced speech, Warehouses
Campus : Bengaluru
School : School of Agricultural Sciences, School of Engineering
Year : 2016
Abstract : Speaker identification system identifies the person by his/her speech sample. Speaker Identification (SI) system should posses a robust feature extraction unit and a good classifier. Mel frequency cepstral coefficient (MFCC) is very old feature extraction scheme, which has been regarded as standard set of feature vectors for speaker identification. The mel filter bank used in MFCC method, captures the speaker information more effectively in lower frequencies than higher frequencies. Hence high frequency region characteristics are lost. This problem is solved in the proposed method. The speech signal comprises both voiced and unvoiced segments. The voiced segment includes high energy, low frequency components and unvoiced segment includes low energy, high frequency components. In proposed method, the speech sample is divided into voiced and unvoiced segments. The voiced speech segment is filtered using mel filter bank to generate MFCC from lower frequencies of speech signal and unvoiced speech segment is filtered using inverted mel filter bank to generate IMFCC from higher frequencies of speech signal. © 2016 Elsevier B.V. All rights reserved.
Cite this Research Publication : Latha, “Robust Speaker Identification Incorporating High Frequency Features”, in Procedia Computer Science, 2016, vol. 89, pp. 804-811.