Back close

Weighted cosine distance features for speaker verification

Publication Type : Conference Paper

Publisher : 12th IEEE International Conference Electronics, Energy, Environment, Communication, Computer, Control: (E3-C3), INDICON 2015

Source : 12th IEEE International Conference Electronics, Energy, Environment, Communication, Computer, Control: (E3-C3), INDICON 2015, Institute of Electrical and Electronics Engineers Inc. (2015)

Url : https://www.scopus.com/inward/record.uri?eid=2-s2.0-84994381810&partnerID=40&md5=5ba693bdcfce142b838475919f80c6fc

ISBN : 9781467373999

Keywords : Additive noise, Cepstral coefficients, Classification (of information), Cosine distance scoring, Individual systems, Mel-frequency cepstral coefficients, Robustness (control systems), Semantics, Signal to noise ratio, Speaker recognition evaluations, Speaker verification, Speaker verification system, Speech recognition, Support vector machine classifiers, Support vector machines, Vectors

Campus : Coimbatore

School : School of Engineering

Center : Computational Engineering and Networking

Department : Electronics and Communication, Mechanical Engineering

Year : 2015

Abstract : pCosine distance similarities with a set of reference speakers, cosine distance features (CDF), with a backend support vector machine classifier (CDF-SVM) have been explored in our earlier studies for improving the performance of speaker verification systems. Subsequently, we also investigated on its effectiveness in improving the noise robustness of speaker verification systems. In this work, we study how the performance of CDF-SVM systems can be further improved by weighting the feature vectors using latent semantic information (LSI) technique. We use mel frequency cepstral coefficients (MFCC), power normalized cepstral coefficients (PNCC), or delta spectral cepstral coefficients (DSCC) for deriving CDF. Experimental results on the female part of short2-short3 trials of NIST speaker recognition evaluation dataset show that the proposed weighted CDF-SVM system outperforms the baseline i-vector with cosine distance scoring (i-CDS), i-vector with a backend SVM classifier (i-SVM) and CDF-SVM systems. Finally, we fused the weighted CDF-SVM with i-CDS and the performance of the combined system was evaluated under different stationary and non-stationary additive noise test conditions. It was seen that the noise robustness of the fused weighted CDF-SVM+i-CDS system is significantly better than the individual systems and the fused CDF-SVM+i-CDS of our earlier work in both clean and noisy test environments except for the zero SNR level condition of certain noises. © 2015 IEEE./p

Cite this Research Publication : Dr. Santhosh Kumar C., George, K. K., Dr. K. I. Ramachandran, and Panda, A., “Weighted cosine distance features for speaker verification”, in 12th IEEE International Conference Electronics, Energy, Environment, Communication, Computer, Control: (E3-C3), INDICON 2015, 2015.

Admissions Apply Now