Back close

Novel approach for detecting applause in continuous meeting speech

Publication Type : Conference Paper

Publisher : ICECT 2011 - 2011 3rd International Conference on Electronics Computer Technology

Source : ICECT 2011 - 2011 3rd International Conference on Electronics Computer Technology, Volume 3, Kanyakumari, p.182-186 (2011)

Url : http://www.scopus.com/inward/record.url?eid=2-s2.0-79961219782&partnerID=40&md5=1bc45ab4b82c0931f46c487b036382f2

ISBN : 9781424486779

Keywords : Acoustic features, Algorithms, Audio acoustics, Audio classification, Audio content analysis, Audio signal, Audio stream, Autocorrelation, Autocorrelation features, Conventional methods, Correlation detectors, Decision threshold, Detection algorithm, Energy decay, Feature vectors, Gaussian Mixture Model, Local minimums, Mel-frequency cepstral coefficients, Multimedia database, Natural language processing systems, Precision rates, Recall rate, Semantics, Speech recognition, sports highlight extraction, Video summarization, Zero crossing points

Campus : Coimbatore

School : School of Engineering

Department : Electronics and Communication

Year : 2011

Abstract : This paper proposes a robust and automated applause detection algorithm for meeting speech. The features used in the proposed algorithm are the short-time autocorrelation features such as autocorrelation energy decay factor, amplitude and lag values of first local minimum and zero-crossing points extracted from the autocorrelation sequence of a windowed audio signal. We apply decision thresholds for the above acoustic features to identify applause and non-applause segments from the audio stream. The performance of the proposed algorithm is compared with the conventional method using mel frequency cepstral coefficients (MFCC) feature vectors and Gaussian mixture model (GMM) as classifier. We have also analyzed the performance of these algorithms by varying the number of mixtures in GMM (2, 4, 8, 16 and 32) and various thresholds in the proposed method. The methods are tested with a multimedia database of 4 hours 37 minutes of meeting speech and the results are compared. The precision rate, recall rate and F1 score of the proposed method are 94.40%, 90.75% and 92.54% respectively while those of conventional method are 67.47%, 96.13% and 79.29% respectively. © 2011 IEEE.

Cite this Research Publication : C. Manoj, Magesh, S., Sankaran, A. S., and Manikandan, M. S., “Novel approach for detecting applause in continuous meeting speech”, in ICECT 2011 - 2011 3rd International Conference on Electronics Computer Technology, Kanyakumari, 2011, vol. 3, pp. 182-186.

Admissions Apply Now