Publication Type : Conference Paper
Publisher : IEEE
Source : 2019 International Conference on Intelligent Computing and Control Systems (ICCS) Pages 1094-1097
Url : https://ieeexplore.ieee.org/document/9065370
Campus : Amritapuri
School : School of Engineering
Department : Electronics and Communication
Year : 2019
Abstract : The paper outlines a language identification system from speech with the help of deep learning. In a multilingual country like India, language identification systems have a wide variety of applications. For this, a custom speech database of seven Indian languages from IIIT-Indic has been taken. Preprocessing is done in order to avoid the noise interference and the disturbances in the audio signal. After subsequent framing and windowing, the spectrograms are extracted and fed to a Convolutional Neural Network (CNN). Utterances in languages: Bengali, Hindi, Marathi, Malayalam, Kannada, Tamil and Telugu are utilized in this study. A mixture of speech utterances from binary combination of these languages is used in this analysis. The performance of the system is measured in the form of accuracy, precision, recall, F1 score and confusion matrix. For the data set analyzed, a maximum accuracy of 99.5% is achieved for Bengali vs Kannada and 99% for Kannada vs Malayalam.
Cite this Research Publication : A. N..P and P. S.S., "Deep Learning Based Language Identification System From Speech," 2019 International Conference on Intelligent Computing and Control Systems (ICCS), Madurai, India, 2019, pp. 1094-1097, doi: 10.1109/ICCS45141.2019.9065370.