Back close

Telugu Dialect Speech Dataset Creation and Recognition using Deep Learning Techniques

Publication Type : Conference Paper

Publisher : IEEE

Source : In 2022 IEEE 19th India Council International Conference (INDICON) (pp. 1-6). IEEE

Url : https://ieeexplore.ieee.org/document/10040194

Campus : Bengaluru

School : School of Engineering

Department : Electronics and Communication

Year : 2022

Abstract : According to India’s 2011 demography, there seem to be approximately 8 crore Telugu communicators. Apart from that, the Telugu language has many dialects spread across the states Telangana and Andhra Pradesh. Telangana, Rayalaseema, and Coastal accents are the most common. The main concern is to understand the language irrespective of the dialects to have good communication near border areas of these states. Availability of data for analysis of Telugu speech dialects is of high scope for recognition. So, the creation of data is done for Telugu dialects with a total of 9 speakers, 3 speakers for each dialect. Once the data is created, analysis and recognition can help direct our needs. Classifying dialects cannot only solve this problem but also can act as a subset for solving bigger problems like machine translation, sentiment analysis, etc. We have used four RNN models viz. LSTM, GRU, BiLSTM & BiLSTM with attention layer for classification using speech data as input. Maximum test accuracy of 99.11% was obtained using the BiLSTM model with attention layer.

Cite this Research Publication : Podila, R. S. A., Kommula, G. S. S., Ruthvik, K., Vekkot, S., & Gupta, D. (2022, November). Telugu Dialect Speech Dataset Creation and Recognition using Deep Learning Techniques. In 2022 IEEE 19th India Council International Conference (INDICON) (pp. 1-6). IEEE

Admissions Apply Now