CEN-Tamil@DravidianLangTech-ACL2022: Abusive Comment detection in Tamil using TF-IDF and Random Kitchen Sink Algorithm

Publication Type : Conference Paper

Thematic Areas : Center for Computational Engineering and Networking (CEN)

Source : Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages, (Dublin, Ireland, 70 - 74), 2022

Url : https://www.researchgate.net/publication/361054864_CEN-TamilDravidianLangTech-ACL2022_Abusive_Comment_detection_in_Tamil_using_TF-IDF_and_Random_Kitchen_Sink_Algorithm

Campus : Coimbatore

School : School of Artificial Intelligence, School of Artificial Intelligence - Coimbatore, School of Engineering

Department : Center for Computational Engineering and Networking (CEN)

Year : 2022

Abstract : This paper describes the approach used by team CEN-Tamil for abusive comment detection in Tamil. The task is to identify whether a given comment contains abusive content. We created feature vectors using TF-IDF with the char-wb analyzer followed by the Random Kitchen Sink (RKS) algorithm, and classified them with a Support Vector Machine (SVM) using a polynomial kernel. We applied this method to both the Tamil and Tamil-English datasets, securing first place with an F1-score of 0.32 and seventh place with an F1-score of 0.25, respectively. The code for our approach is shared in a GitHub repository.
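The feature-creation step named in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' code: the n-gram size, the number of random components, the `gamma` value, the RBF kernel being approximated, and the placeholder comments are all assumptions chosen for illustration, and every function name is hypothetical. In scikit-learn, the closest counterparts would be `TfidfVectorizer(analyzer="char_wb")`, `RBFSampler`, and `SVC(kernel="poly")`; the classifier itself is not shown here.

```python
import math
import random
from collections import Counter


def char_wb_ngrams(text, n=3):
    """Character n-grams taken inside word boundaries (each word padded with spaces)."""
    grams = []
    for word in text.split():
        padded = f" {word} "
        grams.extend(padded[i:i + n] for i in range(len(padded) - n + 1))
    return grams


def fit_tfidf(docs, n=3):
    """Build the n-gram vocabulary and smoothed inverse document frequencies."""
    tokenized = [char_wb_ngrams(d, n) for d in docs]
    vocab = sorted({g for toks in tokenized for g in toks})
    index = {g: j for j, g in enumerate(vocab)}
    df = Counter(g for toks in tokenized for g in set(toks))
    idf = [math.log((1 + len(docs)) / (1 + df[g])) + 1.0 for g in vocab]
    return index, idf


def tfidf_vector(text, index, idf, n=3):
    """L2-normalised TF-IDF vector; n-grams outside the vocabulary are ignored."""
    vec = [0.0] * len(index)
    for gram, count in Counter(char_wb_ngrams(text, n)).items():
        j = index.get(gram)
        if j is not None:
            vec[j] = count * idf[j]
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else vec


def make_rks(input_dim, n_components=1024, gamma=1.0, seed=0):
    """Random Kitchen Sinks map z(x) = sqrt(2/D) * cos(Wx + b).

    With W ~ N(0, 2*gamma) and b ~ U(0, 2*pi), the dot product z(x).z(y)
    approximates the RBF kernel exp(-gamma * ||x - y||^2).
    """
    rng = random.Random(seed)
    std = math.sqrt(2.0 * gamma)
    weights = [[rng.gauss(0.0, std) for _ in range(input_dim)]
               for _ in range(n_components)]
    offsets = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_components)]
    scale = math.sqrt(2.0 / n_components)

    def transform(x):
        return [scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
                for w, b in zip(weights, offsets)]

    return transform


# Placeholder comments (hypothetical, not taken from the shared-task data).
docs = ["sample comment one", "another sample comment", "a different remark"]
index, idf = fit_tfidf(docs)
tfidf = [tfidf_vector(d, index, idf) for d in docs]
rks = make_rks(input_dim=len(index))
features = [rks(v) for v in tfidf]  # these vectors would be passed to the SVM
```

Each comment ends up as a fixed-length vector of `n_components` values regardless of vocabulary size, which is what makes the randomized map convenient as input to a downstream classifier.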

Cite this Research Publication : Prasanth S N, R Aswin Raj, Adhithan P, Premjith B, Soman Kp, "CEN-Tamil@DravidianLangTech-ACL2022: Abusive Comment detection in Tamil using TF-IDF and Random Kitchen Sink Algorithm", Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages, (Dublin, Ireland, 70 - 74), 2022.
