Publication Type : Conference Paper
Publisher : IEEE
Source : In 2023 2nd International Conference for Innovation in Technology (INOCON), pp. 1-8. IEEE, 2023.
Url : https://ieeexplore.ieee.org/document/10101157
Campus : Coimbatore
School : School of Computing
Year : 2023
Abstract : On social media networks like Twitter, Facebook, and Tumblr, people frequently share information. However, these platforms are also notorious for the spread of hate speech and insults, often posted anonymously. Hate speech involves using violent, abusive, or aggressive language towards a particular group based on factors such as gender, race, religion, or region. The prevalence of hate speech on these websites is a major concern, and manually detecting it can be time-consuming. To address this issue, this study presents an automated hate speech detection model that is evaluated on a publicly available Twitter dataset. The proposed method emphasizes data pre-processing, including stemming, term frequency-inverse document frequency (TF-IDF) for feature extraction, and various sampling techniques (random sampler, synthetic minority over-sampling technique (SMOTE), and ALL-KNN) to balance an imbalanced dataset. The logistic regression, support vector machine (SVM), and k-nearest neighbor (k-NN) machine learning classifiers were trained and tested using hold-out cross-validation to reduce overfitting and evaluate performance. The performance was evaluated using metrics such as accuracy, precision, and confusion matrix. The results showed that the logistic regression classifier using the SMOTE approach had the best performance, with an accuracy of 82%, a macro average of precision, recall, and an F1-score of 80%, 82%, and 79%, respectively.
Cite this Research Publication : Reddy, B. Ajay Chandrasekhar, Girish Kumar Chandra, Dilip Singh Sisodia, and Arti Anuragi. " Balancing Techniques for Improving Automated Detection of Hate Speech and Offensive Language on Social Media. " In 2023 2nd International Conference for Innovation in Technology (INOCON), pp. 1-8. IEEE, 2023.