Back close

AMRITA_CEN-NLP@SAIL2015: Sentiment Analysis in Indian Language Using Regularized Least Square Approach with Randomized Feature Learning

Publication Type : Conference Paper

Thematic Areas : Center for Computational Engineering and Networking (CEN)

Source : International Conference on Mining Intelligence and Knowledge Exploration, 2015

Url : https://www.researchgate.net/publication/311640429_AMRITA_CEN-NLPSAIL2015_Sentiment_Analysis_in_Indian_Language_Using_Regularized_Least_Square_Approach_with_Randomized_Feature_Learning

Campus : Coimbatore

School : School of Artificial Intelligence, School of Artificial Intelligence - Coimbatore

Department : Center for Computational Engineering and Networking (CEN)

Verified : No

Year : 2015

Abstract : The present work is done as part of shared task in Sentiment Analysis in Indian Languages (SAIL 2015), under constrained category. The task is to classify the twitter data into three polarity categories such as positive, negative and neutral. For training, twitter dataset under three languages were provided Hindi, Bengali and Tamil. In this shared task, ours is the only team who participated in all the three languages. Each dataset contained three separate categories of twitter data namely positive, negative and neutral. The proposed method used binary features, statistical features generated from SentiWordNet, and word presence (binary feature). Due to the sparse nature of the generated features, the input features were mapped to a random Fourier feature space to get a separation and performed a linear classification using regularized least square method. The proposed method identified more negative tweets in the test data provided Hindi and Bengali language. In test tweet for Tamil language, positive tweets were identified more than other two polarity categories. Due to the lack of language specific features and sentiment oriented features, the tweets under neutral were less identified and also caused misclassifications in all the three polarity categories. This motivates to take forward our research in this area with the proposed method.

Cite this Research Publication : Sachin Kumar, S., Premjith, B., Anand Kumar, M., Soman, K.P., AMRITA_CEN-NLP@SAIL2015: Sentiment analysis in indian language using regularized least square approach with randomized feature learning, (2015) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9468, pp. 671-683., DOI: 10.1007/978-3-319-26832-3_64

Admissions Apply Now