Publication Type : Journal Article
Publisher : IEEE Access
Source : IEEE Access, vol. 8, pp. 118050-118071, 2020, (IF: 3.745)
Url : https://ieeexplore.ieee.org/document/9126775?denied=
Campus : Bengaluru
School : Department of Computer Science and Engineering
Department : Computer Science
Year : 2020
Abstract : Sentiment lexicon learning is of paramount importance in sentiment analysis. One of the most considerable challenges in learning sentiment lexicons is their domain-specific behavior. Transferring knowledge acquired from a sentiment lexicon from one domain to another is an open research problem. In this study, we attempt to address this challenge by presenting a transfer learning approach that creates new learning insights for multiple domains of the same genre. We propose an unsupervised sentiment lexicon learning methodology scalable to new domains of the same genre. Incremental learning and the methodology learn polarity seed words from corpora of multiple automatically selected source domains. This process then transfers its genre-level knowledge of corpus-learned seed words to the target domains. The corpus-learned seed words are used for sentiment lexicon generation for multiple target domains of the same genre. The sentiment lexicon learning process is based on the latent semantic analysis technique and uses unlabeled training data from the source and target domains. The experiment was performed using 24 domains of the same genre, i.e., consumer product review. The proposed model displays the best results using standard evaluation measures compared with the competitive baselines. The proposed genre-based unsupervised approach achieves a maximum accuracy of 86% and outperforms methods recently presented in the literature.
Cite this Research Publication : S. Sanagar and D. Gupta, (2020), "Unsupervised Genre-Based Multidomain Sentiment Lexicon Learning Using Corpus-Generated Polarity Seed Words”, IEEE Access, vol. 8, pp. 118050-118071, 2020, (IF: 3.745)