Publication Type : Conference Paper
Publisher : Springer
Source : In 3rd EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing (pp. 239-248). Springer, Cham
Url : https://link.springer.com/chapter/10.1007/978-3-030-78750-9_17
Campus : Chennai
School : School of Engineering
Department : Computer Science and Engineering
Year : 2022
Abstract : Determining the similarity between sentences is a central task in natural language processing and an important research area in today’s text analytics applications. The meaning of a sentence varies with the textual context in which it is used, and as a result a lot of research has been done on determining semantic likeness in text. The task remains difficult: for example, a word can have many possible meanings (polysemy) as well as synonyms, and existing techniques also ignore English stop words, which are critical for English phrase/word division, speech investigation, and meaningful comprehension. Our proposed work uses a Term Frequency-Inverse Document Frequency (TF-IDF) model and GloVe word-embedding vectors to determine the semantic similarity among the terms in the textual content. A lemmatizer is used to reduce the terms to their smallest possible lemmas. The results demonstrate that the proposed methodology ranks terms with respect to the search query terms better than the TF-IDF score does. The Pearson correlation coefficient achieved for the semantic similarity model is 0.875.
Cite this Research Publication :
Karthiga, M., Sountharrajan, S., Bazila Banu, A., Sankarananth, S., Suganya, E. and Sathish Kumar, B., 2022. Similarity Analytics for Semantic Text Using Natural Language Processing. In 3rd EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing (pp. 239-248). Springer, Cham.