LSTM based paraphrase identification using combined word embedding features

Publication Type : Conference Paper

Publisher : Springer Verlag

Source : Advances in Intelligent Systems and Computing, Springer Verlag, Volume 898, p.385-394 (2019)

Url : https://www.scopus.com/inward/record.uri?eid=2-s2.0-85062266194&doi=10.1007%2f978-981-13-3393-4_40&partnerID=40&md5=0b28956abaf78fc40a8a10f50a75146d

ISBN : 9789811333927

Keywords : Corpus, Deep learning, Embeddings, Fast-text, Glove, Long short-term memory, Natural language processing systems, Newsprint, Paraphrase identifications, RNN-LSTM, Semantics, Signal processing, Soft computing

Campus : Coimbatore

School : School of Engineering

Center : Computational Engineering and Networking

Department : Electronics and Communication

Year : 2019

Abstract : Paraphrase identification is the process of analyzing two text entities (sentences) and determining whether the two entities represent the similar sense or not. This is a task of Natural Language Processing (NLP) in which we need to identify the sentences whether it is a paraphrase or not. Here, the chosen approach for this task is a deep Learning model that is Recurrent Neural Network-LSTM with word embedding features. Word embedding is an approach, from where we can extract the semantics of the word in dense vector representation. The word embedding models that are used for the feature extraction in Telugu are Word2Vec, Glove and Fasttext. These extracted feature models are added in the embedding layer of Long Short-Term Memory algorithm in order to classify the Telugu sentence pairs whether they are Paraphrase or not. The corpus for Telugu is generated manually from various Telugu newspapers. The sentences for word embedding model is also gathered from Telugu newspapers. This is the first attempt for paraphrase identification in Telugu using deep learning approach. © Springer Nature Singapore Pte Ltd. 2019.

Cite this Research Publication : A. D. Reddy, M. Kumar, A., Dr. Soman K. P., G.R., M. Reddy, V.S., R., and V.K., P., “LSTM based paraphrase identification using combined word embedding features”, in Advances in Intelligent Systems and Computing, 2019, vol. 898, pp. 385-394.

About Amrita Vishwa Vidyapeetham

Rankings

Accreditation

Governance

Chancellor

Leadership

Press Media

Newsletters

Amritapuri
Campus

Amaravati
Campus

Bengaluru
Campus

Chennai
Campus

Coimbatore
Campus

Faridabad
Campus

Kochi
Campus

Mysuru
Campus

Nagercoil
Campus

Haridwar
(Proposed Campus)

Research

Centers

Patents

Publication