Publication Type : Conference Proceedings
Thematic Areas : Center for Computational Engineering and Networking (CEN)
Source : In book: Recent Advances in Computational Intelligence (pp.341-370), 2020
Campus : Coimbatore
Verified : No
Year : 2019
Abstract : Preposition sense disambiguation has huge significance in Natural language processing tasks such as Machine Translation. Transferring the various senses of a simple preposition in source language to a set of senses in target language has high complexity due to these many-to-many relationships, particularly in English-Malayalam machine translation. In order to reduce this complexity in the transfer of senses, in this paper, we used linguistic information such as noun class features and verb class features of the respective noun and verb correlated to the target simple preposition. The effect of these linguistic features for the proper classification of the senses (postposition in Malayalam) is studied with the help of several machine learning algorithms. The study showed that, the classification accuracy is higher when both verb and noun class features are taken into consideration. In linguistics, the major factor that decides the sense of the preposition is the noun in the prepositional phrase. The same trend was observed in the study when the training data contained only noun class features. i.e., noun class features dominates the verb class features.
Cite this Research Publication : Premjith B., Soman Kp, M. Anand Kumar, Jyothi Ratnam "Embedding Linguistic Features in Word Embedding for Preposition Sense Disambiguation in English-Malayalam Machine Translation Context", In book: Recent Advances in Computational Intelligence (pp.341-370), 2020