Publication Type : Journal Article
Publisher : International Journal of Computer Applications
Source : International Journal of Computer Applications , Volume 34, Issue 8, p.0975-8887 (2011)
Campus : Coimbatore
School : School of Engineering
Center : Computational Engineering and Networking
Department : Computer Science, Electronics and Communication
Year : 2011
Abstract : Part of speech (POS) tagging is the process of assigning the part of speech tag or other lexical class marker to each and every word in a sentence. In many Natural Language Processing applications such as word sense disambiguation, information retrieval, information processing, parsing, question answering, and machine translation, POS tagging is considered as the one of the basic necessary tool. Identifying the ambiguities in language lexical items is the challenging objective in the process of developing an efficient and accurate POS Tagger. Literature survey shows that, for Indian languages, POS taggers were developed only in Hindi, Bengali, Panjabi and Dravidian languages. Some POS taggers were also developed generic to the Hindi, Bengali and Telugu languages. All proposed POS taggers were based on different Tagset, developed by different organization and individuals. This paper addresses the various developments in POS-taggers and POS-tagset for Indian language, which is very essential computational linguistic tool needed for many natural language processing (NLP) applications
Cite this Research Publication : Dr. Soman K. P. and J, A. P., “Parts Of Speech Tagging for Indian Languages: A Literature Survey”, International Journal of Computer Applications , vol. 34, no. 8, pp. 0975-8887, 2011.