Publication Type : Journal Article
Source : 2011
Campus : Coimbatore
School : Computational Engineering and Networking
Center : Computational Engineering and Networking
Department : Center for Computational Engineering and Networking (CEN)
Year : 2011
Abstract : This paper presents the intricacies involved in developing a hierarchal POS tagger generator using SVMTool for Tamil language. Tamil, a Dravidian language has a very rich morphological structure which is agglutinative. Tamil words are made up of lexical roots followed by one or more affixes, mostly suffixes. So tagging a word in a language like Tamil is very complex. We try to resolve this complexity by identifying the categorical ambiguities and developing three hierarchaltag sets at word grammatical category and grammatical feature level. These tag sets were used to annotate the corpora and trained using the SVMTool (An Open source tool available at http://www.lsi.upc.es/~nlp/SVMTool ) to generate the POS tagger model. The results obtained in each level were encouraging.
Cite this Research Publication : V. Dhanalakshmi and M Kumar, A., “Hierarchal POS tagging for Tamil language using Machine learning approach”, 2011.