Tamil POS tagging using Linear Programming

Publisher : International Journal of Recent Trends in Engineering

Campus : Coimbatore

Center : Computational Engineering and Networking

Year : 2009

Abstract : Part-of-speech (POS) tagging is the process of assigning a syntactic category to each word in a corpus. This paper presents an SVM methodology based on linear programming for implementing an automatic Tamil POS tagger. We designed our own tagset of 32 tags for preparing the annotated Tamil corpus. Features are extracted from a corpus of 25,000 sentences and used to train a linear-programming-based SVM. This method, when tested with 10,000 sentences, gave an ...
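
To illustrate what a linear-programming-based SVM can look like, here is a minimal sketch of one common formulation, the 1-norm soft-margin SVM, solved with SciPy's `linprog`. The abstract does not give the paper's exact formulation, features, or parameters, so this sketch rests on assumptions: the function names `train_lp_svm` and `predict`, the penalty `C`, and the toy two-dimensional data are hypothetical and are not taken from the paper.

```python
import numpy as np
from scipy.optimize import linprog


def train_lp_svm(X, y, C=1.0):
    """Fit a binary linear 1-norm SVM by solving a linear program.

    Assumed formulation (not necessarily the paper's):
        minimize    sum(u) + C * sum(xi)
        subject to  y_i * (w . x_i + b) >= 1 - xi_i
                    -u <= w <= u,  xi >= 0
    Variables are stacked as z = [w (d), u (d), b (1), xi (n)].
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, d = X.shape

    # Objective coefficients: 0 for w, 1 for u, 0 for b, C for each slack.
    c = np.concatenate([np.zeros(d), np.ones(d), [0.0], C * np.ones(n)])

    # Margin constraints rewritten as: -y_i (w . x_i + b) - xi_i <= -1
    margin = np.hstack(
        [-y[:, None] * X, np.zeros((n, d)), -y[:, None], -np.eye(n)]
    )
    # Absolute-value constraints: w - u <= 0 and -w - u <= 0
    upper = np.hstack([np.eye(d), -np.eye(d), np.zeros((d, 1)), np.zeros((d, n))])
    lower = np.hstack([-np.eye(d), -np.eye(d), np.zeros((d, 1)), np.zeros((d, n))])

    A_ub = np.vstack([margin, upper, lower])
    b_ub = np.concatenate([-np.ones(n), np.zeros(2 * d)])

    # w and b are free; u and xi are non-negative.
    bounds = [(None, None)] * d + [(0, None)] * d + [(None, None)] + [(0, None)] * n

    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    return res.x[:d], res.x[2 * d]


def predict(X, w, b):
    """Return +1 or -1 for each row of X."""
    return np.where(np.asarray(X, dtype=float) @ w + b >= 0, 1, -1)


# Toy, linearly separable data (hypothetical; not Tamil corpus features).
X_toy = [[2, 2], [3, 3], [2, 3], [-2, -2], [-3, -3], [-2, -3]]
y_toy = [1, 1, 1, -1, -1, -1]
w, b = train_lp_svm(X_toy, y_toy, C=10.0)
preds = predict(X_toy, w, b).tolist()
```

This sketch is binary. A tagger over a 32-tag tagset would need a multiclass scheme on top of it, for example one classifier per tag in a one-vs-rest arrangement, which is likewise an assumption here and not something the abstract states.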

