Publication Type : Journal Article
Publisher : International Journal of Recent Trends in Engineering
Source : International Journal of Recent Trends in Engineering, Volume 1, Issue 1, p.498 (2009)
Campus : Coimbatore
School : School of Engineering
Center : Computational Engineering and Networking
Department : Electronics and Communication
Year : 2009
Abstract : Machine transliteration has gained prime importance as a supporting tool for Machine translation and cross language information retrieval especially when proper names and technical terms are involved. The performance of machine translation and cross-language information retrieval depends extremely on accurate transliteration of named entities. Hence the transliteration model must aim to preserve the phonetic structure of words as closely as possible. In this paper, the transliteration problem is modeled as classification problem and trained using C4.5 decision tree classifier, in WEKA Environment. The training was implemented with features extracted from a parallel corpus. This technique was demonstrated for English to Tamil Transliteration and achieved exact Tamil transliterations for 84.82% of English names. Possible equivalent transliterations were also generated by the model. It is found that the transliteration accuracy is increased when the top five ranked transliterations were considered
Cite this Research Publication : Dr. Soman K. P., MS, V., VP, A., and G, S., “English to Tamil Transliteration using WEKA”, International Journal of Recent Trends in Engineering, vol. 1, no. 1, p. 498, 2009.