Publication Type : Journal Article
Publisher : Volume 2
Source : Volume 2 (2011)
Campus : Coimbatore
School : School of Engineering
Center : Computational Engineering and Networking
Department : Electronics and Communication
Year : 2011
Abstract : In this paper, a recently proposed word alignment algorithm is simplified for easy understanding and tested for an Indian language. The word alignment problem is viewed as a simple assignment problem and is formulated as an Integer Linear Programming problem. The newobjective function defined is tested for obtaining optimal alignment for English-Tamil translation pair. This alignment is necessary forcreating the probabilistic bilingual dictionary and is also required for automatic machine translation. We have used this objective unction to align words in 25 sentences of English-Tamil parallel corpora. The formulation is solved using the open source LP-Solver. Result obtained indicates that the methodology is applicable for all Indian languages. The work implemented is useful for pedagogical purposes, as it is a standard problem in computational linguistics. Accuracy of modern statistical machine translation depends on good word alignment. The document of the formulated model is available on request.
Cite this Research Publication : Harshawardhan, Augustine, M., and Dr. Soman K. P., “A Simplified Approach to Word Alignment Algorithm for English-Tamil Translation”, vol. 2, 2011.