Publication Type : Journal Article
Thematic Areas : Center for Computational Engineering and Networking (CEN)
Publisher : Indian Society for Education and Environment
Source : Indian Journal of Science and Technology, Indian Society for Education and Environment, Volume 8, Issue 24, Number 24 (2015)
Campus : Coimbatore
School : School of Engineering
Center : Computational Engineering and Networking
Department : Electronics and Communication
Year : 2015
Abstract : In this Information age, all sources of information like historic documents, books, manuscripts are digitized and are available all over the world through internet in the form of scanned copies. These scanned images contain valuable information which are available either in colour or black and white for pleasant viewing. Optical Character Recognition (OCR) technology provides facility to search for keywords in these digital copies. In this paper, new method in which building an OCR system for Telugu language script; mainly focussing on the character recognition module. Features extracted through Discrete Wavelet Transform (DWT), Projection Profile (PP) and Singular Value Decomposition (SVD) is evaluated using k-Nearest Neighbour (k-NN) and Support Vector Machine (SVM) classifiers. Most productive results are obtained from the DWT features with SVM classifiers.
Cite this Research Publication : J. Jyothi, Manjusha, K., M. Kumar, A., and Dr. Soman K. P., “Innovative feature sets for machine learning based Telugu character recognition”, Indian Journal of Science and Technology, vol. 8, no. 24, 2015.