Sciweavers

ICDAR
2003
IEEE

A Bilingual OCR for Hindi-Telugu Documents and its Applications

13 years 9 months ago
A Bilingual OCR for Hindi-Telugu Documents and its Applications
This paper describes the character recognition process from printed documents containing Hindi and Telugu text. Hindi and Telugu are among the most popular languages in India. The bilingual recognizer is based on Principal Component Analysis followed by support vector classification. This attains an overall accuracy of approximately 96.7%. Extensive experimentation is carried out on an independent test set of approximately 200000 characters. Applications based on this OCR are sketched.
C. V. Jawahar, M. N. S. S. K. Pavan Kumar, S. S. R
Added 04 Jul 2010
Updated 04 Jul 2010
Type Conference
Year 2003
Where ICDAR
Authors C. V. Jawahar, M. N. S. S. K. Pavan Kumar, S. S. Ravi Kiran
Comments (0)