Sciweavers

15 search results - page 3 / 3
» Script Identification from Indian Documents
ICPR
2008
IEEE
15 years 6 months ago
Word-wise Sinhala Tamil and English script identification using Gaussian kernel SVM
There are many documents in Sri Lanka where a single document page may contain Sinhala, Tamil and English texts. For OCR development of such a document page, it is better to identi...
Sukalpa Chanda, Srikanta Pal, Umapada Pal
DAS
2006
Springer
15 years 3 months ago
Bangla/English Script Identification Based on Analysis of Connected Component Profiles
Script identification is required for a multilingual OCR system. In this paper, we present a novel and efficient technique for Bangla/English script identification with application...
Lijun Zhou, Yue Lu, Chew Lim Tan
ICDAR
2009
IEEE
14 years 9 months ago
Off-Line Multi-Script Writer Identification Using AR Coefficients
The problem of writer identification in a multiscript environment is attempted using a two-dimensional (2D) autoregressive (AR) modelling technique. Each writer is represented by a...
Utpal Garain, Thierry Paquet
DAS
2010
Springer
15 years 3 months ago
A post-processing scheme for Malayalam using statistical sub-character language models
Most of the Indian scripts do not have any robust commercial OCRs. Many of the laboratory prototypes report reasonable results at recognition/classification stage. However, word ...
Karthika Mohan, C. V. Jawahar
ICDAR
2011
IEEE
13 years 11 months ago
BLSTM Neural Network Based Word Retrieval for Hindi Documents
Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition,...
Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghav...