Sciweavers

15 search results - page 3 / 3
» Script Identification from Indian Documents
Sort
View
ICPR
2008
IEEE
13 years 11 months ago
Word-wise Sinhala Tamil and English script identification using Gaussian kernel SVM
There are many documents in Srilanka where a single document page may contain Sinhala, Tamil and English texts. For OCR development of such a document page, it is better to identi...
Sukalpa Chanda, Srikanta Pal, Umapada Pal
DAS
2006
Springer
13 years 9 months ago
Bangla/English Script Identification Based on Analysis of Connected Component Profiles
Script identification is required for a multilingual OCR system. In this paper, we present a novel and efficient technique for Bangla/English script identification with application...
Lijun Zhou, Yue Lu, Chew Lim Tan
ICDAR
2009
IEEE
13 years 3 months ago
Off-Line Multi-Script Writer Identification Using AR Coefficients
The problem of writer identification in a multiscript environment is attempted using a twodimensional (2D) autoregressive (AR) modelling technique. Each writer is represented by a...
Utpal Garain, Thierry Paquet
DAS
2010
Springer
13 years 9 months ago
A post-processing scheme for malayalam using statistical sub-character language models
Most of the Indian scripts do not have any robust commercial OCRs. Many of the laboratory prototypes report reasonable results at recognition/classification stage. However, word ...
Karthika Mohan, C. V. Jawahar
ICDAR
2011
IEEE
12 years 4 months ago
BLSTM Neural Network Based Word Retrieval for Hindi Documents
—Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition,...
Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghav...