Sciweavers

ICDAR
2011
IEEE
12 years 3 months ago
Chinese Keyword Spotting Using Knowledge-Based Clustering
Content-based document image retrieval is a new and promising research area. Without OCR, document indexing directly based on image content is more general and convenient. Howev...
Yong Xia, Kuanquan Wang, Mingwei Li
ICDAR
2011
IEEE
12 years 3 months ago
OCR-Driven Writer Identification and Adaptation in an HMM Handwriting Recognition System
We present an OCR-driven writer identification algorithm in this paper. Our algorithm learns writer-specific characteristics more precisely from explicit character alignment usi...
Huaigu Cao, Rohit Prasad, Prem Natarajan
ICDAR
2011
IEEE
12 years 3 months ago
Tuning between Exponential Functions and Zones for Membership Functions Selection in Voronoi-Based Zoning for Handwritten Charac
In Handwritten Character Recognition, zoning is rightly considered one of the most effective feature extraction techniques. In the past, many zoning methods have been propose...
Sebastiano Impedovo, Giuseppe Pirlo
ICDAR
2011
IEEE
12 years 3 months ago
Script-Free Text Line Segmentation Using Interline Space Model for Printed Document Images
This paper proposes a model-based text line segmentation algorithm for machine-printed document images. The model is based on geometric configuration which uses the interline sp...
Minwoo Kim, Il-Seok Oh
ICDAR
2011
IEEE
12 years 3 months ago
On-line Handwritten Japanese Characters Recognition Using a MRF Model with Parameter Optimization by CRF
This paper describes a Markov random field (MRF) model with weighting parameters optimized by a conditional random field (CRF) for on-line recognition of handwritten Japanese cha...
Bilan Zhu, Masaki Nakagawa
ICDAR
2011
IEEE
12 years 3 months ago
Browsing Heterogeneous Document Collections by a Segmentation-Free Word Spotting Method
In this paper, we present a segmentation-free word spotting method that is able to deal with heterogeneous document image collections. We propose a patch-based framework where p...
Marçal Rusiñol, David Aldavert, Rica...
ICDAR
2011
IEEE
12 years 3 months ago
BLSTM Neural Network Based Word Retrieval for Hindi Documents
Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition,...
Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghav...
ICDAR
2011
IEEE
12 years 3 months ago
Character Recognition Based on DTW-Radon
The paper presents a method for isolated offline character recognition using Radon features. The key characteristic of the method is to use the DTW algorithm to match corres...
K. C. Santosh
ICDAR
2011
IEEE
12 years 3 months ago
A Method for Removing Inflectional Suffixes in Word Spotting of Mongolian Kanjur
According to the characteristics of Mongolian word formation, a method for removing inflectional suffixes from word images of the Mongolian Kanjur is proposed in this paper. ...
Hongxi Wei, Guanglai Gao, Yulai Bao
DRR
2011
12 years 3 months ago
Improved document image segmentation algorithm using multiresolution morphology
Page segmentation into text and non-text components is an essential preprocessing step before OCR operation. If this is not done properly, an OCR classification engine produces g...
Syed Saqib Bukhari, Faisal Shafait, Thomas M. Breu...