Sciweavers

DRR
2008
13 years 6 months ago
Interactive degraded document enhancement and ground truth generation
Degraded documents are frequently obtained in various situations. Examples of degraded document collections include historical document depositories, documents obtained in legal an...
G. Bal, Gady Agam, Ophir Frieder, Gideon Frieder
DRR
2008
13 years 6 months ago
Robust line segmentation for handwritten documents
Line segmentation is the first and the most critical pre-processing step for a document recognition/analysis task. Complex handwritten documents with lines running into each other...
Kamal Kuzhinjedathu, Harish Srinivasan, Sargur N. ...
DRR
2008
13 years 6 months ago
Whole-book recognition using mutual-entropy-driven model adaptation
We describe an approach to unsupervised high-accuracy recognition of the textual contents of an entire book using fully automatic mutual-entropy-based model adaptation. Given imag...
Pingping Xiu, Henry S. Baird
DRR
2008
13 years 6 months ago
Word segmentation of off-line handwritten documents
Word segmentation is the most critical pre-processing step for any handwritten document recognition/retrieval system. This paper describes an approach to separate a line of uncons...
Chen Huang, Sargur N. Srihari
DRR
2008
13 years 6 months ago
Efficient implementation of local adaptive thresholding techniques using integral images
Adaptive binarization is an important first step in many document analysis and OCR processes. This paper describes a fast adaptive binarization algorithm that yields the same qual...
Faisal Shafait, Daniel Keysers, Thomas M. Breuel
DRR
2008
13 years 6 months ago
Hybrid approach combining contextual and statistical information for identifying MEDLINE citation terms
There is a strong demand for developing automated tools for extracting pertinent information from the biomedical literature that is a rich, complex, and dramatically growing resou...
In-Cheol Kim, Daniel X. Le, George R. Thoma
DRR
2008
13 years 6 months ago
Segmentation-based retrieval of document images from diverse collections
We describe a methodology for retrieving document images from large, extremely diverse collections. First we perform content extraction, that is, the location and measurement of reg...
Michael A. Moll, Henry S. Baird
DRR
2008
13 years 6 months ago
DRR is a teenager
George Nagy
DRR
2008
13 years 6 months ago
Versatile page numbering analysis
In this paper, we revisit the problem of detecting the page numbers of a document. This work is motivated by a need for a generic method that applies to a large variety of docume...
Hervé Déjean, Jean-Luc Meunier