Sciweavers

156
Voted
ICDAR
1999
IEEE
15 years 7 months ago
Models and Algorithms for Duplicate Document Detection
This paper introduces a framework for clarifying and formalizing the duplicate document detection problem. Four distinct models are presented, each with a corresponding algorithm ...
Daniel P. Lopresti
98
Voted
ICDAR
1999
IEEE
15 years 7 months ago
Automatic Storage, Retrieval, and Visualization of Bank Check Images
This paper presents an automated system for storage and retrieval of bank checks in contrast with the microfilming techniques that are currently used. The bank check images are in...
Alessandro L. Koerich, Luan Ling Lee
124
Voted
ICDAR
1999
IEEE
15 years 7 months ago
MergeLayouts: Overcoming Faulty Segmentations by a Comprehensive Voting of Commercial OCR Devices
In this paper, we will present a comprehensive voting approach, taking entire layouts obtained from commercial OCR devices as input. Such a layout comprises segments of three kind...
Stefan Klink, Thorsten Jäger
96
Voted
ICDAR
1999
IEEE
15 years 7 months ago
EXTRAFOR: Automatic EXTRAction of Mathematical FORmulas
Afef Kacem, Abdel Belaïd, Mohamed Ben Ahmed
102
Voted
ICDAR
1999
IEEE
15 years 7 months ago
On the Evaluation of Document Analysis Components by Recall, Precision, and Accuracy
In document analysis, it is common to prove the usefulness of a component by an experimental evaluation. By applying the respective algorithms to a test sample, some effectiveness...
Markus Junker, Andreas Dengel, Rainer Hoch
121
Voted
ICDAR
1999
IEEE
15 years 7 months ago
Multifont Classification using Typographical Attributes
This paper introduces a multifont classification scheme to help recognition of multifont and multisize characters. It uses typographical attributes such as ascenders, descenders a...
Min-Chul Jung, Yong-Chul Shin, Sargur N. Srihari
136
Voted
ICDAR
1999
IEEE
15 years 7 months ago
Segmenting Documents using Multiple Lexical Features
A method is presented for segmenting documents into conceptually related areas. Determining the equivalence of text is often based on the number of word repetitions. This approach...
Amanda C. Jobbins, Lindsay J. Evett
150
Voted
ICDAR
1999
IEEE
15 years 7 months ago
Document Image Layout Comparison and Classification
This paper describes features and methods for document image comparison and classification at the spatial layout level. The methods are useful for visual similarity based document...
Jianying Hu, Ramanujan S. Kashi, Gordon T. Wilfong
80
Voted
ICDAR
1999
IEEE
15 years 7 months ago
Influence of Word Length on Handwriting Recognition
F. Grandidier, Robert Sabourin, Mounim A. El-Yacou...
124
Voted
ICDAR
1999
IEEE
15 years 7 months ago
Cursive Character Detection using Incremental Learning
This paper describes a new hybrid architecture for an artificial neural network classifier that enables incremental learning. The learning algorithm of the proposed architecture d...
Jean-François Hébert, Marc Parizeau,...