Sciweavers

DAS
2008
Springer
13 years 6 months ago
Re-targetable OCR with Intelligent Character Segmentation
Mudit Agrawal, David S. Doermann
DAS
2008
Springer
13 years 6 months ago
Truthing for Pixel-Accurate Segmentation
We discuss problems in developing policies for ground truthing document images for pixel-accurate segmentation. First, we describe ground truthing policies that apply to four diff...
Michael A. Moll, Henry S. Baird, Chang An
DAS
2008
Springer
13 years 6 months ago
Named Entity Recognition by Neural Sliding Window
Named Entity Recognition (NER) is an important subtask of document processing such as Information Extraction. This paper describes a NER algorithm which uses a Multi-Layer Percept...
Ignazio Gallo, Elisabetta Binaghi, Moreno Carullo,...
DAS
2008
Springer
13 years 6 months ago
A Comparison of Clustering Methods for Word Image Indexing
In this paper we explore the effectiveness of three clustering methods used to perform word image indexing. The three methods are: the Self-Organazing Map (SOM), the Growing Hiera...
Simone Marinai, Emanuele Marino, Giovanni Soda
DAS
2008
Springer
13 years 6 months ago
A Document Analysis System for Supporting Electronic Voting Research
As a result of well-publicized security concerns with direct recording electronic (DRE) voting, there is a growing call for systems that employ some form of paper artifact to prov...
Daniel P. Lopresti, George Nagy, Elisa H. Barney S...
DAS
2008
Springer
13 years 6 months ago
An Empirical Measure on the Set of Symbols Occurring in Engineering Mathematics Texts
Certain forms of mathematical expression are used more often than others in practice. A quantitative understanding of actual usage can provide additional information to improve th...
Stephen M. Watt
DAS
2008
Springer
13 years 6 months ago
The Convergence of Iterated Classification
We report an improved methodology for training a sequence of classifiers for document image content extraction, that is, the location and segmentation of regions containing handwr...
Chang An, Henry S. Baird
DAS
2008
Springer
13 years 6 months ago
A Graphics Image Processing System
Patent document images maintained by the U.S. patent database have a specific format, in which figures and text descriptions are separated into different sections. This makes it d...
Linlin Li, Chew Lim Tan