Sciweavers

DAS
2008
Springer
14 years 10 months ago
Re-targetable OCR with Intelligent Character Segmentation
Mudit Agrawal, David S. Doermann
Truthing for Pixel-Accurate Segmentation
We discuss problems in developing policies for ground truthing document images for pixel-accurate segmentation. First, we describe ground truthing policies that apply to four diff...
Michael A. Moll, Henry S. Baird, Chang An
Named Entity Recognition by Neural Sliding Window
Named Entity Recognition (NER) is an important subtask of document processing such as Information Extraction. This paper describes a NER algorithm which uses a Multi-Layer Percept...
Ignazio Gallo, Elisabetta Binaghi, Moreno Carullo,...
A Comparison of Clustering Methods for Word Image Indexing
In this paper we explore the effectiveness of three clustering methods used to perform word image indexing. The three methods are: the Self-Organizing Map (SOM), the Growing Hiera...
Simone Marinai, Emanuele Marino, Giovanni Soda
A Document Analysis System for Supporting Electronic Voting Research
As a result of well-publicized security concerns with direct recording electronic (DRE) voting, there is a growing call for systems that employ some form of paper artifact to prov...
Daniel P. Lopresti, George Nagy, Elisa H. Barney S...
An Empirical Measure on the Set of Symbols Occurring in Engineering Mathematics Texts
Certain forms of mathematical expression are used more often than others in practice. A quantitative understanding of actual usage can provide additional information to improve th...
Stephen M. Watt
The Convergence of Iterated Classification
We report an improved methodology for training a sequence of classifiers for document image content extraction, that is, the location and segmentation of regions containing handwr...
Chang An, Henry S. Baird
A Graphics Image Processing System
Patent document images maintained by the U.S. patent database have a specific format, in which figures and text descriptions are separated into different sections. This makes it d...
Linlin Li, Chew Lim Tan