Sciweavers

3180 search results - page 80 / 636
» Knowledge-based Document Analysis
Sort
View
136
Voted
LAWEB
2006
IEEE
15 years 9 months ago
Analysis of Web Search Engine Clicked Documents
In this paper we process and analyze web search engine query and click data from the perspective of the documents (URL’s) selected. We initially define possible document categor...
David F. Nettleton, Liliana Calderón-Benavi...
128
Voted
ICDAR
2007
IEEE
15 years 7 months ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
130
Voted
ICDAR
2009
IEEE
15 years 1 months ago
Learning on the Fly: Font-Free Approaches to Difficult OCR Problems
Despite ubiquitous claims that optical character recognition (OCR) is a "solved problem," many categories of documents continue to break modern OCR software such as docu...
Andrew Kae, Erik G. Learned-Miller
134
Voted
DAS
2004
Springer
15 years 9 months ago
An Integrated Approach for Automatic Semantic Structure Extraction in Document Images
In this paper we present an integrated approach for semantic structure extraction in document images. Document images are initially processed to extract both their layout and logic...
Margherita Berardi, Michele Lapi, Donato Malerba
131
Voted
ICDAR
2003
IEEE
15 years 8 months ago
Numerical Sequence Extraction in Handwritten Incoming Mail Documents
In this communication, we propose a method for the automatic extraction of numerical fields in handwritten documents. The approach exploits the known syntactic structure of the nu...
Guillaume Koch, Laurent Heutte, Thierry Paquet