Sciweavers

149 search results - page 2 / 30
» A Layout-Free Method for Extracting Elements from Document I...
Sort
View
WWW
2005
ACM
14 years 5 months ago
Using visual cues for extraction of tabular data from arbitrary HTML documents
We describe a method to extract tabular data from web pages. Rather than just analyzing the DOM tree, we also exploit visual cues in the rendered version of the document to extrac...
Bernhard Krüpl, Marcus Herzog, Wolfgang Gatte...
ICPR
2010
IEEE
13 years 3 months ago
The PAGE (Page Analysis and Ground-Truth Elements) Format Framework
There is a plethora of established and proposed document representation formats but none that can adequately support individual stages within an entire sequence of document image ...
Stefan Pletschacher, Apostolos Antonacopoulos
DAS
2008
Springer
13 years 7 months ago
Accuracy Improvement and Objective Evaluation of Annotation Extraction from Printed Documents
There is an approach of annotation extraction from printed documents in which annotations are extracted by comparing the image of an annotated document and its original document i...
Tomohiro Nakai, Kazumasa Iwata, Koichi Kise
ICDAR
2003
IEEE
13 years 10 months ago
Reference Line Extraction from Form Documents with Complicated Backgrounds
Form document analysis is one of the most essential tasks in document analysis and recognition. One of the most fundamental and crucial tasks is the extraction of the reference li...
Dihua Xi, Seong-Whan Lee
ICDAR
2003
IEEE
13 years 10 months ago
Proper Names Extraction from Fax Images Combining Textual and Image Features
In the frame of a Unified Messaging System, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object...
Laurence Likforman-Sulem, Pascal Vaillant, Fran&cc...