Sciweavers

373 search results - page 2 / 75
» Correcting the Document Layout: A Machine Learning Approach
RIAO
2007
13 years 6 months ago
From Layout to Semantic: a Reranking Model for Mapping Web Documents to Mediated XML Representations
Many documents on the Web are formatted in a weakly structured format. Because of their weak semantics and because of the heterogeneity of their formats, the information conveyed by...
Guillaume Wisniewski, Patrick Gallinari
WWW
2002
ACM
14 years 5 months ago
A machine learning based approach for table detection on the web
The table is a commonly used presentation scheme, especially for describing relational information. However, table understanding remains an open problem. In this paper, we consider th...
Yalin Wang, Jianying Hu
ICDAR
2005
IEEE
13 years 10 months ago
A Statistical Learning Approach To Document Image Analysis
In the field of computer analysis of document images, the problems of physical and logical layout analysis have been approached through a variety of heuristic, rule-based, and gr...
Kevin Laven, Scott Leishman, Sam T. Roweis
FLAIRS
2001
13 years 6 months ago
Multiple Predicate Learning for Document Image Understanding
Document image understanding denotes the recognition of semantically relevant components in the layout extracted from a document image. This recognition process is based on some visual models...
Floriana Esposito, Donato Malerba, Francesca A. Li...
ICDAR
2009
IEEE
13 years 2 months ago
Graph b-Coloring for Automatic Recognition of Documents
In order to reduce the rejection rate of our automatic reading system, we propose to pre-classify the business documents by introducing an Automatic Recognition of Documents stage...
Djamel Gaceb, Véronique Eglin, Frank Lebour...