Sciweavers

ICPR
2008
IEEE

Background variability modeling for statistical layout analysis

14 years 5 months ago
Background variability modeling for statistical layout analysis
Geometric layout analysis plays an important role in document image understanding. Many algorithms known in literature work well on standard document images, achieving high text line segmentation accuracy on the UW-III dataset. These algorithms rely on certain assumptions about document layouts, and fail when their underlying assumptions are not met. Also, they do not provide confidence scores for their output. These two problems limit the usefulness of general purpose layout analysis methods in large scale applications. In this contribution, we propose a statistically motivated model-based trainable layout analysis system that allows assumption-free adaptation to different layout types and produces likelihood estimates of the correctness of the computed page segmentation. The performance of our approach is tested on a subset of the Google 1000 books dataset where it achieved a text line segmentation accuracy of 98.4% on layouts where other generalpurpose algorithms failed to do a cor...
Faisal Shafait, Joost van Beusekom, Daniel Keysers
Added 05 Nov 2009
Updated 06 Nov 2009
Type Conference
Year 2008
Where ICPR
Authors Faisal Shafait, Joost van Beusekom, Daniel Keysers, Thomas M. Breuel
Comments (0)