ICPR
2010
IEEE

Learning Image Anchor Templates for Document Classification and Data Extraction

Image anchor templates are used in document image analysis for document classification, data localization, and other tasks. Current tools allow human operators to mark out small sub-images from documents to act as anchor templates. However, this requires time and expertise, because operators have to make informed decisions based on the behavior of the template matching algorithms and the expected degradation patterns in documents. We propose learning templates for a task automatically and quickly from a few training examples. Document classification or data localization can be done more robustly by combining evidence from many more discriminating templates (e.g., hundreds) than would be practicable for operators to specify.
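As an illustration only, the idea of combining evidence from several anchor templates can be sketched as follows. This is not the paper's method: the abstract does not specify the matching algorithm, the learning procedure, or how evidence is combined. The sketch assumes normalized cross-correlation as the matcher, a simple vote count as the combination rule, and a hypothetical threshold of 0.9; the function names `ncc`, `best_match`, and `classify` are made up for this example, and the templates are given by hand rather than learned.

```python
import math

def ncc(patch, template):
    """Normalized cross-correlation between two equal-size 2D lists."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    den = math.sqrt(sum((x - ma) ** 2 for x in a) * sum((y - mb) ** 2 for y in b))
    # A constant patch or template has zero variance; treat it as no match.
    return num / den if den > 0 else 0.0

def best_match(image, template):
    """Slide the template over the image and return the highest NCC score."""
    th, tw = len(template), len(template[0])
    best = -1.0
    for r in range(len(image) - th + 1):
        for c in range(len(image[0]) - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            best = max(best, ncc(patch, template))
    return best

def classify(image, templates_by_class, threshold=0.9):
    """Assumed rule: the class with the most templates matching above threshold wins."""
    votes = {
        label: sum(best_match(image, t) >= threshold for t in templates)
        for label, templates in templates_by_class.items()
    }
    return max(votes, key=votes.get), votes
```

With hundreds of templates per class, a single template that fails to match because of local degradation changes only one vote, which is the intuition behind the robustness claim in the abstract.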
Prateek Sarkar
Added 12 Feb 2011
Updated 12 Feb 2011
Type Journal
Year 2010
Where ICPR
Authors Prateek Sarkar