Sciweavers

54 search results - page 4 / 11
» Automatic Ground-truth Generation for Document Image Analysi...
Sort
View
CVPR
2010
IEEE
15 years 3 months ago
ARISTA - Image Search to Annotation on Billions of Web Photos
Though it has cost great research efforts for decades, object recognition is still a challenging problem. Traditional methods based on machine learning or computer vision are stil...
Xin-Jing Wang, Ming Liu, Lei Zhang, Yi Li, Wei-Yin...
DRR
2003
14 years 10 months ago
Correcting OCR text by association with historical datasets
The Medical Article Records System (MARS) developed by the Lister Hill National Center for Biomedical Communications uses scanning, OCR and automated recognition and reformatting ...
Susan E. Hauser, Jonathan Schlaifer, Tehseen F. Sa...
ICCV
2005
IEEE
15 years 3 months ago
Learning Non-Generative Grammatical Models for Document Analysis
— We present a general approach for the hierarchical segmentation and labeling of document layout structures. This approach models document layout as a grammar and performs a glo...
Michael Shilman, Percy Liang, Paul A. Viola
DOCENG
2005
ACM
14 years 11 months ago
Schema matching for transforming structured documents
Structured document content reuse is the problem of restructuring and translating data structured under a source schema into an instance of a target schema. A notion closely tied ...
Aida Boukottaya, Christine Vanoirbeek
RIDE
1998
IEEE
15 years 1 months ago
Generating Hypermedia Documents from Transcriptions of Television Programs Using Parallel Text Alignment
This paper presents a method of automatically creating hypermedia documents from conventional transcriptions of television programs. Using parallel text alignment techniques, the ...
David C. Gibbon