Sciweavers

461 search results - page 2 / 93
» Text Segmentation Based on Document Understanding for Inform...
Sort
View
DOCENG
2009
ACM
13 years 11 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
ICDAR
1997
IEEE
13 years 9 months ago
Representing OCRed documents in HTML
ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari
ICVGIP
2004
13 years 6 months ago
Robust Segmentation of Unconstrained Online Handwritten Documents
A segmentation algorithm, which can detect different regions of a handwritten document such as text lines, tables and sketches will be extremely useful in a variety of application...
Anoop M. Namboodiri, Anil K. Jain
ERCIMDL
2004
Springer
120views Education» more  ERCIMDL 2004»
13 years 10 months ago
Towards Topic Driven Access to Full Text Documents
We address the issue of providing topic driven access to full text documents. The methodology we propose is a combination of topic segmentation and information retrieval techniques...
Caterina Caracciolo, Willem Robert van Hage, Maart...
ICMCS
2006
IEEE
189views Multimedia» more  ICMCS 2006»
13 years 11 months ago
Multiscale Edge-Based Text Extraction from Complex Images
Text that appears in images contains important and useful information. Detection and extraction of text in images have been used in many applications. In this paper, we propose a ...
Xiaoqing Liu, Jagath Samarabandu