Sciweavers

3180 search results - page 169 / 636
» Knowledge-based Document Analysis
DOCENG
2007
ACM
15 years 8 months ago
Mapping paradigm for document transformation
Since the advent of XML, the ability to transform documents using transformation languages such as XSLT has become an important challenge. However, writing a transformation script...
Arnaud Blouin, Olivier Beaudoux
DAS
2008
Springer
15 years 5 months ago
A Two-Step Dewarping of Camera Document Images
Dewarping of camera document images has attracted a lot of interest over the last few years since warping not only reduces the document readability but also affects the accuracy o...
Nikolaos Stamatopoulos, Basilios Gatos, Ioannis Pr...
DRR
2003
15 years 5 months ago
Information retrieval for OCR documents: a content-based probabilistic correction model
The difficulty with information retrieval for OCR documents lies in the fact that OCR documents comprise of a significant amount of erroneous words and unfortunately most informat...
Rong Jin, ChengXiang Zhai, Alexander G. Hauptmann
DRR
2009
15 years 1 month ago
Enriching a document collection by integrating information extraction and PDF annotation
Modern digital libraries offer all the hyperlinking possibilities of the World Wide Web: when a reader finds a citation of interest, in many cases she can now click on a link to b...
Brett Powley, Robert Dale, Ilya Anisimoff
ICDAR
2011
IEEE
14 years 3 months ago
Extending Page Segmentation Algorithms for Mixed-Layout Document Processing
The goal of this work is to add the capability to segment documents containing text, graphics, and pictures in the open source OCR engine OCRopus. To achieve this goal, OCRopus'...
Amy Winder, Tim L. Andersen, Elisa H. Barney Smith