Sciweavers

134 search results - page 7 / 27
» das 2006
Sort
View
DAS
2006
Springer
15 years 3 months ago
A System for Converting PDF Documents into Structured XML Format
We present in this paper a system for converting PDF legacy documents into structured XML format. This conversion system first extracts the different streams contained in PDF files...
Hervé Déjean, Jean-Luc Meunier
DAS
2006
Springer
15 years 3 months ago
Performance Comparison of Six Algorithms for Page Segmentation
Abstract. This paper presents a quantitative comparison of six algorithms for page segmentation: X-Y cut, smearing, whitespace analysis, constrained text-line finding, Docstrum, an...
Faisal Shafait, Daniel Keysers, Thomas M. Breuel
DAS
2006
Springer
15 years 3 months ago
Retrieval from Document Image Collections
Abstract. This paper presents a system for retrieval of relevant documents from large document image collections. We achieve effective search and retrieval from a large collection ...
A. Balasubramanian, Million Meshesha, C. V. Jawaha...
DAS
2006
Springer
15 years 1 months ago
XCDF: A Canonical and Structured Document Format
Accessing the structured content of PDF document is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we first present different methods...
Jean-Luc Bloechle, Maurizio Rigamonti, Karim Hadja...
DAS
2006
Springer
15 years 3 months ago
On Benchmarking of Invoice Analysis Systems
Abstract. An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. The state of the art of benchma...
Bertin Klein, Stefan Agne, Andreas Dengel