Sciweavers

6 search results - page 1 / 2
» Towards a Canonical and Structured Representation of PDF Doc...
Sort
View
ICDAR
2005
IEEE
13 years 10 months ago
Towards a Canonical and Structured Representation of PDF Documents through Reverse Engineering
This article presents Xed, a reverse engineering tool for PDF documents, which extracts the original document layout structure. Xed mixes electronic extraction methods with state-...
Maurizio Rigamonti, Jean-Luc Bloechle, Karim Hadja...
ICDAR
2009
IEEE
13 years 2 months ago
OCD: An Optimized and Canonical Document Format
Revealing and being able to manipulate the structured content of PDF documents is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we ...
Jean-Luc Bloechle, Denis Lalanne, Rolf Ingold
DAS
2006
Springer
13 years 6 months ago
XCDF: A Canonical and Structured Document Format
Accessing the structured content of PDF document is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we first present different methods...
Jean-Luc Bloechle, Maurizio Rigamonti, Karim Hadja...
ICSM
1996
IEEE
13 years 8 months ago
VIFOR 2: a tool for browsing and documentation
During the maintenance of legacy systems, the structure and the documentationofthe system usually deteriorates, and hence the maintenance becomesprogressively harder and harder. I...
Vaclav Rajlich, Sridhar Reddy Adnapally
CIKM
2008
Springer
13 years 6 months ago
CE2: towards a large scale hybrid search engine with integrated ranking support
The Web contains a large amount of documents and increasingly, also semantic data in the form of RDF triples. Many of these triples are annotations that are associated with docume...
Haofen Wang, Thanh Tran, Chang Liu