Sciweavers

7495 search results - page 122 / 1499
» Intelligent Document Processing
Sort
View
SIGMOD
2005
ACM
127views Database» more  SIGMOD 2005»
15 years 10 months ago
A framework for processing complex document-centric XML with overlapping structures
The key of overlapping structures or concurrent markup hierarchies in XML encodings of documents is that markup in one hierarchy is not necessarily well-formed with respect to the...
Ionut Emil Iacob, Alex Dekhtyar
ICDAR
2009
IEEE
15 years 4 months ago
PDF-TREX: An Approach for Recognizing and Extracting Tables from PDF Documents
This paper presents PDF-TREX, an heuristic approach for table recognition and extraction from PDF documents. The heuristics starts from an initial set of basic content elements an...
Ermelinda Oro, Massimo Ruffolo
ICEIS
2005
IEEE
15 years 3 months ago
Narrative Support for Technical Documents: Formalising Rhetorical Structure Theory
: Business Process Re-engineering (BPR) is an area that requires a lot of technical documents and an important feature of a well-written document is a coherent narrative. Even thou...
Nishadi De Silva, Peter Henderson
SETN
2004
Springer
15 years 3 months ago
Exploiting Cross-Document Relations for Multi-document Evolving Summarization
This paper presents a methodology for summarization from multiple documents which are about a specic topic. It is based on the specication and identication of the cross-document...
Stergos D. Afantenos, Irene Doura, Eleni Kapellou,...
SETN
2004
Springer
15 years 3 months ago
Clustering XML Documents by Structure
This work explores the application of clustering methods for grouping structurally similar XML documents. Modeling the XML documents as rooted ordered labeled trees, we apply clust...
Theodore Dalamagas, Tao Cheng, Klaas-Jan Winkel, T...