Sciweavers

» A DTD Extension for Document Structure Recognition
IS
2007
Schema-conscious XML indexing
User queries on extensible markup language (XML) documents are typically expressed as regular path expressions. A variety of indexing techniques for efficiently retrieving the re...
Krishna P. Leela, Jayant R. Haritsa
FMSP
2000
ACM
DSD: A schema language for XML
XML (eXtensible Markup Language) is a linear syntax for trees, which has gathered a remarkable amount of interest in industry. The acceptance of XML opens new avenues for the appli...
Nils Klarlund, Anders Møller, Michael I. Sc...
KDD
2008
ACM
Structured entity identification and document categorization: two tasks with one joint model
Traditionally, research in identifying structured entities in documents has proceeded independently of document categorization research. In this paper, we observe that these two t...
Indrajit Bhattacharya, Shantanu Godbole, Sachindra...
XIMEP
2005
ACM
Combining a Publish and Subscribe Collaboration Architecture with XQuery Approaches
Markup languages, representations, schemas, and tools have significantly increased the ability for organizations to share their information. Languages such as the Extensible Marku...
M. Brian Blake, David H. Fado, Gregory A. Mack
SIGIR
2004
ACM
Document clustering via adaptive subspace iteration
Document clustering has long been an important problem in information retrieval. In this paper, we present a new clustering algorithm, ASI, which uses explicit modeling of the s...
Tao Li, Sheng Ma, Mitsunori Ogihara