Sciweavers

EDBT
2009
ACM

Schema-conscious filtering of XML documents

13 years 9 months ago
Schema-conscious filtering of XML documents
In a publish-subscribe system based on filtering of XML documents, subscribers specify their interests with profiles expressed in the XPath language. The system processes a stream of XML documents and delivers to subscribers a notification or content of documents that match the profiles. For filtering with profiles expressed as linear XPath queries, automaton-based approaches exist where the intractable size growth of a preconstructed deterministic finite automaton is avoided by using a nondeterministic automaton. In this article we examine how these general approaches, which do not assume the existence of any specific schema or document type definition (DTD), might benefit from the knowledge that all the XML documents to be filtered obey a given DTD. We present an algorithm that utilizes the DTD in the preprocessing phase of the filtering automaton to prune out descendant operators (//) and wildcards (∗) from the linear XPath filters. Experiments with data obtained fro...
Panu Silvasti, Seppo Sippu, Eljas Soisalon-Soinine
Added 24 Jul 2010
Updated 24 Jul 2010
Type Conference
Year 2009
Where EDBT
Authors Panu Silvasti, Seppo Sippu, Eljas Soisalon-Soininen
Comments (0)