Faster Path Indexes for Search in XML Data

8 years 8 months ago
Faster Path Indexes for Search in XML Data
This article describes how to implement efficient memory resident path indexes for semi-structured data. Two techniques are introduced, and they are shown to be significantly faster than previous methods when facing path queries using the descendant axis and wild-cards. The first is conceptually simple and combines inverted lists, selectivity estimation, hit expansion and brute force search. The second uses suffix trees with additional statistics and multiple entry points into the query. The entry points are partially evaluated in an order based on estimated cost until one of them is complete. Many path index implementations are tested, using paths generated both from statistical models and DTDs.
Nils Grimsmo
Added 01 Jun 2010
Updated 01 Jun 2010
Type Conference
Year 2008
Where ADC
Authors Nils Grimsmo
Comments (0)