Processing Content-And-Structure Queries for XML Retrieval

8 years 6 months ago
Processing Content-And-Structure Queries for XML Retrieval
Document-centric XML collections contain text-rich documents, marked up with XML tags. The tags add lightweight semantics to the text. Querying such collections calls for a hybrid query language: the text-rich nature of the documents suggest a content-oriented (IR) approach, while the mark-up allows users to add structural constraints to their IR queries. We propose an approach to such hybrid contentand-structure queries that decomposes a query into multiple content-only queries whose results are then combined in ways determined by the structural constraints of the original query. We report on ongoing work and present preliminary evaluation results, based on the INEX 2003 test set.
Börkur Sigurbjörnsson, Jaap Kamps, Maart
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2004
Where TDM
Authors Börkur Sigurbjörnsson, Jaap Kamps, Maarten de Rijke
Comments (0)