Multiple sources of evidence for XML retrieval

12 years 7 months ago
Multiple sources of evidence for XML retrieval
Document-centric XML collections contain text-rich documents, marked up with XML tags. The tags add lightweight semantics to the text. Querying such collections calls for a hybrid query language: the text-rich nature of the documents suggest a content-oriented (IR) approach, while the mark-up allows users to add structural constraints to their IR queries. We will show how evidence for relevancy from different sources helps to answer such hybrid queries. We evaluate our methods using the INEX 2003 test set, and show that structural hints in hybrid queries help to improve retrieval effectiveness. Categories and Subject Descriptors: H.2 [Database Management]: H.2.4 Query processing; H.2.8 Database Applications; H.3 [Information Storage and Retrieval]: H.3.1 Content Analysis and Indexing; H.3.3 Information Search and Retrieval; H.3.4 Systems and Software; H.3.7 Digital Libraries General Terms: Experimentation.
Börkur Sigurbjörnsson, Jaap Kamps, Maart
Added 30 Jun 2010
Updated 30 Jun 2010
Type Conference
Year 2004
Authors Börkur Sigurbjörnsson, Jaap Kamps, Maarten de Rijke
Comments (0)