Sciweavers

SIGMOD
2003
ACM

Querying Structured Text in an XML Database

14 years 4 months ago
Querying Structured Text in an XML Database
XML databases often contain documents comprising structured text. Therefore, it is important to integrate "information retrieval style" query evaluation, which is well-suited for natural language text, with standard "database style" query evaluation, which handles structured queries efficiently. Relevance scoring is central to information retrieval. In the case of XML, this operation becomes more complex because the data required for scoring could reside not directly in an element itself but also in its descendant elements. In this paper, we propose a bulk-algebra, TIX, and describe how it can be used as a basis for integrating information retrieval techniques into a standard pipelined database query evaluation engine. We develop new evaluation strategies essential to obtaining good performance, including a stack-based TermJoin algorithm for efficiently scoring composite elements. We report results from an extensive experimental evaluation, which show, among other ...
Shurug Al-Khalifa, Cong Yu, H. V. Jagadish
Added 08 Dec 2009
Updated 08 Dec 2009
Type Conference
Year 2003
Where SIGMOD
Authors Shurug Al-Khalifa, Cong Yu, H. V. Jagadish
Comments (0)