Sciweavers

ECIR
2003
Springer

Hierarchical Indexing and Flexible Element Retrieval for Structured Document

13 years 5 months ago
Hierarchical Indexing and Flexible Element Retrieval for Structured Document
As more and more structured documents, such as SGML or XML documents become available on the Web, there is a growing demand to develop effective structured document retrieval which exploits both content and hierarchical structure of documents and return document elements with appropriate granularity. Previous work on partial retrieval of structured document has limited applications due to the requirement of structured queries and restriction on sliding along the document structure according to queries. In this paper, we put forward a method for flexible element retrieval which can get relevant document elements with arbitrary granularity against natural language queries. The proposed techniques constitute a novel hierarchical index propagation and pruning mechanism and an algorithm of ranking document elements based on the hierarchical index. The experimental results show that our method significantly outperforms other existing methods. Our method also shows robustness to the long-stan...
Hang Cui, Ji-Rong Wen, Tat-Seng Chua
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2003
Where ECIR
Authors Hang Cui, Ji-Rong Wen, Tat-Seng Chua
Comments (0)