Multi Word Term Queries for Focused Information Retrieval

11 years 3 months ago
Multi Word Term Queries for Focused Information Retrieval
In this paper, we address both standard and focused retrieval tasks based on comprehensible language models and interactive query expansion (IQE). Query topics are expanded using an initial set of Multi Word Terms (MWTs) selected from top n ranked documents. MWTs are special text units that represent domain concepts and objects. As such, they can better represent query topics than ordinary phrases or ngrams. We tested different query representations: bag-of-words, phrases, flat list of MWTs, subsets of MWTs. We also combined the initial set of MWTs obtained in an IQE process with automatic query expansion (AQE) using language models and smoothing mechanism. We chose as baseline the Indri IR engine based on the language model using Dirichlet smoothing. The experiment is carried out on two benchmarks: TREC Enterprise track (TRECent) 2007 and 2008 collections; INEX 2008 Adhoc track using the Wikipedia collection.
Eric SanJuan, Fidelia Ibekwe-Sanjuan
Added 17 Aug 2010
Updated 17 Aug 2010
Type Conference
Year 2010
Authors Eric SanJuan, Fidelia Ibekwe-Sanjuan
Comments (0)