We describe the techniques developed to gather and distribute in a highly compressed, yet accessible, form a series of twelve snapshot of the .uk web domain. Ad hoc compression
Pseudo-relevance feedback assumes that most frequent terms in the pseudo-feedback documents are useful for the retrieval. In this study, we re-examine this assumption and show tha...
Guihong Cao, Jian-Yun Nie, Jianfeng Gao, Stephen R...
This short paper describes the beginnings of a project to digitize some of the older literature in the information retrieval field. So far 14 of the older reports, such as the Cra...
In this paper, we present a study of a novel summarization problem, i.e., summarizing the impact of a scientific publication. Given a paper and its citation context, we study how ...
Topics form a crucial component of a test collection. We show, through visualization, that the INEX 2008 topics have shortcomings, which questions their validity for evaluating XM...
Andrew Trotman, Maria del Rocio Gomez Crisostomo, ...