In a categorized information space, predicting users' information needs at the category level can facilitate personalization, caching and other topic-oriented services. This ...
We develop a method for predicting query performance by computing the relative entropy between a query language model and the corresponding collection language model. The resultin...
An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Ind...
The intuition that different text classifiers behave in qualitatively different ways has long motivated attempts to build a better metaclassifier via some combination of classifie...
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...