The detection of new information in a document stream is an important component of many potential applications. In this paper, a new novelty detection approach based on the identi...
In this paper, we present an approach to answering “Other” questions using the notion of interest marking terms. “Other” questions have been introduced in the TREC-QA track...
Many current effectiveness measures incorporate simplifying assumptions about user behavior. These assumptions prevent the measures from reflecting aspects of the search process...
Abstract— The Topic Detection and Tracking (TDT) research community investigates information retrieval methods for organizing a constantly arriving stream of news articles by the...
James Allan, Stephen M. Harding, David Fisher, Alv...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...