Sciweavers

TREC
2004

Columbia University in the Novelty Track at TREC 2004

13 years 6 months ago
Columbia University in the Novelty Track at TREC 2004
Our system for the Novelty Track at TREC 2004 looks beyond sentence boundaries as well as within sentences to identify novel, nonduplicative passages. It tries to identify text spans of two or more sentences that encompass mini-segments of new information. At the same time, we avoid any pairwise comparison of sentences, but rely on the presence of previously unseen terms to provide evidence of novelty. The system is guided by a number of parameters, both weights and thresholds, that are learned automatically with a randomized hill-climbing algorithm. During learning, we varied the target function to produce configurations that emphasize either precision or recall. We also implemented a straightforward vector-space model as a comparison and to test a combined approach.
Barry Schiffman, Kathleen McKeown
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2004
Where TREC
Authors Barry Schiffman, Kathleen McKeown
Comments (0)