SIGIR
2008
ACM

Evaluation measures for preference judgments

There has been recent interest in collecting user or assessor preferences, rather than absolute judgments of relevance, for the evaluation or learning of ranking algorithms. Since measures like precision, recall, and DCG are defined over absolute judgments, evaluation over preferences will require new evaluation measures that explicitly model them. We describe a class of such measures and compare absolute and preference measures over a large TREC collection.
Categories and Subject Descriptors: H.3.4 Information Storage and Retrieval; Systems and Software: Performance Evaluation
General Terms: Performance, Measurement
Ben Carterette, Paul N. Bennett
Added 15 Dec 2010
Updated 15 Dec 2010
Type Conference
Year 2008
Where SIGIR
Authors Ben Carterette, Paul N. Bennett