Sciweavers

» Evaluation measures for preference judgments
LREC
2008
13 years 7 months ago
Translation Adequacy and Preference Evaluation Tool (TAP-ET)
Evaluation of Machine Translation (MT) technology is often tied to the requirement for tedious manual judgments of translation quality. While automated MT metrology continues to b...
Mark A. Przybocki, Kay Peterson, Sebastien Bronsar...
SIGIR
2006
ACM
13 years 11 months ago
A statistical method for system evaluation using incomplete judgments
We consider the problem of large-scale retrieval evaluation, and we propose a statistical method for evaluating retrieval systems using incomplete judgments. Unlike existing techn...
Javed A. Aslam, Virgiliu Pavlu, Emine Yilmaz
CIKM
2006
Springer
13 years 9 months ago
Estimating average precision with incomplete and imperfect judgments
We consider the problem of evaluating retrieval systems using incomplete judgment information. Buckley and Voorhees recently demonstrated that retrieval systems can be efficiently...
Emine Yilmaz, Javed A. Aslam
LREC
2008
13 years 7 months ago
Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments
Polarizing discussions about political and social issues are common in mass media. Annotations on the degree to which a sentence expresses an ideological perspective can be valuab...
Wei-Hao Lin, Alexander G. Hauptmann
SIGIR
2008
ACM
13 years 5 months ago
Evaluation over thousands of queries
Information retrieval evaluation has typically been performed over several dozen queries, each judged to near-completeness. There has been a great deal of recent work on evaluatio...
Ben Carterette, Virgiliu Pavlu, Evangelos Kanoulas...