Sciweavers

229 search results - page 6 / 46
» Evaluation measures for preference judgments
AMTA
2004
Springer
15 years 1 month ago
The Significance of Recall in Automatic Metrics for MT Evaluation
Recent research has shown that a balanced harmonic mean (F1 measure) of unigram precision and recall outperforms the widely used BLEU and NIST metrics for Machine Translation evalu...
Alon Lavie, Kenji Sagae, Shyamsundar Jayaraman
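The balanced harmonic mean mentioned in the abstract above can be illustrated with a minimal sketch. It assumes whitespace tokenization, lowercasing, and clipped unigram matching against a single reference; the function name `unigram_f1` is hypothetical and this is not the authors' implementation.

```python
from collections import Counter


def unigram_f1(candidate: str, reference: str):
    """Return (precision, recall, F1) over unigrams.

    Assumptions (not from the paper): lowercase whitespace tokens,
    one reference translation, clipped counts.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0, 0.0, 0.0
    # Multiset intersection: a reference token is matched at most
    # as many times as it occurs in the reference.
    overlap = sum((Counter(cand) & Counter(ref)).values())
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / len(cand)
    recall = overlap / len(ref)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


p, r, f1 = unigram_f1("the cat sat", "the cat sat on the mat")
print(p, r, f1)
```

In this example the candidate is short but fully correct, so precision is high while recall is penalized for the missing reference words; F1 sits between the two.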
SIGIR
2005
ACM
15 years 3 months ago
Accurately interpreting clickthrough data as implicit feedback
This paper examines the reliability of implicit feedback generated from clickthrough data in WWW search. Analyzing the users’ decision process using eyetracking and comparing im...
Thorsten Joachims, Laura A. Granka, Bing Pan, Hele...
CIKM
2011
ACM
13 years 9 months ago
A probabilistic method for inferring preferences from clicks
Evaluating rankers using implicit feedback, such as clicks on documents in a result list, is an increasingly popular alternative to traditional evaluation methods based on explici...
Katja Hofmann, Shimon Whiteson, Maarten de Rijke
ACL
2010
14 years 7 months ago
Evaluating Machine Translations Using mNCD
This paper introduces mNCD, a method for automatic evaluation of machine translations. The measure is based on normalized compression distance (NCD), a general information theoret...
Marcus Dobrinkat, Tero Tapiovaara, Jaakko Väy...
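For orientation, plain normalized compression distance, the measure the abstract above builds on, can be sketched as follows. This is standard NCD rather than the paper's mNCD variant, and it assumes `zlib` as the compressor and UTF-8 encoding; the helper names are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (level 9)."""
    return len(zlib.compress(data, 9))


def ncd(x: str, y: str) -> float:
    """Plain NCD: (C(xy) - min(C(x), C(y))) / max(C(x), C(y)).

    Assumption: zlib stands in for the compressor C; other
    compressors give different absolute values.
    """
    bx, by = x.encode("utf-8"), y.encode("utf-8")
    cx, cy = compressed_size(bx), compressed_size(by)
    cxy = compressed_size(bx + by)
    return (cxy - min(cx, cy)) / max(cx, cy)


a = "the quick brown fox jumps over the lazy dog " * 20
b = "pack my box with five dozen liquor jugs " * 20
print(ncd(a, a), ncd(a, b))
```

Lower values indicate that one text adds little new information given the other, so a text compared with itself should score lower than two unrelated texts.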
DAWAK
2008
Springer
14 years 11 months ago
The Evaluation of Sentence Similarity Measures
The ability to accurately judge the similarity between natural language sentences is critical to the performance of several applications such as text mining, question answering, an...
Palakorn Achananuparp, Xiaohua Hu, Xiajiong Shen