Sciweavers

» Evaluating evaluation measure stability
INFOCOM
2007
IEEE
15 years 6 months ago
A Cost-Based Evaluation of End-to-End Network Measurements in Overlay Multicast
Application-layer multicast (ALM, or overlay multicast) has been proposed to overcome limitations in IP multicast. While much measurement work (such as delay or connect...
Xing Jin, Qiuyan Xia, S.-H. Gary Chan
BMCBI
2008
14 years 12 months ago
Objective and automated protocols for the evaluation of biomedical search engines using No Title Evaluation protocols
Background: The evaluation of information retrieval techniques has traditionally relied on human judges to determine which documents are relevant to a query and which are not. Thi...
Fabien Campagne
LREC
2008
15 years 1 months ago
Sensitivity of Automated MT Evaluation Metrics on Higher Quality MT Output: BLEU vs Task-Based Evaluation Methods
We report the results of an experiment to assess the ability of automated MT evaluation metrics to remain sensitive to variations in MT quality as the average quality of the compa...
Bogdan Babych, Anthony Hartley
CICLING
2005
Springer
15 years 1 months ago
Evaluating Evaluation Methods for Generation in the Presence of Variation
Recent years have seen increasing interest in automatic metrics for the evaluation of generation systems. When a system can generate syntactic variation, automatic evaluation becom...
Amanda Stent, Matthew Marge, Mohit Singhai
ACL
2008
15 years 1 months ago
Intrinsic vs. Extrinsic Evaluation Measures for Referring Expression Generation
In this paper we present research in which we apply (i) the kind of intrinsic evaluation metrics that are characteristic of current comparative HLT evaluation, and (ii) extrinsic,...
Anja Belz, Albert Gatt