Sciweavers

5489 search results - page 55 / 1098
» Evaluating evaluation measure stability
Sort
View
WSDM
2010
ACM
173views Data Mining» more  WSDM 2010»
15 years 9 months ago
Measuring the Reusability of Test Collections
While test collection construction is a time-consuming and expensive process, the true cost is amortized by reusing the collection over hundreds or thousands of experiments. Some ...
Ben Carterette, Evgeniy Gabrilovich, Vanja Josifov...
JCDL
2005
ACM
111views Education» more  JCDL 2005»
15 years 5 months ago
Developing practical automatic metadata assignment and evaluation tools for internet resources
This paper describes the development of practical automatic metadata assignment tools to support automatic record creation for virtual libraries, metadata repositories and digital...
Gordon W. Paynter
LREC
2010
166views Education» more  LREC 2010»
15 years 1 months ago
Semantic Evaluation of Machine Translation
It is recognized that many evaluation metrics of machine translation in use that focus on surface word level suffer from their lack of tolerance of linguistic variance, and the in...
Billy Tak-Ming Wong
LREC
2010
141views Education» more  LREC 2010»
15 years 1 months ago
Design and Application of a Gold Standard for Morphological Analysis: SMOR as an Example of Morphological Evaluation
This paper describes general requirements for evaluating and documenting NLP tools with a focus on morphological analysers and the design of a Gold Standard. It is argued that any...
Gertrud Faaß, Ulrich Heid, Helmut Schmid
SIGIR
2010
ACM
14 years 6 months ago
PRES: a score metric for evaluating recall-oriented information retrieval applications
Information retrieval (IR) evaluation scores are generally designed to measure the effectiveness with which relevant documents are identified and retrieved. Many scores have been ...
Walid Magdy, Gareth J. F. Jones