Sciweavers

JOI
2007

Measuring quality of similarity functions in approximate data matching

13 years 4 months ago
Measuring quality of similarity functions in approximate data matching
This paper presents a method for assessing the quality of similarity functions. The scenario taken into account is that of approximate data matching, in which it is necessary to determine whether two data instances represent the same real world object. Our method is based on the semi-automatic estimation of optimal threshold values. We propose two methods for performing such estimation. The first method is an algorithm based on a reward function, and the second is a statistical method. Experiments were carried out to validate the techniques proposed. The results show that both methods for threshold estimation produce similar results. The output of such methods was used to design a grading function for similarity functions. This grading function, called discernability, was used to compare a number of similarity functions applied to an experimental data set. © 2006 Elsevier Ltd. All rights reserved.
Roberto da Silva, Raquel Kolitski Stasiu, Viviane
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2007
Where JOI
Authors Roberto da Silva, Raquel Kolitski Stasiu, Viviane Moreira Orengo, Carlos A. Heuser
Comments (0)