Sciweavers

CICLING
2009
Springer

Generalized Mongue-Elkan Method for Approximate Text String Comparison

13 years 4 months ago
Generalized Mongue-Elkan Method for Approximate Text String Comparison
Abstract. The Mongue-Elkan method is a general text string comparison method based on an internal character-based similarity measure (e.g. edit distance) combined with a token level (i.e. word level) similarity measure. We propose a generalization of this method based on the notion of the generalized arithmetic mean instead of the simple average used in the expression to calculate the Monge-Elkan method. The experiments carried out with 12 well-known name-matching data sets show that the proposed approach outperforms the original Monge-Elkan method when character-based measures are used to compare tokens.
Sergio Jimenez, Claudia Becerra, Alexander F. Gelb
Added 08 Nov 2010
Updated 08 Nov 2010
Type Conference
Year 2009
Where CICLING
Authors Sergio Jimenez, Claudia Becerra, Alexander F. Gelbukh, Fabio Gonzalez
Comments (0)