Sciweavers

13 search results - page 1 / 3
» Learnable Similarity Functions and their Applications to Clu...
Sort
View
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
14 years 5 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
ICDM
2005
IEEE
185views Data Mining» more  ICDM 2005»
13 years 11 months ago
Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping
The problem of record linkage focuses on determining whether two object descriptions refer to the same underlying entity. Addressing this problem effectively has many practical ap...
Mikhail Bilenko, Sugato Basu, Mehran Sahami
ICDM
2006
IEEE
138views Data Mining» more  ICDM 2006»
13 years 11 months ago
Adaptive Blocking: Learning to Scale Up Record Linkage
Many information integration tasks require computing similarity between pairs of objects. Pairwise similarity computations are particularly important in record linkage systems, as...
Mikhail Bilenko, Beena Kamath, Raymond J. Mooney
DMKD
2004
ACM
139views Data Mining» more  DMKD 2004»
13 years 11 months ago
Iterative record linkage for cleaning and integration
Record linkage, the problem of determining when two records refer to the same entity, has applications for both data cleaning (deduplication) and for integrating data from multipl...
Indrajit Bhattacharya, Lise Getoor