Sciweavers

ECIR
2007
Springer
Similarity Measures for Short Segments of Text
Measuring the similarity between documents and queries has been extensively studied in information retrieval. However, there are a growing number of tasks that require computing th...
Donald Metzler, Susan T. Dumais, Christopher Meek
KDD
2003
ACM
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney