Sciweavers

332 search results - page 67 / 67
» Ranking and selecting clustering algorithms using a meta-lea...
Sort
View
DMKD
2004
ACM
139views Data Mining» more  DMKD 2004»
13 years 10 months ago
Iterative record linkage for cleaning and integration
Record linkage, the problem of determining when two records refer to the same entity, has applications for both data cleaning (deduplication) and for integrating data from multipl...
Indrajit Bhattacharya, Lise Getoor
SIGMOD
2010
ACM
214views Database» more  SIGMOD 2010»
13 years 10 months ago
ParaTimer: a progress indicator for MapReduce DAGs
Time-oriented progress estimation for parallel queries is a challenging problem that has received only limited attention. In this paper, we present ParaTimer, a new type of timere...
Kristi Morton, Magdalena Balazinska, Dan Grossman