Sciweavers

282 search results - page 1 / 57
» Interactive deduplication using active learning
Sort
View
KDD
2002
ACM
93views Data Mining» more  KDD 2002»
14 years 5 months ago
Interactive deduplication using active learning
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...
Sunita Sarawagi, Anuradha Bhamidipaty
VLDB
2002
ACM
126views Database» more  VLDB 2002»
13 years 4 months ago
ALIAS: An Active Learning led Interactive Deduplication System
Deduplication, a key operation in integrating data from multiple sources, is a time-consuming, labor-intensive and domainspecific operation. We present our design of alias that us...
Sunita Sarawagi, Anuradha Bhamidipaty, Alok Kirpal...
ICDE
2003
IEEE
159views Database» more  ICDE 2003»
14 years 6 months ago
Scaling up the ALIAS Duplicate Elimination System
Duplicate elimination is an important stage in integrating data from multiple sources. The challenges involved are finding a robust deduplication function that can identify when t...
Sunita Sarawagi, Alok Kirpal
CEC
2010
IEEE
13 years 2 months ago
Active Learning Genetic programming for record deduplication
The great majority of genetic programming (GP) algorithms that deal with the classification problem follow a supervised approach, i.e., they consider that all fitness cases availab...
Junio de Freitas, Gisele L. Pappa, Altigran Soares...
KDD
2009
ACM
205views Data Mining» more  KDD 2009»
13 years 11 months ago
From active towards InterActive learning: using consideration information to improve labeling correctness
Data mining techniques have become central to many applications. Most of those applications rely on so called supervised learning algorithms, which learn from given examples in th...
Abraham Bernstein, Jiwen Li