Sciweavers

178 search results - page 7 / 36
» Interaction between record matching and data repairing
Sort
View
ICDE
2005
IEEE
111views Database» more  ICDE 2005»
15 years 11 months ago
Schema Matching using Duplicates
Most data integration applications require a matching between the schemas of the respective data sets. We show how the existence of duplicates within these data sets can be exploi...
Alexander Bilke, Felix Naumann
ICDE
2007
IEEE
122views Database» more  ICDE 2007»
15 years 11 months ago
Group Linkage
Poor quality data is prevalent in databases due to a variety of reasons, including transcription errors, lack of standards for recording database fields, etc. To be able to query ...
Byung-Won On, Nick Koudas, Dongwon Lee, Divesh Sri...
KDD
2002
ACM
93views Data Mining» more  KDD 2002»
15 years 10 months ago
Interactive deduplication using active learning
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...
Sunita Sarawagi, Anuradha Bhamidipaty
WWW
2010
ACM
14 years 9 months ago
Exploiting content redundancy for web information extraction
We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...
Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...
CHI
1997
ACM
15 years 1 months ago
Computational Models of Information Scent-Following in a Very Large Browsable Text Collection
An ecological-cognitive framework of analysis and a model-tracing architecture are presented and used in the analysis of data recorded from users browsing a large document collect...
Peter Pirolli