Search Sciweavers | Sciweavers

» Interaction between record matching and data repairing

ICDE 2005, IEEE

Schema Matching using Duplicates

16 years 3 months ago

Download www.dit.unitn.it

Most data integration applications require a matching between the schemas of the respective data sets. We show how the existence of duplicates within these data sets can be exploi...

Alexander Bilke, Felix Naumann

ICDE 2007, IEEE

Group Linkage

16 years 3 months ago

Download pike.psu.edu

Poor quality data is prevalent in databases due to a variety of reasons, including transcription errors, lack of standards for recording database fields, etc. To be able to query ...

Byung-Won On, Nick Koudas, Dongwon Lee, Divesh Sri...

KDD 2002, ACM

Interactive deduplication using active learning

16 years 2 months ago

Download www.it.iitb.ac.in

Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...

Sunita Sarawagi, Anuradha Bhamidipaty

WWW 2010, ACM

Exploiting content redundancy for web information extraction

15 years 2 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...

CHI 1997, ACM

Computational Models of Information Scent-Following in a Very Large Browsable Text Collection

15 years 6 months ago

Download www2.parc.com

An ecological-cognitive framework of analysis and a model-tracing architecture are presented and used in the analysis of data recorded from users browsing a large document collect...

Peter Pirolli
