Sciweavers

367 search results - page 2 / 74
» Duplicate detection in probabilistic data
Sort
View
ICDE
2006
IEEE
141views Database» more  ICDE 2006»
14 years 6 months ago
Clean Answers over Dirty Databases: A Probabilistic Approach
The detection of duplicate tuples, corresponding to the same real-world entity, is an important task in data integration and cleaning. While many techniques exist to identify such...
Ariel Fuxman, Periklis Andritsos, Renée J. ...
SIGMOD
2005
ACM
119views Database» more  SIGMOD 2005»
14 years 5 months ago
DogmatiX Tracks down Duplicates in XML
Duplicate detection is the problem of detecting different entries in a data source representing the same real-world entity. While research abounds in the realm of duplicate detect...
Melanie Weis, Felix Naumann
ECOOPW
1998
Springer
13 years 9 months ago
Visual Detection of Duplicated Code
Code duplication is considered as bad practice that complicates the maintenance and evolution of software. Detecting duplicated code is a difficult task because of the large amoun...
Matthias Rieger, Stéphane Ducasse
DEBU
2000
118views more  DEBU 2000»
13 years 5 months ago
Matching Algorithms within a Duplicate Detection System
Detecting database records that are approximate duplicates, but not exact duplicates, is an important task. Databases may contain duplicate records concerning the same real-world ...
Alvaro E. Monge
CIKM
2008
Springer
13 years 7 months ago
Scaling up duplicate detection in graph data
Duplicate detection determines different representations of realworld objects in a database. Recent research has considered the use of relationships among object representations t...
Melanie Herschel, Felix Naumann