Sciweavers

260 search results - page 5 / 52
» Industry-scale duplicate detection
Sort
View
ICDE
2005
IEEE
108views Database» more  ICDE 2005»
15 years 3 months ago
Robust Identification of Fuzzy Duplicates
Detecting and eliminating fuzzy duplicates is a critical data cleaning task that is required by many applications. Fuzzy duplicates are multiple seemingly distinct tuples which re...
Surajit Chaudhuri, Venkatesh Ganti, Rajeev Motwani
DSN
2007
IEEE
15 years 3 months ago
Processor-Level Selective Replication
Full duplication of an entire application (through spatial or temporal redundancy) would detect many errors that are benign to the application from the perspective of the end-user...
Nithin Nakka, Karthik Pattabiraman, Ravishankar K....
ESWS
2010
Springer
15 years 27 days ago
Efficient Semantic-Aware Detection of Near Duplicate Resources
Abstract. Efficiently detecting near duplicate resources is an important task when integrating information from various sources and applications. Once detected, near duplicate reso...
Ekaterini Ioannou, Odysseas Papapetrou, Dimitrios ...
82
Voted
MVA
2007
119views Computer Vision» more  MVA 2007»
14 years 11 months ago
Wavelet-Based Multiresolution Features for Detecting Duplications in Images
Duplication of image regions is a common method for manipulating original images using typical software like Adobe Photoshop. In this study, we propose a wavelet based feature rep...
Md. Khayrul Bashar, Keiji Noda, Noboru Ohnishi, Hi...
PEPM
2010
ACM
15 years 6 months ago
Clone detection and elimination for Haskell
Duplicated code is a well known problem in software maintenance and refactoring. Code clones tend to increase program size and several studies have shown that duplicated code make...
Christopher Brown, Simon Thompson