Sciweavers

3 search results - page 1 / 1
» ProbClean: A probabilistic duplicate detection system
Sort
View
ICDE
2010
IEEE
204views Database» more  ICDE 2010»
13 years 11 months ago
ProbClean: A probabilistic duplicate detection system
— One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
P2P
2010
IEEE
202views Communications» more  P2P 2010»
13 years 3 months ago
Optimizing Near Duplicate Detection for P2P Networks
—In this paper, we propose a probabilistic algorithm for detecting near duplicate text, audio, and video resources efficiently and effectively in large-scale P2P systems. To thi...
Odysseas Papapetrou, Sukriti Ramesh, Stefan Siersd...
FGR
2008
IEEE
264views Biometrics» more  FGR 2008»
13 years 6 months ago
Large scale learning and recognition of faces in web videos
The phenomenal growth of video on the web and the increasing sparseness of meta information associated with it forces us to look for signals from the video content for search/info...
Ming Zhao 0003, Jay Yagnik, Hartwig Adam, David Ba...