Sciweavers

Search results for: On Anonymization of String Data
KDD
2008
ACM
Unsupervised deduplication using cross-field dependencies
Recent work in deduplication has shown that collective deduplication of different attribute types can improve performance. But although these techniques cluster the attributes col...
Robert Hall, Charles A. Sutton, Andrew McCallum
CORR
2004
Springer
Understanding Search Trees via Statistical Physics
We study the random m-ary search tree model (where m stands for the number of branches of the search tree), an important problem for data storage in computer science, using a varie...
Satya N. Majumdar, David S. Dean, Paul L. Krapivsk...
KDD
2003
ACM
Mining data records in Web pages
A large amount of information on the Web is contained in regularly structured objects, which we call data records. Such data records are important because they often present the e...
Bing Liu, Robert L. Grossman, Yanhong Zhai
STACS
2009
Springer
Error-Correcting Data Structures
We study data structures in the presence of adversarial noise. We want to encode a given object in a succinct data structure that enables us to efficiently answer specific queries...
Ronald de Wolf
KDD
2001
ACM
Multimedia Data Mining for Traffic Video Sequences
In this paper, a multimedia data mining framework for discovering important but previously unknown knowledge such as vehicle identification, traffic flow, and the spatio-temporal ...
Shu-Ching Chen, Mei-Ling Shyu, Chengcui Zhang, Jef...