Sciweavers

STOC
2001
ACM

Data-streams and histograms

14 years 4 months ago
Data-streams and histograms
Histograms are typically used to approximate data distributions. Histograms and related synopsis structures have been successful in a wide variety of popular database applications including approximate querying, similarity searching and data mining. Histograms were a few of the earliest synopsis structures proposed and continue to be popular tools. Typically, the histograms are used as quick and easy estimates, and thus the slight loss of accuracy can be offset by fast histogram construction algorithms. A natural question arises in this context: can we find a fast near optimal approximation algorithm for the histogram construction problem? In this paper, we give the first linear time (1 + )-factor approximation algorithms (for any > 0) for several histogram construction problems. Several of our algorithms extend to data streams. We also show that our method generalizes to a large number of histogram construction problems including the use of piecewise small degree polynomials to ap...
Sudipto Guha, Nick Koudas, Kyuseok Shim
Added 03 Dec 2009
Updated 03 Dec 2009
Type Conference
Year 2001
Where STOC
Authors Sudipto Guha, Nick Koudas, Kyuseok Shim
Comments (0)