Sciweavers

STDBM
2004
Springer
143views Database» more  STDBM 2004»
13 years 10 months ago
Indexing Query Regions for Streaming Geospatial Data
This paper introduces the Dynamic Cascade Tree (DCT), a structure designed to index query regions on multi-dimensional data streams. The DCT is designed for a stream management sy...
Quinn Hart, Michael Gertz
LATIN
2004
Springer
13 years 10 months ago
An Improved Data Stream Summary: The Count-Min Sketch and Its Applications
We introduce a new sublinear space data structure—the Count-Min Sketch— for summarizing data streams. Our sketch allows fundamental queries in data stream summarization such a...
Graham Cormode, S. Muthukrishnan
EAGC
2004
Springer
13 years 10 months ago
Using Global Snapshots to Access Data Streams on the Grid
Data streams are a prevalent and growing source of timely data. As streams become more prevalent, richer interrogation of the contents of the streams are required. Value of the con...
Beth Plale
DIS
2004
Springer
13 years 10 months ago
Mining Noisy Data Streams via a Discriminative Model
The two main challenges typically associated with mining data streams are concept drift and data contamination. To address these challenges, we seek learning techniques and models ...
Fang Chu, Yizhou Wang, Carlo Zaniolo
APPROX
2004
Springer
108views Algorithms» more  APPROX 2004»
13 years 10 months ago
Estimating Frequency Moments of Data Streams Using Random Linear Combinations
The problem of estimating the kth frequency moment Fk for any nonnegative k, over a data stream by looking at the items exactly once as they arrive, was considered in a seminal pap...
Sumit Ganguly
VLDB
2005
ACM
196views Database» more  VLDB 2005»
13 years 10 months ago
Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling
Emerging data stream management systems approach the challenge of massive data distributions which arrive at high speeds while there is only small storage by summarizing and minin...
Graham Cormode, S. Muthukrishnan, Irina Rozenbaum
VLDB
2005
ACM
140views Database» more  VLDB 2005»
13 years 10 months ago
Loadstar: Load Shedding in Data Stream Mining
In this demo, we show that intelligent load shedding is essential in achieving optimum results in mining data streams under various resource constraints. The Loadstar system intro...
Yun Chi, Haixun Wang, Philip S. Yu
PAKDD
2005
ACM
146views Data Mining» more  PAKDD 2005»
13 years 10 months ago
An Incremental Data Stream Clustering Algorithm Based on Dense Units Detection
Abstract. The data stream model of computation is often used for analyzing huge volumes of continuously arriving data. In this paper, we present a novel algorithm called DUCstream ...
Jing Gao, Jianzhong Li, Zhaogong Zhang, Pang-Ning ...
FSTTCS
2005
Springer
13 years 10 months ago
Practical Algorithms for Tracking Database Join Sizes
We present novel algorithms for estimating the size of the natural join of two data streams that have efficient update processing times and provide excellent quality of estimates....
Sumit Ganguly, Deepanjan Kesh, Chandan Saha
ESA
2005
Springer
107views Algorithms» more  ESA 2005»
13 years 10 months ago
Workload-Optimal Histograms on Streams
Histograms are used in many ways in conventional databases and in data stream processing for summarizing massive data distributions. Previous work on constructing histograms on da...
S. Muthukrishnan, Martin Strauss, X. Zheng