Sciweavers

333 search results - page 1 / 67
» Optimal sampling from distributed streams
Sort
View
PODS
2010
ACM
232views Database» more  PODS 2010»
13 years 10 months ago
Optimal sampling from distributed streams
A fundamental problem in data management is to draw a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With large, streamin...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang
JACM
2012
11 years 7 months ago
Continuous sampling from distributed streams
A fundamental problem in data management is to draw and maintain a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With la...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang
PODS
2009
ACM
112views Database» more  PODS 2009»
14 years 5 months ago
Optimal sampling from sliding windows
APPEARED IN ACM PODS-2009. A sliding windows model is an important case of the streaming model, where only the most "recent" elements remain active and the rest are disc...
Vladimir Braverman, Rafail Ostrovsky, Carlo Zaniol...
VLDB
2005
ACM
196views Database» more  VLDB 2005»
13 years 10 months ago
Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling
Emerging data stream management systems approach the challenge of massive data distributions which arrive at high speeds while there is only small storage by summarizing and minin...
Graham Cormode, S. Muthukrishnan, Irina Rozenbaum
DASFAA
2006
IEEE
168views Database» more  DASFAA 2006»
13 years 11 months ago
PMJoin: Optimizing Distributed Multi-way Stream Joins by Stream Partitioning
Abstract. In emerging data stream applications, data sources are typically distributed. Evaluating multi-join queries over streams from different sources may incur large communica...
Yongluan Zhou, Ying Yan, Feng Yu, Aoying Zhou