Sciweavers

Share
153 search results - page 1 / 31
» Structure-aware sampling on data streams
Sort
View
JACM
2012
7 years 2 months ago
Continuous sampling from distributed streams
A fundamental problem in data management is to draw and maintain a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With la...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang
PODS
2015
ACM
22views Database» more  PODS 2015»
3 years 7 months ago
External Memory Stream Sampling
This paper aims to understand the I/O-complexity of maintaining a big sample set—whose size exceeds the internal memory’s capacity—on a data stream. We study this topic in a...
Xiaocheng Hu, Miao Qiao, Yufei Tao
SIGMOD
2005
ACM
220views Database» more  SIGMOD 2005»
10 years 5 days ago
Sampling Algorithms in a Stream Operator
Complex queries over high speed data streams often need to rely on approximations to keep up with their input. The research community has developed a rich literature on approximat...
Theodore Johnson, S. Muthukrishnan, Irina Rozenbau...
SSDBM
2007
IEEE
212views Database» more  SSDBM 2007»
9 years 6 months ago
Adaptive-Size Reservoir Sampling over Data Streams
Reservoir sampling is a well-known technique for sequential random sampling over data streams. Conventional reservoir sampling assumes a fixed-size reservoir. There are situation...
Mohammed Al-Kateb, Byung Suk Lee, Xiaoyang Sean Wa...
SSDBM
2010
IEEE
181views Database» more  SSDBM 2010»
8 years 10 months ago
Stratified Reservoir Sampling over Heterogeneous Data Streams
Reservoir sampling is a well-known technique for random sampling over data streams. In many streaming applications, however, an input stream may be naturally heterogeneous, i.e., c...
Mohammed Al-Kateb, Byung Suk Lee
books