Sciweavers

» Memory-constrained aggregate computation over data streams
STOC
2010
ACM
Measuring independence of datasets
Approximating pairwise, or k-wise, independence with sublinear memory is of considerable importance in the data stream model. In the streaming model the joint distribution is give...
Vladimir Braverman, Rafail Ostrovsky
KAIS
2006
XCQ: A queriable XML compression system
XML has already become the de facto standard for specifying and exchanging data on the Web. However, XML is by nature verbose and thus XML documents are usually large in size, a fa...
Wilfred Ng, Wai Yeung Lam, Peter T. Wood, Mark Lev...
IPPS
2006
IEEE
Speeding up NGB with distributed file streaming framework
Grid computing provides a very rich environment for scientific calculations. In addition to the challenges it provides, it also offers new opportunities for optimization. In this ...
Bingchen Li, Kang Chen, Zhiteng Huang, H. L. Rajic...
PVLDB
2010
Dremel: Interactive Analysis of Web-Scale Datasets
Dremel is a scalable, interactive ad-hoc query system for analysis of read-only nested data. By combining multi-level execution trees and columnar data layout, it is capable of ru...
Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geo...
KDD
1999
ACM
Learning Rules from Distributed Data
In this paper a concern about the accuracy (as a function of parallelism) of a certain class of distributed learning algorithms is raised, and one proposed improvement is illustrat...
Lawrence O. Hall, Nitesh V. Chawla, Kevin W. Bowye...