Sciweavers

PODS
2012
ACM
13 years 9 months ago
Randomized algorithms for tracking distributed count, frequencies, and ranks
We show that randomization can lead to significant improvements for a few fundamental problems in distributed tracking. Our basis is the count-tracking problem, where there are k...
Zengfeng Huang, Ke Yi, Qin Zhang
PODS
2012
ACM
13 years 9 months ago
Query-based data pricing
Data is increasingly being bought and sold online, and Web-based marketplace services have emerged to facilitate these activities. However, current mechanisms for pricing data are ...
Paraschos Koutris, Prasang Upadhyaya, Magdalena Ba...
PODS
2012
ACM
13 years 9 months ago
Mergeable summaries
We study the mergeability of data summaries. Informally speaking, mergeability requires that, given two summaries on two data sets, there is a way to merge the two summaries into ...
Pankaj K. Agarwal, Graham Cormode, Zengfeng Huang,...
SIGMOD
2012
ACM
13 years 9 months ago
Shark: fast data analysis using coarse-grained distributed memory
Shark is a research data analysis system built on a novel coarse-grained distributed shared-memory abstraction. Shark marries query processing with deep data analysis, providing a unifie...
Cliff Engle, Antonio Lupher, Reynold Xin, Matei Za...
SIGMOD
2012
ACM
13 years 9 months ago
A model-based approach to attributed graph clustering
Zhiqiang Xu, Yiping Ke, Yi Wang, Hong Cheng, James...