Sciweavers

PODS
2010
ACM
232views Database» more  PODS 2010»
13 years 10 months ago
Optimal sampling from distributed streams
A fundamental problem in data management is to draw a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With large, streamin...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang
PODS
2010
ACM
173views Database» more  PODS 2010»
13 years 10 months ago
Foundations of schema mapping management
In the last few years, a lot of attention has been paid to the specification and subsequent manipulation of schema mappings, a problem which is of fundamental importance in metad...
Marcelo Arenas, Jorge Pérez, Juan L. Reutte...
PODS
2010
ACM
150views Database» more  PODS 2010»
13 years 10 months ago
Understanding queries in a search database system
It is well known that a search engine can significantly benefit from an auxiliary database, which can suggest interpretations of the search query by means of the involved concep...
Ronald Fagin, Benny Kimelfeld, Yunyao Li, Sriram R...
PODS
2010
ACM
215views Database» more  PODS 2010»
13 years 10 months ago
An optimal algorithm for the distinct elements problem
We give the first optimal algorithm for estimating the number of distinct elements in a data stream, closing a long line of theoretical research on this problem begun by Flajolet...
Daniel M. Kane, Jelani Nelson, David P. Woodruff
PODS
2010
ACM
163views Database» more  PODS 2010»
13 years 10 months ago
Computing query probability with incidence algebras
Nilesh N. Dalvi, Karl Schnaitter, Dan Suciu
PODS
2010
ACM
207views Database» more  PODS 2010»
13 years 10 months ago
Understanding cardinality estimation using entropy maximization
Cardinality estimation is the problem of estimating the number of tuples returned by a query; it is a fundamentally important task in data management, used in query optimization, ...
Christopher Ré, Dan Suciu