Sciweavers

630 search results - page 2 / 126
» Optimized union of non-disjoint distributed data sets
Sort
View
EDBT
2010
ACM
188views Database» more  EDBT 2010»
14 years 4 days ago
Subsumption and complementation as data fusion operators
The goal of data fusion is to combine several representations of one real world object into a single, consistent representation, e.g., in data integration. A very popular operator...
Jens Bleiholder, Sascha Szott, Melanie Herschel, F...
CONCURRENCY
2002
82views more  CONCURRENCY 2002»
13 years 5 months ago
Optimizing the distribution of large data sets in theory and practice
Felix Rauch, Christian Kurmann, Thomas Stricker
ICDE
2005
IEEE
135views Database» more  ICDE 2005»
14 years 7 months ago
Finding (Recently) Frequent Items in Distributed Data Streams
We consider the problem of maintaining frequency counts for items occurring frequently in the union of multiple distributed data streams. Na?ive methods of combining approximate f...
Amit Manjhi, Vladislav Shkapenyuk, Kedar Dhamdhere...
DBPL
1999
Springer
102views Database» more  DBPL 1999»
13 years 10 months ago
Union Types for Semistructured Data
Semistructured databases are treated as dynamically typed: they come equipped with no independent schema or type system to constrain the data. Query languages that are designed fo...
Peter Buneman, Benjamin C. Pierce
PODS
2010
ACM
232views Database» more  PODS 2010»
13 years 10 months ago
Optimal sampling from distributed streams
A fundamental problem in data management is to draw a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With large, streamin...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang