Sciweavers

413 search results - page 46 / 83
» Optimizing Queries Across Diverse Data Sources
Sort
View
ICDE
2004
IEEE
85views Database» more  ICDE 2004»
15 years 11 months ago
A Frequency-based Approach for Mining Coverage Statistics in Data Integration
Query optimization in data integration requires source coverage and overlap statistics. Gathering and storing the required statistics presents many challenges, not the least of wh...
Zaiqing Nie, Subbarao Kambhampati
SIGMOD
2009
ACM
175views Database» more  SIGMOD 2009»
15 years 10 months ago
Ranking distributed probabilistic data
Ranking queries are essential tools to process large amounts of probabilistic data that encode exponentially many possible deterministic instances. In many applications where unce...
Feifei Li, Ke Yi, Jeffrey Jestes
PVLDB
2010
112views more  PVLDB 2010»
14 years 4 months ago
CODS: Evolving Data Efficiently and Scalably in Column Oriented Databases
Database evolution is the process of updating the schema of a database or data warehouse (schema evolution) and evolving the data to the updated schema (data evolution). Database ...
Ziyang Liu, Sivaramakrishnan Natarajan, Bin He, Hu...
JACM
2012
13 years 5 days ago
Continuous sampling from distributed streams
A fundamental problem in data management is to draw and maintain a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With la...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang
EDBT
2008
ACM
161views Database» more  EDBT 2008»
15 years 10 months ago
Load distribution of analytical query workloads for database cluster architectures
Enterprises may have multiple database systems spread across the organization for redundancy or for serving different applications. In such systems, query workloads can be distrib...
Thomas Phan, Wen-Syan Li