Sciweavers

CIDR
2009

LifeRaft: Data-Driven, Batch Processing for the Exploration of Scientific Databases

13 years 4 months ago
LifeRaft: Data-Driven, Batch Processing for the Exploration of Scientific Databases
Workloads that comb through vast amounts of data are gaining importance in the sciences. These workloads consist of "needle in a haystack" queries that are long running and data intensive so that query throughput limits performance. To maximize throughput for data-intensive queries, we put forth LifeRaft: a query processing system that batches queries with overlapping data requirements. Rather than scheduling queries in arrival order, LifeRaft executes queries concurrently against an ordering of the data that maximizes data sharing among queries. This decreases I/O and increases cache utility. However, such batch processing can increase query response time by starving interactive workloads. LifeRaft addresses starvation using techniques inspired by head scheduling in disk drives. Depending upon the workload saturation and queuing times, the system adaptively and incrementally trades-off processing queries in arrival order and data-driven batch processing. Evaluating LifeRaft...
Xiaodan Wang, Randal C. Burns, Tanu Malik
Added 08 Nov 2010
Updated 08 Nov 2010
Type Conference
Year 2009
Where CIDR
Authors Xiaodan Wang, Randal C. Burns, Tanu Malik
Comments (0)