Sciweavers

HPDC
2007
IEEE

Ridge: combining reliability and performance in open grid platforms

Large-scale donation-based distributed infrastructures need to cope with the inherent unreliability of participant nodes. A widely used work scheduling technique in such environments is to redundantly schedule the outsourced computations to a number of nodes. We present the design and implementation of RIDGE, a reliability-aware system which uses a node’s prior performance and behavior to make more effective scheduling decisions. We have implemented RIDGE on top of the BOINC distributed computing infrastructure and have evaluated its performance on a live testbed consisting of 120 PlanetLab nodes. Our experimental results show that RIDGE is able to match or surpass the throughput of the best vanilla BOINC configuration under different reliability environments, by automatically adapting to the characteristics of the underlying environment. In addition, RIDGE is able to provide much lower workunit makespans compared to BOINC, which indicates its desirability in service-oriented enviro...
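The abstract does not spell out RIDGE's scheduling algorithm, so the following is only an illustrative sketch of the general idea of reliability-aware redundant scheduling, not the authors' method. It assumes that each node has an independent, known probability of returning a correct result, that a workunit is accepted when a majority of its replicas agree, and that the scheduler wants the smallest group meeting a target success probability. The function names `prob_at_least` and `pick_group` and all parameters are hypothetical.

```python
from typing import List, Optional


def prob_at_least(reliabilities: List[float], k: int) -> float:
    """Probability that at least k of the independent nodes are correct."""
    # dist[j] holds the probability that exactly j of the nodes
    # processed so far returned a correct result.
    dist = [1.0]
    for r in reliabilities:
        nxt = [0.0] * (len(dist) + 1)
        for j, p in enumerate(dist):
            nxt[j] += p * (1.0 - r)   # this node fails
            nxt[j + 1] += p * r       # this node succeeds
        dist = nxt
    return sum(dist[k:])


def pick_group(reliabilities: List[float], target: float,
               max_size: int) -> Optional[List[float]]:
    """Smallest odd-sized group of the most reliable nodes whose
    majority is correct with probability >= target (assumed policy)."""
    ranked = sorted(reliabilities, reverse=True)
    for size in range(1, max_size + 1, 2):  # odd sizes avoid tied votes
        if size > len(ranked):
            break
        group = ranked[:size]
        majority = size // 2 + 1
        if prob_at_least(group, majority) >= target:
            return group
    return None  # no group within max_size meets the target


if __name__ == "__main__":
    # Five nodes with an assumed reliability of 0.9 each.
    print(pick_group([0.9] * 5, target=0.95, max_size=5))
```

Under these assumptions, a pool of highly reliable nodes needs little or no replication, while a less reliable pool needs larger groups. This is one way a scheduler could adapt its redundancy to the environment instead of using a fixed replication factor.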
Added 02 Jun 2010
Updated 02 Jun 2010
Type Conference
Year 2007
Where HPDC
Authors Krishnaveni Budati, Jason D. Sonnek, Abhishek Chandra, Jon B. Weissman