
ICPPW
2009
IEEE

Decentralized Load Balancing for Improving Reliability in Heterogeneous Distributed Systems

Abstract: A probabilistic analytical framework for decentralized load balancing (LB) strategies for heterogeneous distributed-computing systems (DCSs) is presented, with the overall goal of maximizing the service reliability in the presence of random failures. The service reliability of a DCS is defined as the probability of successfully serving a specified workload before all the computing nodes fail permanently. In the framework considered, the service and failure times of nodes are random, the communication times in the network are both tangible and stochastic, and LB is performed synchronously by all the nodes during the runtime of each submitted workload. By taking a novel regenerative stochastic-analysis approach, the service reliability of a two-node DCS is characterized analytically. This formulation, in turn, is used to form and solve an optimization problem, yielding LB policies with maximal reliability. A scalable extension of the two-node formulation to an arbitrary size s...
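To make the abstract's definition of service reliability concrete, here is a minimal Monte Carlo sketch in Python. It is not the paper's regenerative analysis. It assumes exponential service, failure, and transfer times, a single one-shot transfer of tasks from node 1 to node 2 at time zero, and that tasks in transit survive a failure of the sender. The function name `simulate_reliability` and all parameter names and values are hypothetical.

```python
import random

def simulate_reliability(m1, m2, moved, node1, node2, transfer_rate,
                         trials=20000, seed=0):
    """Estimate P(every task is served before the node holding it fails).

    Assumptions (not from the paper): exponential times throughout, one
    transfer of `moved` tasks from node 1 to node 2 at time zero.
    """
    mu1, lam1 = node1  # (service rate, failure rate) of node 1
    mu2, lam2 = node2  # (service rate, failure rate) of node 2
    rng = random.Random(seed)
    successes = 0
    for _ in range(trials):
        fail1 = rng.expovariate(lam1)
        fail2 = rng.expovariate(lam2)
        # Node 1 serves the tasks it keeps, one after another.
        done1 = sum(rng.expovariate(mu1) for _ in range(m1 - moved))
        # Node 2 serves its own tasks, then the transferred batch once it arrives.
        own2 = sum(rng.expovariate(mu2) for _ in range(m2))
        arrival = rng.expovariate(transfer_rate) if moved > 0 else 0.0
        extra2 = sum(rng.expovariate(mu2) for _ in range(moved))
        done2 = max(own2, arrival) + extra2
        if done1 < fail1 and done2 < fail2:
            successes += 1
    return successes / trials

# Compare a few transfer sizes for an unreliable node 1 and a reliable node 2.
for moved in (0, 5, 10):
    estimate = simulate_reliability(10, 0, moved, (1.0, 0.2), (1.0, 0.02), 2.0)
    print(moved, estimate)
```

Sweeping `moved` over all feasible values and keeping the largest estimate gives a crude stand-in for the optimization step the abstract describes, under the simplifying assumptions stated above.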
Jorge E. Pezoa, Sagar Dhakal, Majeed M. Hayat
Added 23 May 2010
Updated 23 May 2010
Type Conference
Year 2009
Where ICPPW
Authors Jorge E. Pezoa, Sagar Dhakal, Majeed M. Hayat