Sciweavers

EDCC
2005
Springer

Failure Detection with Booting in Partially Synchronous Systems

13 years 9 months ago
Failure Detection with Booting in Partially Synchronous Systems
Unreliable failure detectors are a well known means to enrich asynchronous distributed systems with time-free semantics that allow to solve consensus in the presence of crash failures. Implementing unreliable failure detectors requires a system that provides some synchrony, typically an upper bound on end-to-end message delays. Recently, we introduced an implementation of the perfect failure detector in a novel partially synchronous model, referred to as the Θ-Model, where only the ratio Θ of maximum vs. minimum end-to-end delay of messages that are simultaneously in transit must be known a priori (while the actual delays need not be known and not even be bounded). In this paper, we present an alternative failure detector algorithm, which is based on a clock synchronization algorithm for the Θ-Model. It not only surpasses our first implementation with respect to failure detection time, but also works during the system booting phase.
Josef Widder, Gérard Le Lann, Ulrich Schmid
Added 27 Jun 2010
Updated 27 Jun 2010
Type Conference
Year 2005
Where EDCC
Authors Josef Widder, Gérard Le Lann, Ulrich Schmid
Comments (0)