Sciweavers

1033 search results - page 2 / 207
» An autonomic failure-detection algorithm
Sort
View
IPPS
2007
IEEE
14 years 16 days ago
Fast Failure Detection in a Process Group
Failure detectors represent a very important building block in distributed applications. The speed and the accuracy of the failure detectors is critical to the performance of the ...
Xinjie Li, Monica Brockmeyer
KDD
2005
ACM
178views Data Mining» more  KDD 2005»
13 years 11 months ago
Failure detection and localization in component based systems by online tracking
The increasing complexity of today’s systems makes fast and accurate failure detection essential for their use in mission-critical applications. Various monitoring methods provi...
Haifeng Chen, Guofei Jiang, Cristian Ungureanu, Ke...
EDCC
2005
Springer
13 years 11 months ago
Failure Detection with Booting in Partially Synchronous Systems
Unreliable failure detectors are a well known means to enrich asynchronous distributed systems with time-free semantics that allow to solve consensus in the presence of crash failu...
Josef Widder, Gérard Le Lann, Ulrich Schmid
HPDC
2008
IEEE
14 years 21 days ago
Issues in applying data mining to grid job failure detection and diagnosis
As grid computation systems become larger and more complex, manually diagnosing failures in jobs becomes impractical. Recently, machine-learning techniques have been proposed to d...
Lakshmikant Shrinivas, Jeffrey F. Naughton
SSS
2005
Springer
13 years 11 months ago
On the Possibility and the Impossibility of Message-Driven Self-stabilizing Failure Detection
Abstract. This paper considers message-driven self-stabilizing implementations of unreliable failure detectors. We show that it is impossible to give a deterministic implementation...
Martin Hutle, Josef Widder