Sciweavers

CASCON
1996

Availability management of distributed programs and services

13 years 5 months ago
Availability management of distributed programs and services
Modern distributed applications pose increasing demands for high availability, automatic management, and dynamic con guration of their software systems. This paper presents the architecture of Sampa, a System for Availability Management of Process-based Applications, which aims at ful lling these requirements. The system has been designed to support the management of faulttolerant DCE-based distributed programs according to user-provided and application-speci c availability speci cations. It is supposed to detect and automatically react to faults such as node crashes, network partitions, process crashes, and hang-ups. In this paper, we focus on the design of some of its services { the monitoring, checkpointing, and con guration management facilities { and show how they can be used for managing a generic fault-tolerant service.
Markus Endler
Added 02 Nov 2010
Updated 02 Nov 2010
Type Conference
Year 1996
Where CASCON
Authors Markus Endler
Comments (0)