
SRDS
1999
IEEE


Fault-Tolerant Replication Management in Large-Scale Distributed Storage Systems



Failures of all forms happen, from the loss of a single network packet to site-wide disasters. Since businesses rely heavily on their data, it is imperative that failures require minimal time and effort to repair and that the service interruption during the failure or repair period be as short as possible. To this end, the ideal system should repair itself, relying on humans only when absolutely necessary in the repair process. This paper describes one component of a self-healing storage system: the component that allows for automatic recovery of access to data when the power comes back on after a large-scale outage. Our failure recovery protocol is part of a suite of modular protocols that make up the Palladio distributed storage system. This protocol guarantees that service will be restored quickly and automatically once enough failures are repaired.

Richard A. Golding, Elizabeth Borowsky





Post Info

Added	04 Aug 2010
Updated	04 Aug 2010
Type	Conference
Year	1999
Where	SRDS
Authors	Richard A. Golding, Elizabeth Borowsky
