Sciweavers

7 search results - page 1 / 2
» Exploring Failure Transparency and the Limits of Generic Rec...
Sort
View
OSDI
2000
ACM
13 years 6 months ago
Exploring Failure Transparency and the Limits of Generic Recovery
: We explore the abstraction of failure transparency in which the operating system provides the illusion of failure-free operation. To provide failure transparency, an operating sy...
David E. Lowell, Subhachandra Chandra, Peter M. Ch...
ISSRE
2002
IEEE
13 years 9 months ago
The Impact of Recovery Mechanisms on the Likelihood of Saving Corrupted State
Recovery systems must save state before a failure occurs to enable the system to recover from the failure. However, recovery will fail if the recovery system saves any state corru...
Subhachandra Chandra, Peter M. Chen
SIGMOD
2002
ACM
91views Database» more  SIGMOD 2002»
14 years 4 months ago
Phoenix Project: Fault-Tolerant Applications
After a system crash, databases recover to the last committed transaction, but applications usually either crash or cannot continue. The Phoenix purpose is to enable application s...
Roger S. Barga, David B. Lomet
ICAC
2008
IEEE
13 years 11 months ago
Runtime Fault-Handling for Job-Flow Management in Grid Environments
The execution of job flow applications is a reality today in academic and industrial domains. In this paper, we propose an approach to adding self-healing behavior to the executio...
Gargi Dasgupta, Onyeka Ezenwoye, Liana Fong, Selim...
DSN
2011
IEEE
12 years 4 months ago
Coercing clients into facilitating failover for object delivery
Abstract—Application-level protocols used for object delivery, such as HTTP, are built atop TCP/IP and inherit its hostabstraction. Given that these services are replicated for s...
Wyatt Lloyd, Michael J. Freedman