Sciweavers

307 search results - page 3 / 62
» On the Integrity of Lightweight Checkpoints
Sort
View
ISCA
2002
IEEE
115views Hardware» more  ISCA 2002»
15 years 3 months ago
SafetyNet: Improving the Availability of Shared Memory Multiprocessors with Global Checkpoint/Recovery
We develop an availability solution, called SafetyNet, that uses a unified, lightweight checkpoint/recovery mechanism to support multiple long-latency fault detection schemes. At...
Daniel J. Sorin, Milo M. K. Martin, Mark D. Hill, ...
105
Voted
ENTCS
2007
113views more  ENTCS 2007»
14 years 10 months ago
Modular Checkpointing for Atomicity
Transient faults that arise in large-scale software systems can often be repaired by re-executing the code in which they occur. Ascribing a meaningful semantics for safe re-execut...
Lukasz Ziarek, Philip Schatz, Suresh Jagannathan
88
Voted
SIAMSC
2010
132views more  SIAMSC 2010»
14 years 8 months ago
New Algorithms for Optimal Online Checkpointing
Frequently, the computation of derivatives for optimizing time-dependent problems is based on the integration of the adjoint differential equation. For this purpose, the knowledge...
Philipp Stumm, Andrea Walther
81
Voted
SRDS
2003
IEEE
15 years 3 months ago
Raptor: Integrating Checkpoints and Thread Migration for Cluster Management
distributed shared-memory (SDSM) provides the abstraction necessary to run shared-memory applications on cost-effective parallel platforms such as clusters of workstations. Howeve...
Hazim Shafi, Evan Speight, John K. Bennett
65
Voted
MICRO
2005
IEEE
122views Hardware» more  MICRO 2005»
15 years 3 months ago
Cherry-MP: Correctly Integrating Checkpointed Early Resource Recycling in Chip Multiprocessors
Meyrem Kirman, Nevin Kirman, José F. Mart&i...