Sciweavers

JFP
2010
107views more  JFP 2010»
13 years 3 months ago
Lightweight checkpointing for concurrent ML
Transient faults that arise in large-scale software systems can often be repaired by re-executing the code in which they occur. Ascribing a meaningful semantics for safe re-execut...
Lukasz Ziarek, Suresh Jagannathan
IPPS
2007
IEEE
13 years 11 months ago
An optimistic checkpointing and selective message logging approach for consistent global checkpoint collection in distributed sy
In this paper, we present an asynchronous consistent global checkpoint collection algorithm which prevents contention for network storage at the file server and hence reduces the...
Qiangfeng Jiang, D. Manivannan