USENIX
2007

Transparent Checkpoint-Restart of Multiple Processes on Commodity Operating Systems

The ability to checkpoint a running application and restart it later provides many benefits, including fault recovery, advanced resource sharing, dynamic load balancing, and improved service availability. However, applications often consist of multiple processes that depend on one another through the operating system. We present a transparent mechanism for commodity operating systems that can checkpoint multiple processes in a consistent state so that they can be restarted correctly at a later time. We introduce an efficient algorithm for recording process relationships and correctly saving and restoring shared state in a manner that leverages existing operating system kernel functionality. We have implemented our system as a loadable kernel module and user-space utilities in Linux. We demonstrate its ability on real-world applications to provide transparent checkpoint-restart functionality without modifying, recompiling, or relinking applications, libraries, or the operating sys...
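To make the idea of recording process relationships concrete, here is a minimal Python sketch. It is not the algorithm from the paper, whose details the abstract does not give. It only illustrates one constraint a restart must respect: a process has to be recreated before its children. The `ProcRecord` type, its fields, and the `creation_order` function are hypothetical names introduced for illustration.

```python
from collections import defaultdict, deque
from dataclasses import dataclass


@dataclass(frozen=True)
class ProcRecord:
    """Hypothetical per-process record saved at checkpoint time."""
    pid: int   # process ID at checkpoint
    ppid: int  # parent process ID at checkpoint
    sid: int   # session ID (recorded here, not used for ordering)


def creation_order(records):
    """Return PIDs ordered so every process follows its parent.

    A process whose parent is not part of the checkpointed set is
    treated as a root of the restored process forest.
    """
    known = {r.pid for r in records}
    children = defaultdict(list)
    roots = []
    # Sort by PID so the result is deterministic.
    for r in sorted(records, key=lambda r: r.pid):
        if r.ppid in known and r.ppid != r.pid:
            children[r.ppid].append(r.pid)
        else:
            roots.append(r.pid)

    # Breadth-first walk of the forest: parents are emitted before children.
    order = []
    queue = deque(roots)
    while queue:
        pid = queue.popleft()
        order.append(pid)
        queue.extend(children[pid])
    return order


# Assumed example snapshot: PID 100 is the root of the checkpointed group.
snapshot = [
    ProcRecord(pid=120, ppid=100, sid=100),
    ProcRecord(pid=100, ppid=1, sid=100),
    ProcRecord(pid=130, ppid=120, sid=100),
    ProcRecord(pid=110, ppid=100, sid=100),
]
order = creation_order(snapshot)
print(order)
```

A real system would also have to record and restore state shared between processes, such as open files and memory, which this sketch leaves out entirely.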
Oren Laadan, Jason Nieh
Added 02 Oct 2010
Updated 02 Oct 2010
Type Conference
Year 2007
Where USENIX