Sciweavers

CLUSTER
2005
IEEE
13 years 10 months ago
Transparent Checkpoint-Restart of Distributed Applications on Commodity Clusters
We have created ZapC, a novel system for transparent coordinated checkpoint-restart of distributed network applications on commodity clusters. ZapC provides a thin virtualization ...
Oren Laadan, Dan B. Phung, Jason Nieh
USENIX
2007
13 years 7 months ago
Transparent Checkpoint-Restart of Multiple Processes on Commodity Operating Systems
The ability to checkpoint a running application and restart it later can provide many useful benefits including fault recovery, advanced resources sharing, dynamic load balancing...
Oren Laadan, Jason Nieh
DSN
2005
IEEE
13 years 10 months ago
Cruz: Application-Transparent Distributed Checkpoint-Restart on Standard Operating Systems
G. John Janakiraman, Jose Renato Santos, Dinesh Su...
PDCAT
2009
Springer
13 years 11 months ago
CheCUDA: A Checkpoint/Restart Tool for CUDA Applications
In this paper, a tool named CheCUDA is designed to checkpoint CUDA applications that use GPUs as accelerators. As existing checkpoint/restart implementations do not supp...
Hiroyuki Takizawa, Katsuto Sato, Kazuhiko Komatsu,...