Search Sciweavers | Sciweavers

35 search results - page 2 / 7

» Transparent checkpoints of closed distributed systems in Emu...

click to vote

CLUSTER
2005
IEEE

115views Distributed And Parallel Com...» more CLUSTER 2005»

Transparent Checkpoint-Restart of Distributed Applications on Commodity Clusters

13 years 11 months ago

Download www.cs.columbia.edu

We have created ZapC, a novel system for transparent coordinated checkpoint-restart of distributed network applications on commodity clusters. ZapC provides a thin virtualization ...

Oren Laadan, Dan B. Phung, Jason Nieh

claim paper

Read More »

click to vote

EGC
2005
Springer

127views Distributed And Parallel Com...» more EGC 2005»

Transparent Fault Tolerance for Grid Applications

13 years 11 months ago

Download www.st.ewi.tudelft.nl

A major challenge facing grid applications is the appropriate handling of failures. In this paper we address the problem of making parallel Java applications based on Remote Method...

Pawel Garbacki, Bartosz Biskupski, Henri E. Bal

claim paper

Read More »

click to vote

PODC
1994
ACM

134views Distributed and Parallel Com...» more PODC 1994»

A Checkpoint Protocol for an Entry Consistent Shared Memory System

13 years 10 months ago

Download research.microsoft.com

Workstation clusters are becoming an interesting alternative to dedicated multiprocessors. In this environment, the probability of a failure, during an application's executio...

Nuno Neves, Miguel Castro, Paulo Guedes

claim paper

Read More »

click to vote

IPPS
2005
IEEE

159views Distributed And Parallel Com...» more IPPS 2005»

Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance

13 years 11 months ago

Download hpc.pnl.gov

Checkpoint/restart is a general idea for which particular implementations enable various functionalities in computer systems, including process migration, gang scheduling, hiberna...

José Carlos Sancho, Fabrizio Petrini, Kei D...

claim paper

Read More »

click to vote

CLUSTER
2003
IEEE

165views Distributed And Parallel Com...» more CLUSTER 2003»

Coordinated Checkpoint versus Message Log for Fault Tolerant MPI

13 years 11 months ago

Download www.cs.utk.edu

— Large Clusters, high availability clusters and Grid deployments often suffer from network, node or operating system faults and thus require the use of fault tolerant programmin...

Aurelien Bouteiller, Pierre Lemarinier, Gér...

claim paper

Read More »

« Prev « First page 2 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers