Search Sciweavers | Sciweavers

668 search results - page 2 / 134

» Implementing and Evaluating Automatic Checkpointing

158

click to vote

IPPS
2007
IEEE

102views Distributed And Parallel Com...» more IPPS 2007»

DejaVu: Transparent User-Level Checkpointing, Migration, and Recovery for Distributed Systems

16 years 1 months ago

Download www.cecs.uci.edu

In this paper, we present a new fault tolerance system called DejaVu for transparent and automatic checkpointing, migration, and recovery of parallel and distributed applications....

Joseph F. Ruscio, Michael A. Heffner, Srinidhi Var...

claim paper

Read More »

155

click to vote

CCGRID
2006
IEEE

97views Distributed And Parallel Com...» more CCGRID 2006»

Transparent Adaptive Library-Based Checkpointing for Master-Worker Style Parallelism

16 years 1 months ago

Download people.csail.mit.edu

We present a transparent, system-level checkpointing solution for master-worker parallelism that automatically adapts, upon restart, to the number of processor nodes available. Th...

Gene Cooperman, Jason Ansel, Xiaoqin Ma

claim paper

Read More »

178

click to vote

EMSOFT
2006
Springer

100views Software Engineering» more EMSOFT 2006»

Implementing fault-tolerance in real-time systems by automatic program transformations

15 years 10 months ago

Download pop-art.inrialpes.fr

We present a formal approach to implement and certify fault-tolerance in real-time embedded systems. The faultintolerant initial system consists of a set of independent periodic t...

Tolga Ayav, Pascal Fradet, Alain Girault

claim paper

Read More »

204

click to vote

CLUSTER
2003
IEEE

165views Distributed And Parallel Com...» more CLUSTER 2003»

Coordinated Checkpoint versus Message Log for Fault Tolerant MPI

16 years 10 days ago

Download www.cs.utk.edu

— Large Clusters, high availability clusters and Grid deployments often suffer from network, node or operating system faults and thus require the use of fault tolerant programmin...

Aurelien Bouteiller, Pierre Lemarinier, Gér...

claim paper

Read More »

219

click to vote

ASPLOS
2011
ACM

201views Programming Languages» more ASPLOS 2011»

Mementos: system support for long-running computation on RFID-scale devices

14 years 10 months ago

Download www.cs.dartmouth.edu

Transiently powered computing devices such as RFID tags, kinetic energy harvesters, and smart cards typically rely on programs that complete a task under tight time constraints be...

Benjamin Ransford, Jacob Sorber, Kevin Fu

claim paper

Read More »

« Prev « First page 2 / 134 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers