Search Sciweavers | Sciweavers

20

ISCA
2011
IEEE

238views Hardware» more ISCA 2011»

Rebound: scalable checkpointing for coherent shared memory

12 years 9 months ago

As we move to large manycores, the hardware-based global checkpointing schemes that have been proposed for small shared-memory machines do not scale. Scalability barriers include ...

Rishi Agarwal, Pranav Garg, Josep Torrellas

claim paper

Read More »

17

click to vote

IPPS
1998
IEEE

104views Distributed And Parallel Com...» more IPPS 1998»

A Generalized Forward Recovery Checkpointing Scheme

13 years 9 months ago

Download ipdps.cc.gatech.edu

We propose a generalized forward recovery checkpointing scheme, with lookahead execution and rollback validation. This method takes advantage of voting and comparison on multiple v...

Ke Huang, Jie Wu, Eduardo B. Fernández

claim paper

Read More »

14

click to vote

CLUSTER
2003
IEEE

165views Distributed And Parallel Com...» more CLUSTER 2003»

Coordinated Checkpoint versus Message Log for Fault Tolerant MPI

13 years 10 months ago

Download www.cs.utk.edu

— Large Clusters, high availability clusters and Grid deployments often suffer from network, node or operating system faults and thus require the use of fault tolerant programmin...

Aurelien Bouteiller, Pierre Lemarinier, Gér...

claim paper

Read More »

14

click to vote

SIAMSC
2010

132views more SIAMSC 2010»

New Algorithms for Optimal Online Checkpointing

13 years 3 months ago

Download tu-dresden.de

Frequently, the computation of derivatives for optimizing time-dependent problems is based on the integration of the adjoint diﬀerential equation. For this purpose, the knowledge...

Philipp Stumm, Andrea Walther

claim paper

Read More »

11

click to vote

HPDC
2007
IEEE

129views Distributed And Parallel Com...» more HPDC 2007»

Failure-aware checkpointing in fine-grained cycle sharing systems

13 years 11 months ago

Download www.ecn.purdue.edu

Fine-Grained Cycle Sharing (FGCS) systems aim at utilizing the large amount of idle computational resources available on the Internet. Such systems allow guest jobs to run on a ho...

Xiaojuan Ren, Rudolf Eigenmann, Saurabh Bagchi

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers