Search Sciweavers | Sciweavers

USENIX
2007

102 views · Operating System

Transparent Checkpoint-Restart of Multiple Processes on Commodity Operating Systems

13 years 7 months ago

The ability to checkpoint a running application and restart it later can provide many useful benefits including fault recovery, advanced resources sharing, dynamic load balancing...

Oren Laadan, Jason Nieh


PVLDB
2008

110 views

Fault-tolerant stream processing using a distributed, replicated file system

13 years 4 months ago

Download www.cs.washington.edu

We present SGuard, a new fault-tolerance technique for distributed stream processing engines (SPEs) running in clusters of commodity servers. SGuard is less disruptive to normal s...

YongChul Kwon, Magdalena Balazinska, Albert G. Gre...


Presentation

324 views

Task scheduling algorithm for multicore processor system for minimizing recovery time in case of single node fault

11 years 11 months ago

Download www.slideshare.net

In this paper, we propose a task scheduling algorithm for a multicore processor system which reduces the recovery time in case of a single fail-stop failure of a multicore process...

posted by naokishibata


Publication

165 views

Task scheduling algorithm for multicore processor system for minimizing recovery time in case of single node fault

11 years 10 months ago

Download ito-lab.naist.jp

In this paper, we propose a task scheduling algorithm for a multicore processor system which reduces the recovery time in case of a single fail-stop failure of a multicore processo...

Shohei Gotoda, Naoki Shibata and Minoru Ito

posted by naokishibata


CLUSTER
2004
IEEE

180 views · Distributed And Parallel Com...

Improved message logging versus improved coordinated checkpointing for fault tolerant MPI

13 years 9 months ago

Download www.cs.utk.edu

Fault tolerance is a very important concern for critical high performance applications using the MPI library. Several protocols provide automatic and transparent fault detection a...

Pierre Lemarinier, Aurelien Bouteiller, Thomas H&e...
