Search Sciweavers | Sciweavers

29 search results - page 2 / 6

» Using Golomb Rulers for Optimal Recovery Schemes in Fault To...

ICS
2011
Tsinghua U.

High performance linpack benchmark: a fault tolerant implementation without checkpointing

12 years 9 months ago

Download inside.mines.edu

The probability that a failure will occur before the end of the computation increases as the number of processors used in a high performance computing application increases. For l...

Teresa Davies, Christer Karlsson, Hui Liu, Chong D...

IPPS
1998
IEEE

A Generalized Forward Recovery Checkpointing Scheme

13 years 10 months ago

Download ipdps.cc.gatech.edu

We propose a generalized forward recovery checkpointing scheme, with lookahead execution and rollback validation. This method takes advantage of voting and comparison on multiple v...

Ke Huang, Jie Wu, Eduardo B. Fernández

IPPS
2007
IEEE

Fault-Tolerant Earliest-Deadline-First Scheduling Algorithm

14 years 16 days ago

Download www.cecs.uci.edu

The general approach to fault tolerance in uniprocessor systems is to maintain enough time redundancy in the schedule so that any task instance can be re-executed in presence of f...

Hakem Beitollahi, Seyed Ghassem Miremadi, Geert De...

IPPS
2003
IEEE

Recovery Schemes for High Availability and High Performance Distributed Real-Time Computing

13 years 11 months ago

Download www.ipd.bth.se

Clusters and distributed systems offer fault tolerance and high performance through load sharing, and are thus attractive in real-time applications. When all computers are up and ...

Lars Lundberg, Daniel Häggander, Kamilla Klon...

ICPP
1987
IEEE

A Software-Based Hardware Fault Tolerance Scheme for Multicomputers

13 years 9 months ago

Download www.cs.ucla.edu

A hardware fault tolerance scheme for large multicomputers executing time-consuming non-interactive applications is described. Error detection and recovery are done mostly by so...

Yuval Tamir, Eli Gafni
