Sciweavers

441 search results - page 14 / 89
» Generic Timing Fault Tolerance using a Timely Computing Base
Sort
View
AINA
2004
IEEE
15 years 3 months ago
Region-based Stage Construction Protocol for Fault tolerant Execution of Mobile Agent
Fault tolerance is essential to the development of reliable mobile agent system in order to guarantee continuous execution of mobile agents. For this purpose, some previous works ...
SungJin Choi, MaengSoon Baik, HongSoo Kim, JunWeon...
88
Voted
CLUSTER
2004
IEEE
15 years 3 months ago
FTC-Charm++: an in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI
As high performance clusters continue to grow in size, the mean time between failure shrinks. Thus, the issues of fault tolerance and reliability are becoming one of the challengi...
Gengbin Zheng, Lixia Shi, Laxmikant V. Kalé
76
Voted
ESCIENCE
2007
IEEE
15 years 6 months ago
Intelligent Selection of Fault Tolerance Techniques on the Grid
The emergence of computational grids has lead to an increased reliance on task schedulers that can guarantee the completion of tasks that are executed on unreliable systems. There...
Daniel C. Vanderster, Nikitas J. Dimopoulos, Randa...
HPCC
2010
Springer
14 years 12 months ago
A Generic Execution Management Framework for Scientific Applications
Managing the execution of scientific applications in a heterogeneous grid computing environment can be a daunting task, particularly for long running jobs. Increasing fault tolera...
Tanvire Elahi, Cameron Kiddle, Rob Simmonds
ICS
2011
Tsinghua U.
14 years 3 months ago
High performance linpack benchmark: a fault tolerant implementation without checkpointing
The probability that a failure will occur before the end of the computation increases as the number of processors used in a high performance computing application increases. For l...
Teresa Davies, Christer Karlsson, Hui Liu, Chong D...