Sciweavers

21 search results - page 1 / 5
» Job-Site Level Fault Tolerance for Cluster and Grid environm...
Sort
View
50
Voted
CLUSTER
2005
IEEE
15 years 3 months ago
Job-Site Level Fault Tolerance for Cluster and Grid environments
Kshitij Limaye, Box Leangsuksun, Zeno Greenwood, S...
65
Voted
IJHPCA
2006
114views more  IJHPCA 2006»
14 years 10 months ago
Fault-Tolerant Scheduling of Fine-Grained Tasks in Grid Environments
Divide-and-conquer is a well-suited programming paradigm for parallel Grid applications. Our Satin system efficiently schedules the finegrained tasks of a divide-and-conquer appli...
Gosia Wrzesinska, Rob van Nieuwpoort, Jason Maasse...
71
Voted
ICDCN
2009
Springer
15 years 5 months ago
FTRepMI: Fault-Tolerant, Sequentially-Consistent Object Replication for Grid Applications
We introduce FTRepMI, a simple fault-tolerant protocol for providing sequential consistency amongst replicated objects in a grid, without using any centralized components. FTRepMI ...
Ana-Maria Oprescu, Thilo Kielmann, Wan Fokkink
FGCS
2002
153views more  FGCS 2002»
14 years 10 months ago
HARNESS fault tolerant MPI design, usage and performance issues
Initial versions of MPI were designed to work efficiently on multi-processors which had very little job control and thus static process models. Subsequently forcing them to suppor...
Graham E. Fagg, Jack Dongarra