Sciweavers

1186 search results - page 34 / 238
» The Communication in Intelligent Distributed Fault Tolerant ...
Sort
View
ICA3PP
2010
Springer
14 years 10 months ago
Checkpointing and Migration of Communication Channels in Heterogeneous Grid Environments
Abstract. A grid checkpointing service providing migration and transparent fault tolerance is important for distributed and parallel applications executed in heterogeneous grids. I...
John Mehnert-Spahn, Michael Schoettner
IPPS
2007
IEEE
15 years 4 months ago
A Fault Tolerance Protocol with Fast Fault Recovery
Fault tolerance is an important issue for large machines with tens or hundreds of thousands of processors. Checkpoint-based methods, currently used on most machines, rollback all ...
Sayantan Chakravorty, Laxmikant V. Kalé
SRDS
2000
IEEE
15 years 2 months ago
Issues Insufficiently Resolved in Century 20 in the Fault-Tolerant Distributed Computing Field
: As Century 21 just opened up, it is a fitting time to reflect on the evolution of the fault-tolerant distributed computing technology that occurred in the last century. The autho...
K. H. Kim
JPDC
2008
132views more  JPDC 2008»
14 years 10 months ago
Assurance of dynamic adaptation in distributed systems
Long running applications often need to adapt due to changing requirements or changing environment. Typically, such adaptation is performed by dynamically adding or removing compo...
Karun N. Biyani, Sandeep S. Kulkarni
SOSP
2005
ACM
15 years 7 months ago
Fault-scalable Byzantine fault-tolerant services
A fault-scalable service can be configured to tolerate increasing numbers of faults without significant decreases in performance. The Query/Update (Q/U) protocol is a new tool t...
Michael Abd-El-Malek, Gregory R. Ganger, Garth R. ...