Sciweavers

695 search results - page 67 / 139
» Cache based fault recovery for distributed systems
Sort
View
CCGRID
2008
IEEE
14 years 11 months ago
Fault Tolerance in Cluster Federations with O2P-CF
Fault tolerance is one of the key issues for large scale applications executed on high performance computing systems. In a cluster federation, clusters are gathered to provide hug...
Thomas Ropars, Christine Morin
CLUSTER
2003
IEEE
15 years 3 months ago
Coordinated Checkpoint versus Message Log for Fault Tolerant MPI
— Large Clusters, high availability clusters and Grid deployments often suffer from network, node or operating system faults and thus require the use of fault tolerant programmin...
Aurelien Bouteiller, Pierre Lemarinier, Gér...
MM
2005
ACM
371views Multimedia» more  MM 2005»
15 years 3 months ago
Data grid for large-scale medical image archive and analysis
Storage and retrieval technology for large-scale medical image systems has matured significantly during the past ten years but many implementations still lack cost-effective backu...
H. K. Huang, Aifeng Zhang, Brent J. Liu, Zheng Zho...
DEXA
2000
Springer
84views Database» more  DEXA 2000»
15 years 2 months ago
Protocol for Taking Object-Based Checkpoints
Object-based checkpoints are consistent in the object-based system but may be inconsistent according to the traditional message-based definition. We present a protocol for taking ...
Katsuya Tanaka, Makoto Takizawa
SRDS
1998
IEEE
15 years 1 months ago
A Metaobject Protocol for Fault-Tolerant CORBA Applications
Abstract: The use of meta-level architectures for the implementation of faulttolerant systems is today very appealing. Nevertheless, all existing fault-tolerant systems based on th...
Marc-Olivier Killijian, Jean-Charles Fabre, Juan-C...