Sciweavers

392 search results - page 1 / 79
» Fault Tolerance in a DSM Cluster Operating System
Sort
View
ARCS
2004
Springer
13 years 8 months ago
Fault Tolerance in a DSM Cluster Operating System
Michael Schöttner, Stefan Frenz, Ralph Gö...
SC
2000
ACM
13 years 8 months ago
Scalable Fault-Tolerant Distributed Shared Memory
This paper shows how a state-of-the-art software distributed shared-memory (DSM) protocol can be efficiently extended to tolerate single-node failures. In particular, we extend a ...
Florin Sultan, Thu D. Nguyen, Liviu Iftode
FPGA
2000
ACM
141views FPGA» more  FPGA 2000»
13 years 8 months ago
Tolerating operational faults in cluster-based FPGAs
In recent years the application space of reconfigurable devices has grown to include many platforms with a strong need for fault tolerance. While these systems frequently contain ...
Vijay Lakamraju, Russell Tessier
PVM
2007
Springer
13 years 10 months ago
Using CMT in SCTP-Based MPI to Exploit Multiple Interfaces in Cluster Nodes
Many existing clusters use inexpensive Gigabit Ethernet and often have multiple interfaces cards to improve bandwidth and enhance fault tolerance. We investigate the use of Concurr...
Brad Penoff, Mike Tsai, Janardhan R. Iyengar, Alan...