Sciweavers

194 search results - page 11 / 39
» A Problem-Specific Fault-Tolerance Mechanism for Asynchronou...
ICTAC
2005
Springer
15 years 5 months ago
Revisiting Failure Detection and Consensus in Omission Failure Environments
It has recently been shown that fair exchange, a security problem in distributed systems, can be reduced to a fault tolerance problem, namely a special form of distributed consensu...
Carole Delporte-Gallet, Hugues Fauconnier, Felix C...
CCGRID
2008
IEEE
15 years 6 months ago
A Technique for Lock-Less Mirroring in Parallel File Systems
As parallel file systems span larger and larger numbers of nodes in order to provide the performance and scalability necessary for modern cluster applications, the need for fau...
Bradley W. Settlemyer, Walter B. Ligon III
CCGRID
2006
IEEE
15 years 5 months ago
MPI-Mitten: Enabling Migration Technology in MPI
Group communications are commonly used in parallel and distributed environments. However, existing migration mechanisms do not support group communications. This weakness prevents ...
Cong Du, Xian-He Sun
SIGSOFT
2008
ACM
16 years 11 days ago
Experimenting with exception propagation mechanisms in service-oriented architecture
Exception handling is one of the popular means used for improving dependability and supporting recovery in the Service-Oriented Architecture (SOA). This practical experience paper ...
Anatoliy Gorbenko, Alexander Romanovsky, Vyachesla...
ISPA
2007
Springer
15 years 5 months ago
A Resource Discovery and Allocation Mechanism in Large Computational Grids for Media Applications
There has been significant effort to build high throughput computing systems out of many distributed multimedia servers. These systems should accommodate a larger number of servers...
Chun-Fu Lin, Ruay-Shiung Chang