Sciweavers

1166 search results - page 9 / 234
» Crash Management for Distributed Parallel Systems
Sort
View
SAC
2009
ACM
15 years 6 months ago
Response time analysis of software transactional memory-based distributed real-time systems
We consider distributed real-time systems where concurrency control is managed using software transactional memory. For such a method we propose an algorithm to compute an upper b...
Sherif Fadel Fahmy, Binoy Ravindran, E. Douglas Je...
86
Voted
PODC
2010
ACM
15 years 3 months ago
Brief announcement: a shared disk on distributed storage
A shared disk implementation on distributed storage requires consistent behavior of disk operations. Deterministic consensus on such behavior is impossible when even a single stor...
Stefan Vijzelaar, Herbert Bos, Wan Fokkink
90
Voted
IPPS
2007
IEEE
15 years 6 months ago
A Fault Tolerance Protocol with Fast Fault Recovery
Fault tolerance is an important issue for large machines with tens or hundreds of thousands of processors. Checkpoint-based methods, currently used on most machines, rollback all ...
Sayantan Chakravorty, Laxmikant V. Kalé