Sciweavers

231 search results - page 26 / 47
» Asynchronous failure detectors
Sort
View
99
Voted
ICDCS
2005
IEEE
15 years 10 months ago
The Impossibility of Boosting Distributed Service Resilience
We prove two theorems saying that no distributed system in which processes coordinate using reliable registers and -resilient services can solve the consensus problem in the prese...
Paul C. Attie, Rachid Guerraoui, Petr Kouznetsov, ...
IPPS
2009
IEEE
15 years 11 months ago
Compiler-enhanced incremental checkpointing for OpenMP applications
As modern supercomputing systems reach the peta-flop performance range, they grow in both size and complexity. This makes them increasingly vulnerable to failures from a variety ...
Greg Bronevetsky, Daniel Marques, Keshav Pingali, ...
138
Voted
GCC
2007
Springer
15 years 10 months ago
Spaces: Support for Decoupled Communication in Wide-Area Parallel Applications
Wide-area distributed systems like computational grids are emergent infrastructures for high-performance parallel applications. On these systems, communication mechanisms have to ...
Philip Chan, David Abramson
LCPC
2007
Springer
15 years 10 months ago
Compiler-Enhanced Incremental Checkpointing
As modern supercomputing systems reach the peta-flop performance range, they grow in both size and complexity. This makes them increasingly vulnerable to failures from a variety o...
Greg Bronevetsky, Daniel Marques, Keshav Pingali, ...
PDP
2006
IEEE
15 years 10 months ago
A B2B Distributed Replication Service
A deadlock free distributed replication service for B2B CORBA based applications is presented. This service provides persistent storage for commercial transactions performed by B2...
José Javier Astrain, Alberto Córdoba...