Sciweavers

SC
2004
ACM

Fastpath Optimizations for Cluster Recovery in Shared-Disk Systems

13 years 10 months ago
Fastpath Optimizations for Cluster Recovery in Shared-Disk Systems
We describe the design and implementation of a clustering service for a high-performance, shared-disk file system. The service provides failure detection and recovery, reliable endto-end messaging, and a centralized and recoverable management interface. We implement novel optimizations in the voting protocol that resolves cluster membership. Optimizations allow clusters to form as quickly as possible without introducing livelock or requiring timeout parameters to be tuned carefully. Our treatment includes performance results that quantify the scalability of the system and measure recovery times.
Randal C. Burns
Added 30 Jun 2010
Updated 30 Jun 2010
Type Conference
Year 2004
Where SC
Authors Randal C. Burns
Comments (0)