Sciweavers

USENIX
2008

Improving Scalability and Fault Tolerance in an Application Management Infrastructure

13 years 6 months ago
Improving Scalability and Fault Tolerance in an Application Management Infrastructure
This paper explores the challenges associated with distributed application management in large-scale computing environments. In particular, we investigate several techniques for extending Plush, an existing distributed application management framework, to provide improved scalability and fault tolerance without sacrificing performance. One of the main limitations of Plush is the structure of the underlying communication fabric. We explain how we incorporated the use of an overlay tree provided by Mace, a toolkit that simplifies the implementation of overlay networks, in place of the existing communication subsystem in Plush to improve robustness and scalability.
Nikolay Topilski, Jeannie R. Albrecht, Amin Vahdat
Added 02 Oct 2010
Updated 02 Oct 2010
Type Conference
Year 2008
Where USENIX
Authors Nikolay Topilski, Jeannie R. Albrecht, Amin Vahdat
Comments (0)