Sciweavers

EUROSYS
2009
ACM

Transparent checkpoints of closed distributed systems in Emulab

14 years 1 months ago
Transparent checkpoints of closed distributed systems in Emulab
Emulab is a testbed for networked and distributed systems experimentation. Two guiding principles of its design are realism and control of experimentation. There is an inherent tension between these goals, however, and in some aspects of the testbed’s design, Emulab’s implementers favored realism over control. Thus, Emulab provides wide-ranging control over an experiment’s environment and initial conditions, but relatively little control over its execution—in particular, the ability to suspend, preempt, or replay the experiment. We have extended Emulab with a new means of control over experiment execution: the ability to cleanly checkpoint the execution of the set of nodes and networks that comprise an experiment. Conventional checkpoint mechanisms can easily degrade the fidelity of experiment results as a consequence of checkpoint downtimes, overheads of background state saving, and unintended distributed checkpoint synchronization effects. In this paper we demonstrate a che...
Anton Burtsev, Prashanth Radhakrishnan, Mike Hible
Added 10 Mar 2010
Updated 10 Mar 2010
Type Conference
Year 2009
Where EUROSYS
Authors Anton Burtsev, Prashanth Radhakrishnan, Mike Hibler, Jay Lepreau
Comments (0)