Sciweavers

CC
2009
Springer

Live Debugging of Distributed Systems

14 years 5 months ago
Live Debugging of Distributed Systems
Debugging distributed systems is challenging. Although incremental debugging during development finds some bugs, developers are rarely able to fully test their systems under realistic operating conditions prior to deployment. While deploying a system exposes it to realistic conditions, debugging requires the developer to: (i) detect a bug, (ii) gather the system state necessary for diagnosis, and (iii) sift through the gathered state to determine a root cause. In this paper, we present MaceODB, a tool to assist programmers with debugging deployed distributed systems. Programmers define a set of runtime properties for their system, which MaceODB checks for violations during execution. Once MaceODB detects a violation, it provides the programmer with the information to determine its root cause. We have been able to diagnose several non-trivial bugs in existing mature distributed systems using MaceODB; we discuss two of these bugs in this paper. Benchmarks indicate that the approach has l...
Darren Dao, Jeannie R. Albrecht, Charles Edwin Kil
Added 22 Nov 2009
Updated 22 Nov 2009
Type Conference
Year 2009
Where CC
Authors Darren Dao, Jeannie R. Albrecht, Charles Edwin Killian, Amin Vahdat
Comments (0)