
EUROSYS
2006
ACM

Using queries for distributed monitoring and forensics

Distributed systems are hard to build, profile, debug, and test. Monitoring a distributed system – to detect and analyze bugs, test for regressions, identify fault-tolerance problems or security compromises – can be difficult and error-prone. In this paper we argue that declarative development of distributed systems is well suited to these tasks. We present an application logging, monitoring, and debugging facility that we have built on top of the P2 system, comprising an introspection model, an execution tracing component, and a distributed query processor. We use this facility to demonstrate on-line distributed diagnosis tools that range from simple, local state assertions to sophisticated global property detectors on consistent snapshots. These tools are small and simple, and can be deployed piecemeal on-line at any point during a system’s life cycle. Our evaluation suggests that the overhead of our approach to improving and monitoring running distributed sys...
Atul Singh, Petros Maniatis, Timothy Roscoe, Peter Druschel
Added 10 Mar 2010
Updated 10 Mar 2010
Type Conference
Year 2006
Where EUROSYS
Authors Atul Singh, Petros Maniatis, Timothy Roscoe, Peter Druschel