Sciweavers

NSDI
2007

WiDS Checker: Combating Bugs in Distributed Systems

13 years 7 months ago
WiDS Checker: Combating Bugs in Distributed Systems
Despite many efforts, the predominant practice of debugging a distributed system is still printf-based log mining, which is both tedious and error-prone. In this paper, we present WiDS Checker, a unified framework that can check distributed systems through both simulation and reproduced runs from real deployment. All instances of a distributed system can be executed within one simulation process, multiplexed properly to observe the “happensbefore” relationship, thus accurately reveal full system state. A versatile script language allows a developer to refine system properties into straightforward assertions, which the checker inspects for violations. Combining these two components, we are able to check distributed properties that are otherwise impossible to check. We applied WiDS Checker over a suite of complex and real systems and found non-trivial bugs, including one in a previously proven Paxos specification. Our experience demonstrates the usefulness of the checker and allo...
Xuezheng Liu, Wei Lin, Aimin Pan, Zheng Zhang
Added 02 Oct 2010
Updated 02 Oct 2010
Type Conference
Year 2007
Where NSDI
Authors Xuezheng Liu, Wei Lin, Aimin Pan, Zheng Zhang
Comments (0)