Sciweavers

SIGMOD
2005
ACM

Fault-tolerance in the Borealis distributed stream processing system

14 years 4 months ago
Fault-tolerance in the Borealis distributed stream processing system
We present a replication-based approach to fault-tolerant distributed stream processing in the face of node failures, network failures, and network partitions. Our approach aims to reduce the degree of inconsistency in the system while guaranteeing that available inputs capable of being processed are processed within a specified time threshold. This threshold allows a user to trade availability for consistency: a larger time threshold decreases availability but limits inconsistency, while a smaller threshold increases availability but produces more inconsistent results based on partial data. In addition, when failures heal, our scheme corrects previously produced results, ensuring eventual consistency. Our scheme uses a data-serializing operator to ensure that all replicas process data in the same order, and thus remain consistent in the absence of failures. To regain consistency after a failure heals, we experimentally compare approaches based on checkpoint/redo and undo/redo techniq...
Magdalena Balazinska, Hari Balakrishnan, Samuel Ma
Added 08 Dec 2009
Updated 08 Dec 2009
Type Conference
Year 2005
Where SIGMOD
Authors Magdalena Balazinska, Hari Balakrishnan, Samuel Madden, Michael Stonebraker
Comments (0)