Sciweavers

16 search results - page 1 / 4
» Highly-Available, Fault-Tolerant, Parallel Dataflows
Sort
View
SIGMOD
2004
ACM
151views Database» more  SIGMOD 2004»
16 years 6 months ago
Highly-Available, Fault-Tolerant, Parallel Dataflows
We present a technique that masks failures in a cluster to provide high availability and fault-tolerance for long-running, parallelized dataflows. We can use these dataflows to im...
Mehul A. Shah, Joseph M. Hellerstein, Eric A. Brew...
SRDS
1996
IEEE
15 years 11 months ago
Exploiting Data-Flow for Fault-Tolerance in a Wide-Area Parallel System
Wide-area parallel processing systems will soon be available to researchers to solve a range of problems. In these systems, it is certain that host failures and other faults will ...
Anh Nguyen-Tuong, Andrew S. Grimshaw, Mark Hyett
PSLS
1995
15 years 10 months ago
Fault Tolerance via Replication in Coarse Grain Data-Flow
Anh Nguyen-Tuong, Andrew S. Grimshaw, John F. Karp...
189
Voted
IPPS
2003
IEEE
16 years 23 hour ago
Recovery Schemes for High Availability and High Performance Distributed Real-Time Computing
Clusters and distributed systems offer fault tolerance and high performance through load sharing, and are thus attractive in real-time applications. When all computers are up and ...
Lars Lundberg, Daniel Häggander, Kamilla Klon...
188
Voted
DEXAW
2004
IEEE
132views Database» more  DEXAW 2004»
15 years 10 months ago
Using Data-Flow Analysis for Resilience and Result Checking in Peer-To-Peer Computations
To achieve correct execution of peer-to-peer applications on non-reliable resources, we present a portable and distributed algorithm that provides fault tolerance and result checki...
Samir Jafar, Sébastien Varrette, Jean-Louis...