Sciweavers

16 search results - page 1 / 4
» Highly-Available, Fault-Tolerant, Parallel Dataflows
Sort
View
SIGMOD
2004
ACM
151views Database» more  SIGMOD 2004»
14 years 4 months ago
Highly-Available, Fault-Tolerant, Parallel Dataflows
We present a technique that masks failures in a cluster to provide high availability and fault-tolerance for long-running, parallelized dataflows. We can use these dataflows to im...
Mehul A. Shah, Joseph M. Hellerstein, Eric A. Brew...
SRDS
1996
IEEE
13 years 8 months ago
Exploiting Data-Flow for Fault-Tolerance in a Wide-Area Parallel System
Wide-area parallel processing systems will soon be available to researchers to solve a range of problems. In these systems, it is certain that host failures and other faults will ...
Anh Nguyen-Tuong, Andrew S. Grimshaw, Mark Hyett
PSLS
1995
13 years 8 months ago
Fault Tolerance via Replication in Coarse Grain Data-Flow
Anh Nguyen-Tuong, Andrew S. Grimshaw, John F. Karp...
IPPS
2003
IEEE
13 years 9 months ago
Recovery Schemes for High Availability and High Performance Distributed Real-Time Computing
Clusters and distributed systems offer fault tolerance and high performance through load sharing, and are thus attractive in real-time applications. When all computers are up and ...
Lars Lundberg, Daniel Häggander, Kamilla Klon...
DEXAW
2004
IEEE
132views Database» more  DEXAW 2004»
13 years 8 months ago
Using Data-Flow Analysis for Resilience and Result Checking in Peer-To-Peer Computations
To achieve correct execution of peer-to-peer applications on non-reliable resources, we present a portable and distributed algorithm that provides fault tolerance and result checki...
Samir Jafar, Sébastien Varrette, Jean-Louis...