Making cloud intermediate data fault-tolerant

15 years 10 months ago

Download kepler.cs.uiuc.edu

Parallel dataﬂow programs generate enormous amounts of distributed data that are short-lived, yet are critical for completion of the job and for good run-time performance. We call this class of data as intermediate data. This paper is the ﬁrst to address intermediate data as a ﬁrst-class citizen, speciﬁcally targeting and minimizing the eﬀect of run-time server failures on the availability of intermediate data, and thus on performance metrics such as job completion time. We propose new design techniques for a new storage system called ISS (Intermediate Storage System), implement these techniques within Hadoop, and experimentally evaluate the resulting system. Under no failure, the performance of Hadoop augmented with ISS (i.e., job completion time) turns out to be comparable to base Hadoop. Under a failure, Hadoop with ISS outperforms base Hadoop and incurs up to 18% overhead compared to base no-failure Hadoop, depending on the testbed setup. Categories and Subject Descripto...

Steven Y. Ko, Imranul Hoque, Brian Cho, Indranil G

Real-time Traffic

CLOUD 2010 | Distributed And Parallel Computing | Hadoop | Intermediate Data | Job Completion Time |

claim paper

» RACS a case for cloud storage diversity

» NephelePACTs a programming model and execution framework for webscale analytical processin...

» Meshless geometric subdivision

» Correlationmaximizing surrogate gene space for visual mining of gene expression patterns i...

Post Info
More Details (n/a)

Added	10 Jul 2010
Updated	10 Jul 2010
Type	Conference
Year	2010
Where	CLOUD
Authors	Steven Y. Ko, Imranul Hoque, Brian Cho, Indranil Gupta

Comments (0)

Sciweavers

Making cloud intermediate data fault-tolerant

CLOUD 2010 | Distributed And Parallel Computing | Hadoop | Intermediate Data | Job Completion Time |

Explore & Download

Productivity Tools

Sciweavers