Search Sciweavers | Sciweavers

1113 search results - page 2 / 223

» Performance under Failures of DAG-based Parallel Computing

IPPS
2009
IEEE

Robust sequential resource allocation in heterogeneous distributed systems with random compute node failures

13 years 11 months ago

The problem of finding efficient workload distribution techniques is becoming increasingly important today for heterogeneous distributed systems where the availability of comp...

Vladimir Shestak, Edwin K. P. Chong, Anthony A. Ma...

JSSPP
2004
Springer

Performance Implications of Failures in Large-Scale Cluster Scheduling

13 years 10 months ago

As we continue to evolve into large-scale parallel systems, many of them employing hundreds of computing engines to take on mission-critical roles, it is crucial to design those s...

Yanyong Zhang, Mark S. Squillante, Anand Sivasubra...

IPPS
2006
IEEE

Algorithm-based checkpoint-free fault tolerance for parallel matrix computations on volatile resources

13 years 11 months ago

As the desire of scientists to perform ever larger computations drives the size of today’s high performance computers from hundreds, to thousands, and even tens of thousands of ...

Zizhong Chen, Jack Dongarra

PPOPP
2005
ACM

Fault tolerant high performance computing by a coding approach

13 years 10 months ago

As the number of processors in today’s high performance computers continues to grow, the mean-time-to-failure of these computers are becoming significantly shorter than the exe...

Zizhong Chen, Graham E. Fagg, Edgar Gabriel, Julie...

MOBIHOC
2010
ACM

Data preservation under spatial failures in sensor networks

13 years 2 months ago

In this paper, we address the problem of preserving generated data in a sensor network in case of node failures. We focus on the type of node failures that have explicit spatial s...

Navid Hamed Azimi, Himanshu Gupta, Xiaoxiao Hou, J...
