Search Sciweavers | Sciweavers

48 search results - page 9 / 10

» Self-stabilizing algorithm for checkpointing in a distribute...

click to vote

ICPP
1987
IEEE

114views Distributed And Parallel Com...» more ICPP 1987»

A Software-Based Hardware Fault Tolerance Scheme for Multicomputers

13 years 9 months ago

Download www.cs.ucla.edu

-- A hardware fault tolerance scheme for large multicomputers executing time-consuming non-interactive applications is described. Error detection and recovery are done mostly by so...

Yuval Tamir, Eli Gafni

claim paper

Read More »

click to vote

IPPS
2007
IEEE

133views Distributed And Parallel Com...» more IPPS 2007»

The Adaptive Code Kitchen: Flexible Tools for Dynamic Application Composition

14 years 3 days ago

Download people.cs.vt.edu

Driven by the increasing componentization of scientiﬁc codes, the deployment of high-end system infrastructures such as the Grid, and the desire to support high level problem so...

Pilsung Kang 0002, Mike Heffner, Joy Mukherjee, Na...

claim paper

Read More »

click to vote

ICPP
2007
IEEE

89views Distributed And Parallel Com...» more ICPP 2007»

Fault-Driven Re-Scheduling For Improving System-level Fault Resilience

14 years 4 days ago

Download www.cs.iit.edu

The productivity of HPC system is determined not only by their performance, but also by their reliability. The conventional method to limit the impact of failures is checkpointing...

Yawei Li, Prashasta Gujrati, Zhiling Lan, Xian-He ...

claim paper

Read More »

click to vote

HPDC
2009
IEEE

101views Distributed And Parallel Com...» more HPDC 2009»

Interconnect agnostic checkpoint/restart in open MPI

14 years 17 days ago

Download www.osl.iu.edu

Long running High Performance Computing (HPC) applications at scale must be able to tolerate inevitable faults if they are to harness current and future HPC systems. Message Passi...

Joshua Hursey, Timothy Mattox, Andrew Lumsdaine

claim paper

Read More »

click to vote

CCGRID
2007
IEEE

93views Distributed And Parallel Com...» more CCGRID 2007»

Reparallelization and Migration of OpenMP Programs

14 years 5 days ago

Download www2.informatik.uni-erlangen.de

Typical computational grid users target only a single cluster and have to estimate the runtime of their jobs. Job schedulers prefer short-running jobs to maintain a high system ut...

Michael Klemm, Matthias Bezold, Stefan Gabriel, Ro...

claim paper

Read More »

« Prev « First page 9 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers