Sciweavers

PVM
2015
Springer
7 years 11 months ago
Plan B: Interruption of Ongoing MPI Operations to Support Failure Recovery
Advanced failure recovery strategies in HPC systems benefit tremendously from in-place failure recovery, in which the MPI infrastructure can survive process crashes and resume com...
PVM
2015
Springer
7 years 11 months ago
A Memory Management System Optimized for BDMPI's Memory and Execution Model
There is a growing need to perform large computations on small systems, as access to large systems is not widely available and cannot keep up with the scaling of data. BDMPI was r...
PVM
2015
Springer
7 years 11 months ago
On the Impact of Synchronizing Clocks and Processes on Benchmarking MPI Collectives
We consider the problem of accurately measuring the time to complete an MPI collective operation, as the result strongly depends on how the time is measured. Our goal is to develo...
PVM
2015
Springer
7 years 11 months ago
Toward Operating System Support for Scalable Multithreaded Message Passing
Modern CPU architectures provide a large number of processing cores, and application programmers are increasingly looking at hybrid programming models, where multiple threads of a ...
PVM
2015
Springer
7 years 11 months ago
Isomorphic, Sparse MPI-like Collective Communication Operations for Parallel Stencil Computations
We propose a specification and discuss implementations of collective operations for parallel stencil-like computations that are not supported well by the current MPI 3.1 neighbor...