Sciweavers

PVM
2005
Springer
A Peer-to-Peer Framework for Robust Execution of Message Passing Parallel Programs on Grids
Abstract. This paper presents P2P-MPI, a middleware aimed at computational grids. From the programmer's point of view, P2P-MPI provides a message-passing programming model which enab...
Stéphane Genaud, Choopan Rattanapoka
PVM
2005
Springer
Collective Error Detection for MPI Collective Operations
Abstract. An MPI profiling library is a standard mechanism for intercepting MPI calls by applications. Profiling libraries are so named because they are commonly used to gather p...
Christopher Falzone, Anthony Chan, Ewing L. Lusk, ...
PVM
2005
Springer
Scalable Fault Tolerant MPI: Extending the Recovery Algorithm
Abstract. Fault Tolerant MPI (FT-MPI) [6] was designed as a solution to allow applications different methods to handle process failures beyond simple check-point restart schemes. The init...
Graham E. Fagg, Thara Angskun, George Bosilca, Jel...
PVM
2005
Springer
Implementing OpenMP for Clusters on Top of MPI
llc is a language designed to extend OpenMP to distributed memory systems. Work in progress on the implementation of a compiler that translates llc code and targets distributed mem...
Antonio J. Dorta, José M. Badía, Enr...
PVM
2005
Springer
New User-Guided and ckpt-Based Checkpointing Libraries for Parallel MPI Applications
We present design and implementation details as well as performance results for two new parallel checkpointing libraries developed by us for parallel MPI applications. The first o...
Pawel Czarnul, Marcin Fraczak
PVM
2005
Springer
Analysis of the Component Architecture Overhead in Open MPI
Abstract. Component architectures provide a useful framework for developing an extensible and maintainable code base upon which large-scale software projects can be built. Component...
Brian Barrett, Jeffrey M. Squyres, Andrew Lumsdain...
PVM
2005
Springer
Designing a Common Communication Subsystem
Communication subsystems are used in high-performance parallel computing systems to abstract the lower network layer. By using a communication subsystem, an upper middleware librar...
Darius Buntinas, William Gropp
PVM
2005
Springer
Benchmarking One-Sided Communication with SKaMPI 5
Werner Augustin, Marc-Oliver Straub, Thomas Worsch
PVM
2005
Springer
A Comparison of Three MPI Implementations for Red Storm
Abstract. Cray Red Storm is a new distributed memory massively parallel computing platform designed to scale to tens of thousands of nodes. Red Storm has a custom network designed ...
Ron Brightwell