Sciweavers

PVM
2007
Springer
13 years 10 months ago
An Extensible Framework for Distributed Testing of MPI Implementations
Complex code bases require continual testing to ensure that both new development and routine maintenance do not create unintended side effects. Automation of regression testing is...
Joshua Hursey, Ethan Mallove, Jeffrey M. Squyres, ...
ISSTA
2006
ACM
13 years 10 months ago
Improving distributed memory applications testing by message perturbation
We present initial work on perturbation techniques that cause the manifestation of timing-related bugs in distributed memory Message Passing Interface (MPI)-based applications. Th...
Richard W. Vuduc, Martin Schulz, Daniel J. Quinlan...
PVM
2007
Springer
13 years 10 months ago
Extending the MPI-2 Generalized Request Interface
The MPI-2 standard added a new feature to MPI called generalized requests. Generalized requests allow users to add new nonblocking operations to MPI while still using man...
Robert Latham, William Gropp, Robert B. Ross, Raje...
IPPS
2007
IEEE
13 years 11 months ago
The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI
To be able to fully exploit ever larger computing platforms, modern HPC applications and system software must be able to tolerate inevitable faults. Historically, MPI implementati...
Joshua Hursey, Jeffrey M. Squyres, Timothy Mattox,...
CCGRID
2006
IEEE
13 years 10 months ago
Proposal of MPI Operation Level Checkpoint/Rollback and One Implementation
With the increasing number of processors in modern HPC (High Performance Computing) systems, there are two emergent problems to solve. One is scalability, the other is fault tolera...
Yuan Tang, Graham E. Fagg, Jack Dongarra