Sciweavers

1150 search results - page 10 / 230
» Advanced MPI Programming
Sort
View
CLUSTER
2003
IEEE
15 years 3 months ago
Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access Cost
The MPI Standard supports derived datatypes, which allow users to describe noncontiguous memory layout and communicate noncontiguous data with a single communication function. Thi...
Surendra Byna, William D. Gropp, Xian-He Sun, Raje...
PVM
2004
Springer
15 years 3 months ago
Heterogeneous Parallel Computing Across Multidomain Clusters
We propose lightweight middleware solutions that facilitate and simplify the execution of MPI programs across multidomain clusters. The system described in this paper leverages H2O...
Peter Hwang, Dawid Kurzyniec, Vaidy S. Sunderam
IPPS
2006
IEEE
15 years 3 months ago
A decomposition approach for optimizing the performance of MPI libraries
MPI provides a portable message passing interface for many parallel execution platforms but may lead to inefficiencies for some platforms and applications. In this article we sho...
O. Hartmann, Matthias Kühnemann, Thomas Raube...
EUROPAR
2003
Springer
15 years 3 months ago
KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Programs
Abstract. Today’s parallel computers with SMP nodes provide both multithreading and message passing as their modes of parallel execution. As a consequence, performance analysis a...
Bernd Mohr, Felix Wolf
SBACPAD
2005
IEEE
177views Hardware» more  SBACPAD 2005»
15 years 3 months ago
Automatic Data-Flow Graph Generation of MPI Programs
The Data-Flow Graph (DFG) of a parallel application is frequently used to take scheduling decisions, based on the information that it models (dependencies among the tasks and volu...
Rafael Ennes Silva, Guilherme P. Pezzi, Nicolas Ma...