Sciweavers

» Automated Performance Prediction of Message-Passing Parallel...
CLUSTER
2009
IEEE
15 years 8 months ago
Reducing network contention with mixed workloads on modern multicore clusters
Abstract—Multi-core systems are now extremely common in modern clusters. In the past, commodity systems may have had up to two or four CPUs per compute node. In modern clusters, t...
Matthew J. Koop, Miao Luo, Dhabaleswar K. Panda
EUROPAR
2006
Springer
15 years 5 months ago
Model-Based Performance Diagnosis of Master-Worker Parallel Computations
Parallel performance tuning naturally involves a diagnosis process to locate and explain sources of program inefficiency. Proposed is an approach that exploits parallel computation...
Li Li, Allen D. Malony
ICDCS
1998
IEEE
15 years 5 months ago
LBF: A Performance Metric for Program Reorganization
We introduce a new performance metric, called Load Balancing Factor (LBF), to assist programmers with evaluating different tuning alternatives. The LBF metric differs from traditi...
Hyeonsang Eom, Jeffrey K. Hollingsworth
EUROPAR
2004
Springer
15 years 6 months ago
Implementing MPI on the BlueGene/L Supercomputer
The BlueGene/L supercomputer will consist of 65,536 dual-processor compute nodes interconnected by two high-speed networks: a three-dimensional torus network and a tree topology ne...
George Almási, Charles Archer, José ...
EUROPAR
1997
Springer
15 years 4 months ago
The Performance Potential of Value and Dependence Prediction
Abstract. The serialization constraints induced by the detection and enforcement of true data dependences have always been regarded as requirements for correct execution. We propos...
Mikko H. Lipasti, John Paul Shen