Sciweavers

25 search results - page 3 / 5
» Implementing OpenMP for Clusters on Top of MPI
PPOPP
2012
ACM
12 years 27 days ago
PARRAY: a unifying array representation for heterogeneous parallelism
This paper introduces a programming interface called PARRAY (or Parallelizing ARRAYs) that supports system-level succinct programming for heterogeneous parallel systems like GPU c...
Yifeng Chen, Xiang Cui, Hong Mei
ISPA
2005
Springer
13 years 10 months ago
Reducing the Overhead of Intra-Node Communication in Clusters of SMPs
This article presents the C++ library vShark, which reduces the intra-node communication overhead of parallel programs on clusters of SMPs. The library is built on top of m...
Sascha Hunold, Thomas Rauber
WOTUG
2008
13 years 6 months ago
Process-Oriented Collective Operations
Distributing process-oriented programs across a cluster of machines requires careful attention to the effects of network latency. The MPI standard, widely used for cluste...
John Markus Bjørndalen, Adam T. Sampson
CCGRID
2005
IEEE
13 years 11 months ago
Implementing MPI-IO atomic mode without file system support
The ROMIO implementation of the MPI-IO standard provides a portable infrastructure for use on top of any number of different underlying storage targets. These different targets va...
Robert B. Ross, Robert Latham, William Gropp, Raje...
CLUSTER
2009
IEEE
13 years 10 months ago
Finding a tradeoff between host interrupt load and MPI latency over Ethernet
Achieving high-performance message passing on top of generic Ethernet hardware suffers from the NIC interrupt-driven model, where coalescing is usually involved. We present an in-...
Brice Goglin, Nathalie Furmento