PDCN
2007

Averages, distributions and scalability of MPI communication times for Ethernet and Myrinet networks

Most modern parallel computers are clusters using Myrinet or Ethernet communication networks. Several studies have compared the performance of these two networks for parallel computing; however, they focus on average performance and do not address the distributions of communication times, which can have long tails due to contention effects. In the case of Ethernet with TCP, retransmit timeouts (RTOs) can also occur. Slow communication events may have a significant impact, particularly for applications requiring frequent synchronization, where performance is determined by the slowest process. We have analysed the distributions of communication times for standard MPI routines on Ethernet with TCP and Myrinet with GM communication networks on the same cluster, and studied the scalability of the distributions as the number of communicating processes is increased, as well as the effect of RTOs for Ethernet with TCP.
KEY WORDS MPI benchmarks, parallel computer, network performa...
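The abstract's point that a synchronizing application runs at the pace of its slowest process can be illustrated with a small sketch. It is not the authors' method or data: it assumes a deliberately simple two-point model in which each process independently takes either a base time or, with a small probability, the base time plus a fixed delay. The function names and all numeric values below are hypothetical and chosen only for illustration.

```python
def prob_any_slow(p_slow: float, n_procs: int) -> float:
    """Probability that at least one of n_procs independent processes
    hits a slow communication event, each with probability p_slow."""
    return 1.0 - (1.0 - p_slow) ** n_procs


def expected_sync_time(base_us: float, delay_us: float,
                       p_slow: float, n_procs: int) -> float:
    """Expected time of one synchronized step under the two-point model.

    The step finishes when the slowest process finishes, so it costs
    base_us + delay_us whenever any process is slow, and base_us otherwise.
    """
    return base_us + delay_us * prob_any_slow(p_slow, n_procs)


if __name__ == "__main__":
    # Hypothetical values, not measurements from the paper.
    base_us = 100.0       # typical communication time in microseconds
    delay_us = 200_000.0  # assumed penalty of one slow event
    p_slow = 1e-4         # assumed per-process chance of a slow event

    for n in (2, 16, 128, 1024):
        print(n, prob_any_slow(p_slow, n),
              expected_sync_time(base_us, delay_us, p_slow, n))
```

Under these assumptions the per-process average barely changes, yet the expected cost of a synchronized step grows with the number of processes, which is why the tail of the distribution matters and not only the mean.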
Nor Asilah Wati Abdul Hamid, Paul D. Coddington
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2007
Where PDCN
Authors Nor Asilah Wati Abdul Hamid, Paul D. Coddington