Sciweavers

901 search results - page 102 / 181
» Hiding Communication Latency in Data Parallel Applications
Sort
View
121
Voted
CLUSTER
2009
IEEE
15 years 7 months ago
Message passing for GPGPU clusters: CudaMPI
—We present and analyze two new communication libraries, cudaMPI and glMPI, that provide an MPI-like message passing interface to communicate data stored on the graphics cards of...
Orion S. Lawlor
97
Voted
HPCA
2000
IEEE
15 years 5 months ago
A Prefetching Technique for Irregular Accesses to Linked Data Structures
Prefetching offers the potential to improve the performance of linked data structure (LDS) traversals. However, previously proposed prefetching methods only work well when there i...
Magnus Karlsson, Fredrik Dahlgren, Per Stenstr&oum...
99
Voted
FGCN
2007
IEEE
109views Communications» more  FGCN 2007»
15 years 7 months ago
Flow Balancing Hardware for Parallel TCP Streams on Long Fat Pipe Network
Parallel TCP streams are used for data transfer between clusters in today's high performance applications. When parallel TCP streams are used on LFN, part of streams fail to ...
Yutaka Sugawara, Mary Inaba, Kei Hiraki
120
Voted
PACT
2009
Springer
15 years 7 months ago
Parallel FFT with Eden Skeletons
The notion of Fast Fourier Transformation (FFT) describes a range of efficient algorithms to compute the discrete Fourier transformation, frequency distribution in a signal. FFT pl...
Jost Berthold, Mischa Dieterle, Oleg Lobachev, Rit...
100
Voted
LCN
2006
IEEE
15 years 6 months ago
Considerations of SCTP Retransmission Delays for Thin Streams
The popularity of distributed interactive applications has exploded in the last few years. For example, massive multi-player online games have become a fast growing, multi-million...
Jon Pedersen, Carsten Griwodz, Pål Halvorsen