Sciweavers

29 search results - page 5 / 6
» Parallel FFT Algorithms on Network-on-Chips
Sort
View
CCGRID
2011
IEEE
12 years 9 months ago
Small Discrete Fourier Transforms on GPUs
– Efficient implementations of the Discrete Fourier Transform (DFT) for GPUs provide good performance with large data sizes, but are not competitive with CPU code for small data ...
S. Mitra, A. Srinivasan
PVM
2010
Springer
13 years 4 months ago
Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues
Abstract. Designing and tuning parallel applications with MPI, particularly at large scale, requires understanding the performance implications of different choices of algorithms ...
Torsten Hoefler, William Gropp, Rajeev Thakur, Jes...
PPSC
1997
13 years 7 months ago
The Future Fast Fourier Transform?
It seems likely that improvements in arithmetic speed will continue to outpace advances in communication bandwidth. Furthermore, as more and more problems are working on huge datas...
Alan Edelman, Peter McCorquodale, Sivan Toledo
IPPS
1997
IEEE
13 years 10 months ago
Performance Analysis and Optimization on a Parallel Atmospheric General Circulation Model Code
An analysis is presented of the primary factors influencing the performance of a parallel implementation of the UCLA atmospheric general circulation model (AGCM) on distributedme...
John Z. Lou, John D. Farrara
ICPP
2007
IEEE
14 years 17 days ago
Energy-Efficient Scheduling for Parallel Applications Running on Heterogeneous Clusters
High performance clusters have been widely used to provide amazing computing capability for both commercial and scientific applications. However, huge power consumption has preven...
Ziliang Zong, Xiao Qin, Xiaojun Ruan, Kiranmai Bel...