Sciweavers

ICS
2009
Tsinghua U.

Computer generation of fast fourier transforms for the cell broadband engine

13 years 11 months ago
Computer generation of fast fourier transforms for the cell broadband engine
The Cell BE is a multicore processor with eight vector accelerators (called SPEs) that implement explicit cache management through direct memory access engines. While the Cell has an impressive floating point peak performance, programming and optimizing for it is difficult as it requires explicit memory management, multithreading, streaming, and vectorization. We address this problem for the discrete Fourier transform (DFT) by extending Spiral, a program generation system, to automatically generate highly optimized implementations for the Cell. The extensions include multi-SPE parallelization and explicit memory streaming, both pert a high abstraction level using rewriting systems operating on Spiral’s internal domain-specific language. Further, we support latency and throughput optimizations, single and double precision, and different data formats. The performance of Spiral’s computer generated code is comparable with and sometimes better than existing DFT implementations, whe...
Srinivas Chellappa, Franz Franchetti, Markus P&uum
Added 20 May 2010
Updated 20 May 2010
Type Conference
Year 2009
Where ICS
Authors Srinivas Chellappa, Franz Franchetti, Markus Püschel
Comments (0)