Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

12

EUROPAR
2008
Springer

favoriteEmaildiscussreport

112views Distributed And Parallel Com...» more EUROPAR 2008»

Optimized Pipelined Parallel Merge Sort on the Cell BE

13 years 6 months ago

Optimized Pipelined Parallel Merge Sort on the Cell BE

Download pv.fernuni-hagen.de

Chip multiprocessors designed for streaming applications such as Cell BE offer impressive peak performance but suffer from limited bandwidth to offchip main memory. As the number of cores is expected to rise further, this bottleneck will become more critical in the coming years. Hence, memory-efficient algorithms are required. As a case study, we investigate parallel sorting on Cell BE as a problem of great importance and as a challenge where the ratio between computation and memory transfer is very low. Our previous work led to a parallel mergesort that reduces memory bandwidth requirements by pipelining between SPEs, but the allocation of SPEs was rather ad-hoc. In our present work, we investigate mappings of merger nodes to SPEs. The mappings are designed to provide optimal trade-offs between load balancing, buffer memory consumption, and communication load on the on-chip bus. We solve this multi-objective optimization problem by deriving an integer linear programming formulation an...

Jörg Keller, Christoph W. Kessler

Real-time Traffic

Distributed And Parallel Computing | EUROPAR 2008 | Mappings | Memory Bandwidth Requirements | Merger Nodes |

claim paper

Related Content

» Hybrid Parallel Sort on the Cell Processor

» Optimized OnChipPipelined Mergesort on the CellBE

» CellSort High Performance Sorting on the Cell Processor

» On CostOptimal Merge of Two Intransitive Sorted Sequences

» Recognizing and representing proper interval graphs in parallel using merging and sorting

» Outofcore distribution sort in the FG programming environment

» Optimized onchip pipelining of memoryintensive computations on the cell BE

» AASort A New Parallel Sorting Algorithm for MultiCore SIMD Processors

» Parallel subdivision surface rendering and animation on the Cell BE processor

Post Info
More Details (n/a)

Added	19 Oct 2010
Updated	19 Oct 2010
Type	Conference
Year	2008
Where	EUROPAR
Authors	Jörg Keller, Christoph W. Kessler

Comments (0)