Sciweavers

338 search results - page 48 / 68
» Automated Performance Prediction of Message-Passing Parallel...
Sort
View
ICS
2001
Tsinghua U.
15 years 7 months ago
Slice-processors: an implementation of operation-based prediction
We describe the Slice Processor micro-architecture that implements a generalized operation-based prefetching mechanism. Operation-based prefetchers predict the series of operation...
Andreas Moshovos, Dionisios N. Pnevmatikatos, Amir...
DSD
2008
IEEE
165views Hardware» more  DSD 2008»
15 years 9 months ago
Application Analysis for Parallel Processing
Effective mapping of multimedia applications on massively parallel embedded systems is a challenging demand in the domain of compiler design. The software implementations of emerg...
Muhammad Rashid, Damien Picard, Bernard Pottier
109
Voted
HIPC
2007
Springer
15 years 8 months ago
Experiments with a Parallel External Memory System
Abstract. The theory of bulk-synchronous parallel computing has produced a large number of attractive algorithms, which are provably optimal in some sense, but typically require th...
Mohammad R. Nikseresht, David A. Hutchinson, Anil ...
110
Voted
HPCA
2007
IEEE
16 years 2 months ago
Exploiting Postdominance for Speculative Parallelization
Task-selection policies are critical to the performance of any architecture that uses speculation to extract parallel tasks from a sequential thread. This paper demonstrates that ...
Mayank Agarwal, Kshitiz Malik, Kevin M. Woley, Sam...
99
Voted
MICRO
2009
IEEE
222views Hardware» more  MICRO 2009»
15 years 9 months ago
Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
Heterogeneous multiprocessors are growingly important in the multi-core era due to their potential for high performance and energy efficiency. In order for software to fully real...
Chi-Keung Luk, Sunpyo Hong, Hyesoon Kim