Sciweavers

» Adaptive Parallelism for OpenMP Task Parallel Programs
IPPS
2005
IEEE
15 years 7 months ago
Runtime Empirical Selection of Loop Schedulers on Hyperthreaded SMPs
Hyperthreaded (HT) and simultaneous multithreaded (SMT) processors are now available in commodity workstations and servers. This technology is designed to increase throughput by e...
Yun Zhang, Michael Voss
ICASSP
2011
IEEE
14 years 5 months ago
Parallel implementation of Multi-dimensional Ensemble Empirical Mode Decomposition
In this paper, we propose and evaluate two parallel implementations of Multi-dimensional Ensemble Empirical Mode Decomposition (MEEMD) for multi-core (CPU) and many-core (GPU) arc...
Li-Wen Chang, Men-Tzung Lo, Nasser Anssari, Ke-Hsi...
HPCA
2009
IEEE
16 years 2 months ago
Design and implementation of software-managed caches for multicores with local memory
Heterogeneous multicores, such as Cell BE processors and GPGPUs, typically do not have caches for their accelerator cores because coherence traffic, cache misses, and latencies fr...
Sangmin Seo, Jaejin Lee, Zehra Sura
EUROPAR
2010
Springer
15 years 2 months ago
Estimating and Exploiting Potential Parallelism by Source-Level Dependence Profiling
Manual parallelization of programs is known to be difficult and error-prone, and there are currently few ways to measure the amount of potential parallelism in the original sequent...
Jonathan Mak, Karl-Filip Faxén, Sverker Jan...
PPOPP
2010
ACM
15 years 11 months ago
Lazy binary-splitting: a run-time adaptive work-stealing scheduler
We present Lazy Binary Splitting (LBS), a user-level scheduler of nested parallelism for shared-memory multiprocessors that builds on existing Eager Binary Splitting work-stealing...
Alexandros Tzannes, George C. Caragea, Rajeev Baru...