Sciweavers

131 search results - page 5 / 27
» Automatic thread distribution for nested parallelism in Open...
Sort
View
106
Voted
PPOPP
2009
ACM
15 years 10 months ago
OpenMP to GPGPU: a compiler framework for automatic translation and optimization
GPGPUs have recently emerged as powerful vehicles for generalpurpose high-performance computing. Although a new Compute Unified Device Architecture (CUDA) programming model from N...
Seyong Lee, Seung-Jai Min, Rudolf Eigenmann
60
Voted
IPPS
2006
IEEE
15 years 3 months ago
Techniques supporting threadprivate in OpenMP
This paper presents the alternatives available to support threadprivate data in OpenMP and evaluates them. We show how current compilation systems rely on custom techniques for im...
Xavier Martorell, Marc González, Alejandro ...
106
Voted
ISHPC
2000
Springer
15 years 1 months ago
Implementation and Evaluation of OpenMP for Hitachi SR8000
This paper describes the implementation and evaluation of the OpenMP compiler designed for the Hitachi SR8000 Super Technical Server. The compiler performs parallelization for the ...
Yasunori Nishitani, Kiyoshi Negishi, Hiroshi Ohta,...
98
Voted
IPPS
2007
IEEE
15 years 4 months ago
A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs
Hybrid chip multithreaded SMPs present new challenges as well as new opportunities to maximize performance. Our intention is to discover the optimal operating configuration of suc...
Ryan E. Grant, Ahmad Afsahi
EUROPAR
2004
Springer
15 years 3 months ago
Evaluating OpenMP Performance Analysis Tools with the APART Test Suite
The APART working group is developing the APART Test Suite (ATS) for evaluating (automatic) performance analysis tools with respect to their correctness – that is, their ability...
Michael Gerndt, Bernd Mohr, Jesper Larsson Trä...