Sciweavers

115 search results - page 11 / 23
» Fusion of Loops for Parallelism and Locality
Sort
View
HPCC
2005
Springer
15 years 5 months ago
A Loop-Aware Search Strategy for Automated Performance Analysis
Abstract. Automated online search is a powerful technique for performance diagnosis. Such a search can change the types of experiments it performs while the program is running, mak...
Eli D. Collins, Barton P. Miller
ICS
2009
Tsinghua U.
15 years 6 months ago
Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs
Iterative stencil loops (ISLs) are used in many applications and tiling is a well-known technique to localize their computation. When ISLs are tiled across a parallel architecture...
Jiayuan Meng, Kevin Skadron
ICFP
2010
ACM
15 years 23 days ago
Regular, shape-polymorphic, parallel arrays in Haskell
We present a novel approach to regular, multi-dimensional arrays in Haskell. The main highlights of our approach are that it (1) is purely functional, (2) supports reuse through s...
Gabriele Keller, Manuel M. T. Chakravarty, Roman L...
ISHPC
2003
Springer
15 years 4 months ago
On the Implementation of OpenMP 2.0 Extensions in the Fujitsu PRIMEPOWER Compiler
Abstract. The OpenMP Architecture Review Board has released version 2.0 of the OpenMP Fortran language specification in November 2000, and version 2.0 of the OpenMP C/C++ language...
Hidetoshi Iwashita, Masanori Kaneko, Masaki Aoki, ...
IEEEPACT
1999
IEEE
15 years 4 months ago
Localizing Non-Affine Array References
Existing techniques can enhance the locality of arrays indexed by affine functions of induction variables. This paper presents a technique to localize non-affine array references,...
Nicholas Mitchell, Larry Carter, Jeanne Ferrante