Sciweavers

115 search results - page 6 / 23
» Fusion of Loops for Parallelism and Locality
Sort
View
ICPP
1999
IEEE
15 years 1 months ago
A Framework for Interprocedural Locality Optimization Using Both Loop and Data Layout Transformations
There has been much work recently on improving the locality performance of loop nests in scientific programs through the use of loop as well as data layout optimizations. However,...
Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanuja...
ICS
1992
Tsinghua U.
15 years 1 months ago
Optimizing for parallelism and data locality
Previous research has used program transformation to introduce parallelism and to exploit data locality. Unfortunately,these twoobjectives have usuallybeen considered independentl...
Ken Kennedy, Kathryn S. McKinley
GCC
2003
Springer
15 years 2 months ago
Research and Application of Distributed Fusion System Based on Grid Computing
Grid computing is used to realize the connection of distributed, heterogeneous computing resources through grid, assist in accomplishing computing task. Focusing on the weakness of...
Yu Su, Hai Zhao, Wei-ji Su, Gang Wang, Xiao-dan Zh...
ICPPW
2008
IEEE
15 years 4 months ago
Performance Analysis and Optimization of Parallel Scientific Applications on CMP Cluster Systems
Chip multiprocessors (CMP) are widely used for high performance computing. Further, these CMPs are being configured in a hierarchical manner to compose a node in a cluster system....
Xingfu Wu, Valerie E. Taylor, Charles W. Lively, S...
85
Voted
IPPS
2010
IEEE
14 years 7 months ago
DynTile: Parametric tiled loop generation for parallel execution on multicore processors
Abstract--Loop tiling is an important compiler transformation used for enhancing data locality and exploiting coarsegrained parallelism. Tiled codes in which tile sizes are runtime...
Albert Hartono, Muthu Manikandan Baskaran, J. Rama...