Sciweavers

125 search results - page 2 / 25
» Loop Striping: Maximize Parallelism for Nested Loops
Sort
View
ISPAN
2005
IEEE
13 years 11 months ago
Maximum Loop Distribution and Fusion for Two-level Loops Considering Code Size
In this paper, we propose a technique combining loop distribution with loop fusion to improve the timing performance without increasing the code size of the transformed loops. We ...
Meilin Liu, Qingfeng Zhuge, Zili Shao, Chun Xue, M...
ICPP
1996
IEEE
13 years 9 months ago
Polynomial-Time Nested Loop Fusion with Full Parallelism
Data locality and synchronization overhead are two important factors that affect the performance of applications on multiprocessors. Loop fusion is an effective way for reducing s...
Edwin Hsing-Mean Sha, Chenhua Lang, Nelson L. Pass...
HIPS
1998
IEEE
13 years 9 months ago
Further Results for Improving Loop Interchange in Non-Adjacent and Imperfectly Nested Loops
Loop interchange is a powerful restructuring technique for supporting vectorization and parallelization. In this paper, we propose a technique which is better to determine whether...
Tsung-Chuan Huang, Cheng-Ming Yang
LCPC
1993
Springer
13 years 9 months ago
Maximizing Loop Parallelism and Improving Data Locality via Loop Fusion and Distribution
Abstract. Loop fusion is a program transformation that merges multiple loops into one. It is e ective for reducing the synchronization overhead of parallel loops and for improving ...
Ken Kennedy, Kathryn S. McKinley
CASES
2001
ACM
13 years 9 months ago
Combined partitioning and data padding for scheduling multiple loop nests
With the widening performance gap between processors and main memory, efficient memory accessing behavior is necessary for good program performance. Loop partition is an effective...
Zhong Wang, Edwin Hsing-Mean Sha, Xiaobo Hu