Search Sciweavers | Sciweavers

115 search results - page 7 / 23

» Fusion of Loops for Parallelism and Locality

122

click to vote

IPPS
2010
IEEE

144views Distributed And Parallel Com...» more IPPS 2010»

Restructuring parallel loops to curb false sharing on multicore architectures

14 years 11 months ago

Download www.cs.txstate.edu

The memory hierarchy of most multicore systems contains one or more levels of cache that is shared among multiple cores. The shared-cache architecture presents many opportunities f...

Santosh Sarangkar, Apan Qasem

claim paper

Read More »

124

click to vote

CASES
2009
ACM

234views System Software» more CASES 2009»

CGRA express: accelerating execution using dynamic operation fusion

15 years 7 months ago

Download cccp.eecs.umich.edu

Coarse-grained reconﬁgurable architectures (CGRAs) present an appealing hardware platform by providing programmability with the potential for high computation throughput, scalab...

Yongjun Park, Hyunchul Park, Scott A. Mahlke

claim paper

Read More »

151

click to vote

ASPLOS
1994
ACM

163views Programming Languages» more ASPLOS 1994»

Compiler Optimizations for Improving Data Locality

15 years 5 months ago

Download userweb.cs.utexas.edu

In the past decade, processor speed has become significantly faster than memory speed. Small, fast cache memories are designed to overcome this discrepancy, but they are only effe...

Steve Carr, Kathryn S. McKinley, Chau-Wen Tseng

claim paper

Read More »

113

click to vote

ICPP
1996
IEEE

98views Distributed And Parallel Com...» more ICPP 1996»

Scheduling of Wavefront Parallelism on Scalable Shared-memory Multiprocessors

15 years 5 months ago

Download www.eecg.toronto.edu

Tiling exploits temporal reuse carried by an outer loop of a loop nest to enhance cache locality. Loop skewing is typically required to make tiling legal. This restricts parallelis...

Naraig Manjikian, Tarek S. Abdelrahman

claim paper

Read More »

111

click to vote

SAC
2002
ACM

159views Applied Computing» more SAC 2002»

Automatic code generation for executing tiled nested loops onto parallel architectures

15 years 27 days ago

Download www.cslab.ece.ntua.gr

This paper presents a novel approach for the problem of generating tiled code for nested for-loops using a tiling transformation. Tiling or supernode transformation has been widel...

Georgios I. Goumas, Maria Athanasaki, Nectarios Ko...

claim paper

Read More »

« Prev « First page 7 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers