Search Sciweavers | Sciweavers

52 search results - page 2 / 11

» Loop Alignment for Memory Accesses Optimization

CASES 2008 (ACM), 217 views, System Software

Efficient vectorization of SIMD programs with non-aligned and irregular data access hardware

13 years 7 months ago

Automatic vectorization of programs for partitioned-ALU SIMD (Single Instruction Multiple Data) processors has been difficult because of not only data dependency issues but also n...

Hoseok Chang, Wonyong Sung

CONCURRENCY 2006, 140 views

An efficient memory operations optimization technique for vector loops on Itanium 2 processors

13 years 5 months ago

To keep up with a large degree of instruction level parallelism (ILP), the Itanium 2 cache systems use a complex organization scheme: load/store queues, banking and interleaving. ...

William Jalby, Christophe Lemuet, Sid Ahmed Ali To...

MICRO 2000 (IEEE), 176 views, Hardware

An Advanced Optimizer for the IA-64 Architecture

13 years 5 months ago

level of abstraction, compared with the program representation for scalar optimizations. For example, loop unrolling and loop unroll-and-jam transformations exploit the large regist...

Rakesh Krishnaiyer, Dattatraya Kulkarni, Daniel M....

ICPP 1998 (IEEE), 222 views, Distributed And Parallel Com...

A memory-layout oriented run-time technique for locality optimization

13 years 10 months ago

Exploiting locality at run-time is a complementary approach to a compiler approach for those applications with dynamic memory access patterns. This paper proposes a memory-layout ...

Yong Yan, Xiaodong Zhang, Zhao Zhang

ICS 1992 (Tsinghua U.), 104 views, Distributed And Parallel Com...

Optimizing for parallelism and data locality

13 years 9 months ago

Previous research has used program transformation to introduce parallelism and to exploit data locality. Unfortunately, these two objectives have usually been considered independentl...

Ken Kennedy, Kathryn S. McKinley
