loop | Sciweavers

13

IRREGULAR
1995
Springer

105views Distributed And Parallel Com...» more IRREGULAR 1995»

Run-Time Parallelization of Irregular DOACROSS Loops

13 years 7 months ago

Dependencies between iterations of loop structures cannot always be determined at compile-time because they may depend on input data which is known only at run-time. A prime examp...

V. Prasad Krothapalli, Thulasiraman Jeyaraman, Mar...

claim paper

Read More »

9

click to vote

CGO
2004
IEEE

115views Software Engineering» more CGO 2004»

Single-Dimension Software Pipelining for Multi-Dimensional Loops

13 years 8 months ago

Download www.cgo.org

Traditionally, software pipelining is applied either to the innermost loop of a given loop nest or from the innermost loop to outer loops. In this paper, we propose a threestep ap...

Hongbo Rong, Zhizhong Tang, Ramaswamy Govindarajan...

claim paper

Read More »

17

click to vote

CF
2007
ACM

116views Applied Computing» more CF 2007»

Identifying potential parallelism via loop-centric profiling

13 years 8 months ago

Download www.cse.ohio-state.edu

The transition to multithreaded, multi-core designs places a greater responsibility on programmers and software for improving performance; thread-level parallelism (TLP) will be i...

Tipp Moseley, Daniel A. Connors, Dirk Grunwald, Ra...

claim paper

Read More »

23

click to vote

ASPLOS
1994
ACM

163views Programming Languages» more ASPLOS 1994»

Compiler Optimizations for Improving Data Locality

13 years 8 months ago

Download userweb.cs.utexas.edu

In the past decade, processor speed has become significantly faster than memory speed. Small, fast cache memories are designed to overcome this discrepancy, but they are only effe...

Steve Carr, Kathryn S. McKinley, Chau-Wen Tseng

claim paper

Read More »

14

click to vote

IEEEPACT
1998
IEEE

129views Distributed And Parallel Com...» more IEEEPACT 1998»

A Matrix-Based Approach to the Global Locality Optimization Problem

13 years 8 months ago

Download cucis.ece.northwestern.edu

Global locality analysis is a technique for improving the cache performance of a sequence of loop nests through a combination of loop and data layout optimizations. Pure loop tran...

Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanuja...

claim paper

Read More »

19

click to vote

IPPS
1999
IEEE

156views Distributed And Parallel Com...» more IPPS 1999»

Cascaded Execution: Speeding Up Unparallelized Execution on Shared-Memory Multiprocessors

13 years 8 months ago

Download www.cs.virginia.edu

Both inherently sequential code and limitations of analysis techniques prevent full parallelization of many applications by parallelizing compilers. Amdahl's Law tells us tha...

Ruth E. Anderson, Thu D. Nguyen, John Zahorjan

claim paper

Read More »

8

click to vote

ICPP
1999
IEEE

164views Distributed And Parallel Com...» more ICPP 1999»

Access Descriptor Based Locality Analysis for Distributed-Shared Memory Multiprocessors

13 years 8 months ago

Download polaris.cs.uiuc.edu

Most of today's multiprocessors have a DistributedShared Memory (DSM) organization, which enables scalability while retaining the convenience of the shared-memory programming...

Angeles G. Navarro, Rafael Asenjo, Emilio L. Zapat...

claim paper

Read More »

10

click to vote

ICPP
1999
IEEE

116views Distributed And Parallel Com...» more ICPP 1999»

Compiler Optimizations for I/O-Intensive Computations

13 years 8 months ago

Download cucis.ece.northwestern.edu

This paper describes transformation techniques for out-of-core programs (i.e., those that deal with very large quantities of data) based on exploiting locality using a combination...

Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanuja...

claim paper

Read More »

12

click to vote

IPPS
2003
IEEE

170views Distributed And Parallel Com...» more IPPS 2003»

Loop Dissevering: A Technique for Temporally Partitioning Loops in Dynamically Reconfigurable Computing Platforms

13 years 9 months ago

Download w3.ualg.pt

This paper presents a technique, called loop dissevering, to temporally partitioning any type of loop presented in programming languages. The technique can be used in the presence...

João M. P. Cardoso

claim paper

Read More »

6

click to vote

ISPA
2004
Springer

123views Distributed And Parallel Com...» more ISPA 2004»

An Inspector-Executor Algorithm for Irregular Assignment Parallelization

13 years 9 months ago

Download www.des.udc.es

Abstract. A loop with irregular assignment computations contains loopcarried output data dependences that can only be detected at run-time. In this paper, a load-balanced method ba...

Manuel Arenaz, Juan Touriño, Ramon Doallo

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers