Search Sciweavers | Sciweavers

115 search results - page 16 / 23

» Fusion of Loops for Parallelism and Locality

IEEEPACT
1999
IEEE

157views Distributed And Parallel Com...» more IEEEPACT 1999»

On Reducing False Sharing while Improving Locality on Shared Memory Multiprocessors

15 years 10 months ago

Download cucis.ece.northwestern.edu

The performance of applications on large shared-memory multiprocessors with coherent caches depends on the interaction between the granularity of data sharing, the size of the coh...

Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanuja...

PLDI
1995
ACM

122views Programming Languages» more PLDI 1995»

Improving Balanced Scheduling with Compiler Optimizations that Increase Instruction-Level Parallelism

15 years 9 months ago

Download reference.kfupm.edu.sa

Traditional list schedulers order instructions based on an optimistic estimate of the load latency imposed by the hardware and therefore cannot respond to variations in memory lat...

Jack L. Lo, Susan J. Eggers

LCPC
1997
Springer

104views System Software» more LCPC 1997»

Reducing Synchronization Overhead for Compiler-Parallelized Codes

15 years 9 months ago

Download www.cs.umd.edu

Software distributed-shared-memory (DSM) systems provide an appealing target for parallelizing compilers due to their flexibility. Previous studies demonstrate such systems can prov...

Hwansoo Han, Chau-Wen Tseng, Peter J. Keleher

ICS
2005
Tsinghua U.

122views Distributed And Parallel Com...» more ICS 2005»

Think globally, search locally

15 years 11 months ago

Download iss.ices.utexas.edu

A key step in program optimization is the determination of optimal values for code optimization parameters such as cache tile sizes and loop unrolling factors. One approach, which...

Kamen Yotov, Keshav Pingali, Paul Stodghill

ICPP
1997
IEEE

169views Distributed And Parallel Com...» more ICPP 1997»

Automatic Partitioning of Data and Computations on Scalable Shared Memory Multiprocessors

15 years 10 months ago

Download www.eecg.toronto.edu

This paper describes an algorithm for deriving data and computation partitions on scalable shared memory multiprocessors. The algorithm establishes affinity relationshi...

Sudarsan Tandri, Tarek S. Abdelrahman
