Search Sciweavers | Sciweavers

14 search results - page 3 / 3

» Reducing Memory Latency via Read-after-Read Memory Dependenc...

click to vote

ICS
2000
Tsinghua U.

142views Distributed And Parallel Com...» more ICS 2000»

Push vs. pull: data movement for linked data structures

13 years 9 months ago

Download www.cs.duke.edu

As the performance gap between the CPU and main memory continues to grow, techniques to hide memory latency are essential to deliver a high performance computer system. Prefetchin...

Chia-Lin Yang, Alvin R. Lebeck

claim paper

Read More »

click to vote

IEEEPACT
1998
IEEE

156views Distributed And Parallel Com...» more IEEEPACT 1998»

Adaptive Scheduling of Computations and Communications on Distributed Memory Systems

13 years 9 months ago

Download faculty.kfupm.edu.sa

Compile-time scheduling is one approach to extract parallelism which has proved effective when the execution behavior is predictable. Unfortunately, the performance of most priori...

Mayez A. Al-Mouhamed, Homam Najjari

claim paper

Read More »

click to vote

ISCA
2000
IEEE

111views Hardware» more ISCA 2000»

Understanding the backward slices of performance degrading instructions

13 years 9 months ago

Download www.ece.lsu.edu

For many applications, branch mispredictions and cache misses limit a processor’s performance to a level well below its peak instruction throughput. A small fraction of static i...

Craig B. Zilles, Gurindar S. Sohi

claim paper

Read More »

click to vote

MICRO
2002
IEEE

131views Hardware» more MICRO 2002»

Pointer cache assisted prefetching

13 years 10 months ago

Download cseweb.ucsd.edu

Data prefetching effectively reduces the negative effects of long load latencies on the performance of modern processors. Hardware prefetchers employ hardware structures to predic...

Jamison D. Collins, Suleyman Sair, Brad Calder, De...

claim paper

Read More »

« Prev « First page 3 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers