Sciweavers

ICS
1998
Tsinghua U.
13 years 9 months ago
Dependence Driven Execution for Multiprogrammed Multiprocessor
Barrier synchronizations can be very expensive in a multiprogramming environment because no process can go past a barrier until all the processes have arrived. If a process ...
Suvas Vajracharya, Dirk Grunwald
ICS
1998
Tsinghua U.
Monitoring Shared Virtual Memory Performance on a Myrinet-based PC Cluster
Cheng Liao, Dongming Jiang, Liviu Iftode, Margaret...
ICS
1998
Tsinghua U.
A General Algorithm for Tiling the Register Level
Marta Jiménez, José M. Llaberí...
ICS
1998
Tsinghua U.
Vector Architectures: Past, Present and Future
Roger Espasa, Mateo Valero, James E. Smith
ICS
1998
Tsinghua U.
Techniques for Empirical Testing of Parallel Random Number Generators
Parallel computers are now commonly used for computational science and engineering, and many applications in these areas use random number generators. For some applications, such ...
Paul D. Coddington, Sung Hoon Ko
ICS
1998
Tsinghua U.
Load Execution Latency Reduction
In order to achieve high performance, contemporary microprocessors must effectively process the four major instruction types: ALU, branch, load, and store instructions. This paper...
Bryan Black, Brian Mueller, Stephanie Postal, Ryan...
ICS
1998
Tsinghua U.
Data Prefetching for Software DSMs
In this paper we propose and evaluate the Adaptive++ technique, a novel runtime-only data prefetching strategy for software-based distributed shared-memory systems (software DSMs)...
Ricardo Bianchini, Raquel Pinto, Claudio Luis de A...