Sciweavers

ICS
1995
Tsinghua U.
15 years 3 months ago
A Data Cache with Multiple Caching Strategies Tuned to Different Types of Locality
Current data cache organizations fail to deliver high performance in scalar processors for many vector applications. There are two main reasons for this loss of performance: the u...
Antonio González, Carlos Aliagas, Mateo Val...
ICS
1995
Tsinghua U.
15 years 3 months ago
Run-Time Methods for Parallelizing Partially Parallel Loops
In this paper we give a new run–time technique for finding an optimal parallel execution schedule for a partially parallel loop, i.e., a loop whose parallelization requires syn...
Lawrence Rauchwerger, Nancy M. Amato, David A. Pad...
102
Voted
ICS
1995
Tsinghua U.
15 years 3 months ago
Optimum Modulo Schedules for Minimum Register Requirements
Modulo scheduling is an e cient technique for exploiting instruction level parallelism in a variety of loops, resulting in high performance code but increased register requirement...
Alexandre E. Eichenberger, Edward S. Davidson, San...
ICS
1995
Tsinghua U.
15 years 3 months ago
Idiom Recognition in the Polaris Parallelizing Compiler
The elimination of induction variables and the parallelization of reductions in FORTRAN programs have been shown to be integral to performance improvement on parallel computers 7,...
William M. Pottenger, Rudolf Eigenmann
83
Voted
ICS
1995
Tsinghua U.
15 years 3 months ago
Data Forwarding in Scalable Shared-Memory Multiprocessors
David Koufaty, Xiangfeng Chen, David K. Poulsen, J...
Distributed and Parallel Computing
Top of PageReset Settings