Sciweavers

2784 search results - page 263 / 557
» Instruction Level Parallelism
Sort
View
ICPPW
2003
IEEE
15 years 9 months ago
Load Balancing on PC Clusters with the Super-Programming Model
Recent work in high-performance computing has shifted attention to PC clusters.. For PC-clusters, member nodes are independent computers connected by generalpurpose networks. The ...
Dejiang Jin, Sotirios G. Ziavras
CC
1999
Springer
116views System Software» more  CC 1999»
15 years 8 months ago
A Programmable ANSI C Transformation Engine
Abstract. Source code transformations are a very effective method of parallelizing and improving the efficiency of programs. Unfortunately most compiler systems require implementin...
Maarten Boekhold, Ireneusz Karkowski, Henk Corpora...
ICS
1999
Tsinghua U.
15 years 8 months ago
Reducing cache misses using hardware and software page placement
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasingly important component of processor performance. Compiler techniques have been...
Timothy Sherwood, Brad Calder, Joel S. Emer
IEEEPACT
1998
IEEE
15 years 8 months ago
Optimistic Register Coalescing
Register coalescing is used, as part of register allocation, to reduce the number of register copies. Developing efficient register coalescing heuristics is particularly important ...
Jinpyo Park, Soo-Mook Moon
128
Voted
IPPS
1998
IEEE
15 years 8 months ago
An Enhanced Co-Scheduling Method Using Reduced MS-State Diagrams
Instruction scheduling methods based on the construction of state diagrams (or automata) have been used for architectures involving deeply pipelined function units. However, the s...
Ramaswamy Govindarajan, N. S. S. Narasimha Rao, Er...