Sciweavers

MICRO
1994
IEEE
99views Hardware» more  MICRO 1994»
13 years 8 months ago
Data relocation and prefetching for programs with large data sets
Numerical applications frequently contain nested loop structures that process large arrays of data. The execution of these loop structures often produces memory preference pattern...
Yoji Yamada, John Gyllenhall, Grant Haab, Wen-mei ...
MICRO
1994
IEEE
123views Hardware» more  MICRO 1994»
13 years 8 months ago
The effects of predicated execution on branch prediction
High performance architectures have always had to deal with the performance-limiting impact of branch operations. Microprocessor designs are going to have to deal with this proble...
Gary S. Tyson
MICRO
1994
IEEE
85views Hardware» more  MICRO 1994»
13 years 8 months ago
A high-performance microarchitecture with hardware-programmable functional units
This paper explores a novel way to incorporate hardware-programmable resources into a processor microarchitecture to improve the performance of general-purpose applications. Throu...
Rahul Razdan, Michael D. Smith
MICRO
1994
IEEE
118views Hardware» more  MICRO 1994»
13 years 8 months ago
Characterizing the impact of predicated execution on branch prediction
Branch instructions are recognized as a major impediment to exploiting instruction level parallelism. Even with sophisticated branch prediction techniques, many frequently execute...
Scott A. Mahlke, Richard E. Hank, Roger A. Bringma...
MICRO
1994
IEEE
81views Hardware» more  MICRO 1994»
13 years 8 months ago
Register file port requirements of transport triggered architectures
Exploitation of large amounts of instruction level parallelism requires a large amount of connectivity between the shared register file and the function units; this connectivity i...
Jan Hoogerbrugge, Henk Corporaal
MICRO
1994
IEEE
113views Hardware» more  MICRO 1994»
13 years 8 months ago
Static branch frequency and program profile analysis
: Program profiles identify frequently executed portions of a program, which are the places at which optimizations offer programmers and compilers the greatest benefit. Compilers, ...
Youfeng Wu, James R. Larus
MICRO
1994
IEEE
124views Hardware» more  MICRO 1994»
13 years 8 months ago
A comparison of two pipeline organizations
We examine two pipeline structures which are employed in commercial microprocessors. The first is the load-use interlock (LUI) pipeline, which employs an interlock to ensure corre...
Michael Golden, Trevor N. Mudge
MICRO
1994
IEEE
96views Hardware» more  MICRO 1994»
13 years 8 months ago
A fill-unit approach to multiple instruction issue
Multiple issue of instructions occurs in superscalar and VLIW machines. This paper investigates a third type of machine design, which combines the advantages of code compatibility...
Manoj Franklin, Mark Smotherman