Sciweavers

MICRO
2002
IEEE
103views Hardware» more  MICRO 2002»
13 years 10 months ago
Cherry: checkpointed early resource recycling in out-of-order microprocessors
This paper presents CHeckpointed Early Resource RecYcling (Cherry), a hybrid mode of execution based on ROB and checkpointing that decouples resource recycling and instruction ret...
José F. Martínez, Jose Renau, Michae...
MICRO
2002
IEEE
97views Hardware» more  MICRO 2002»
13 years 10 months ago
Three extensions to register integration
Register integration (or just integration) is a register renaming discipline that implements instruction reuse via physical register sharing. Initially developed to perform squash...
Vlad Petric, Anne Bracy, Amir Roth
MICRO
2002
IEEE
104views Hardware» more  MICRO 2002»
13 years 10 months ago
Reducing register ports for higher speed and lower energy
The key issues for register file design in high-performance processors are access time and energy. While previous work has focused on reducing the number of registers, we propose...
Il Park, Michael D. Powell, T. N. Vijaykumar
MICRO
2002
IEEE
119views Hardware» more  MICRO 2002»
13 years 10 months ago
Reduced code size modulo scheduling in the absence of hardware support
Josep Llosa, Stefan M. Freudenberger
MICRO
2002
IEEE
97views Hardware» more  MICRO 2002»
13 years 10 months ago
Instruction fetch deferral using static slack
In this paper we present an approach to boosting performance and tolerating latency by deferring non-critical instructions into a deferred queue for later processing. As such, ins...
Gregory A. Muthler, David Crowe, Sanjay J. Patel, ...
MICRO
2002
IEEE
104views Hardware» more  MICRO 2002»
13 years 10 months ago
Compiling for instruction cache performance on a multithreaded architecture
Instruction cache aware compilation seeks to lay out a program in memory in such a way that cache conflicts between procedures are minimized. It does this through profile-driven...
Rakesh Kumar, Dean M. Tullsen
MICRO
2002
IEEE
173views Hardware» more  MICRO 2002»
13 years 10 months ago
Vector vs. superscalar and VLIW architectures for embedded multimedia benchmarks
Multimedia processing on embedded devices requires an architecture that leads to high performance, low power consumption, reduced design complexity, and small code size. In this p...
Christoforos E. Kozyrakis, David A. Patterson
MICRO
2002
IEEE
122views Hardware» more  MICRO 2002»
13 years 10 months ago
Microarchitectural denial of service: insuring microarchitectural fairness
Simultaneous multithreading seeks to improve the aggregate computation bandwidth of a processor core by sharing resources such as functional units, caches, TLB and so on. To date,...
Dirk Grunwald, Soraya Ghiasi
MICRO
2002
IEEE
143views Hardware» more  MICRO 2002»
13 years 10 months ago
Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor
Clustering is a common technique to overcome the wire delay problem incurred by the evolution of technology. Fully-distributed architectures, where the register file, the functio...
Enric Gibert, F. Jesús Sánchez, Anto...
MICRO
2002
IEEE
111views Hardware» more  MICRO 2002»
13 years 10 months ago
Managing static leakage energy in microprocessor functional units
Steve Dropsho, Volkan Kursun, David H. Albonesi, S...