Sciweavers

MICRO
1995
IEEE

Zero-cycle loads: microarchitecture support for reducing load latency

13 years 7 months ago
Zero-cycle loads: microarchitecture support for reducing load latency
Untolerated load instruction latencies often have a significant impact on overall program performance. As one means of mitigating this effect, we present an aggressive hardware-based mechanism that provides effective support for reducing the latency of load instructions. Through the judicious use of instruction predecode, base register caching, and fast address calculation, it becomes possible to complete load instructions up to two cycles earlier than traditional pipeline designs. For a pipeline with one cycle data cache access, this results in what we term a zero-cycle load. A zero-cycle load produces a result prior to reaching the execute stage of the pipeline, allowing subsequent dependent instructions to issue unfettered by load dependencies. Programs executing on processors with support for zero-cycle loads experience significantly fewer pipeline stalls due to load instructions and increased overall performance. We present two pipeline designs supporting zero-cycle loads: one ...
Todd M. Austin, Gurindar S. Sohi
Added 26 Aug 2010
Updated 26 Aug 2010
Type Conference
Year 1995
Where MICRO
Authors Todd M. Austin, Gurindar S. Sohi
Comments (0)