HPCA 2012 | Sciweavers


HPCA
2012
IEEE

Since the onset of pipelined processors, balancing the delay of the microarchitectural pipeline stages such that each microarchitectural pipeline stage has an equal delay has been...

John Sartori, Ben Ahrens, Rakesh Kumar

System-level implications of disaggregated memory

Recent research on memory disaggregation introduces a new architectural building block—the memory blade—as a cost-effective approach for memory capacity expansion and sharing ...

Kevin T. Lim, Yoshio Turner, Jose Renato Santos, A...

Decoupled dynamic cache segmentation

The least recently used (LRU) replacement policy performs poorly in the last-level cache (LLC) because temporal locality of memory accesses is filtered by first and second level...

Samira Manabi Khan, Zhe Wang, Daniel A. Jimé...

Pacman: Tolerating asymmetric data races with unintrusive hardware

Data races are a major contributor to parallel software unreliability. A type of race that is both common and typically harmful is the asymmetric data race. It occurs when at leas...

Shanxiang Qi, Norimasa Otsuki, Lois Orosa Nogueira...

SCD: A scalable coherence directory with flexible sharer set encoding

Large-scale CMPs with hundreds of cores require a directory-based protocol to maintain cache coherence. However, previously proposed coherence directories are hard to scale beyond...

Daniel Sanchez, Christos Kozyrakis

Staged Reads: Mitigating the impact of DRAM writes on DRAM reads

Main memory latencies have always been a concern for system performance. Given that reads are on the critical path for CPU progress, reads must be prioritized over writes. However...

Niladrish Chatterjee, Naveen Muralimanohar, Rajeev...

Flexible register management using reference counting

Conventional out-of-order processors that use a unified physical register file allocate and reclaim registers explicitly using a free list that operates as a circular queue. We ...

Steven Battle, Andrew D. Hilton, Mark Hempstead, A...

BulkSMT: Designing SMT processors for atomic-block execution

Multiprocessor architectures that continuously execute atomic blocks (or chunks) of instructions can improve performance and software productivity. However, all of the prior propo...

Xuehai Qian, Benjamin Sahelices, Josep Torrellas

Balancing DRAM locality and parallelism in shared memory CMP systems

Modern memory systems rely on spatial locality to provide high bandwidth while minimizing memory device power and cost. The trend of increasing the number of cores that share memo...

Min Kyu Jeong, Doe Hyun Yoon, Dam Sunwoo, Mike Sul...

Booster: Reactive core acceleration for mitigating the effects of process variation and application imbalance in low-voltage chi

Lowering supply voltage is one of the most effective techniques for reducing microprocessor power consumption. Unfortunately, at low voltages, chips are very sensitive to process ...

Timothy N. Miller, Xiang Pan, Renji Thomas, Naser ...
