memory hierarchy | Sciweavers

251

DAC
2012
ACM

221views Computer Architecture» more DAC 2012»

Cache revive: architecting volatile STT-RAM caches for enhanced performance in CMPs

13 years 10 months ago

Spin-Transfer Torque RAM (STT-RAM) is an emerging non-volatile memory (NVM) technology that has the potential to replace the conventional on-chip SRAM caches for designing a more ...

Adwait Jog, Asit K. Mishra, Cong Xu, Yuan Xie, Vij...

claim paper

Read More »

239

click to vote

PLDI
2012
ACM

289views Programming Languages» more PLDI 2012»

Adaptive input-aware compilation for graphics engines

13 years 10 months ago

Download cccp.eecs.umich.edu

While graphics processing units (GPUs) provide low-cost and efﬁcient platforms for accelerating high performance computations, the tedious process of performance tuning required...

Mehrzad Samadi, Amir Hormati, Mojtaba Mehrara, Jan...

claim paper

Read More »

260

click to vote

PPOPP
2012
ACM

280views Distributed and Parallel Com...» more PPOPP 2012»

PARRAY: a unifying array representation for heterogeneous parallelism

14 years 3 months ago

Download sei.pku.edu.cn

This paper introduces a programming interface called PARRAY (or Parallelizing ARRAYs) that supports system-level succinct programming for heterogeneous parallel systems like GPU c...

Yifeng Chen, Xiang Cui, Hong Mei

claim paper

Read More »

362

click to vote

CSE
2012
IEEE

322views Theoretical Computer Science» more CSE 2012»

Accelerating Quantum Monte Carlo Simulations of Real Materials on GPU Clusters

14 years 3 months ago

Download saahpc.ncsa.illinois.edu

—Continuum quantum Monte Carlo (QMC) has proved to be an invaluable tool for predicting the properties of matter from fundamental principles. By solving the manybody Schr¨odinge...

Kenneth Esler, Jeongnim Kim, David M. Ceperley, Lu...

claim paper

Read More »

249

click to vote

ARC
2012
Springer

280views Hardware» more ARC 2012»

Scalable Memory Hierarchies for Embedded Manycore Systems

14 years 3 months ago

Download www.csce.uark.edu

As the size of FPGA devices grows following Moore’s law, it becomes possible to put a complete manycore system onto a single FPGA chip. The centralized memory hierarchy on typica...

Sen Ma, Miaoqing Huang, Eugene Cartwright, David L...

claim paper

Read More »

379

click to vote

HIPEAC
2011
Springer

290views System Software» more HIPEAC 2011»

Decoupled zero-compressed memory

14 years 7 months ago

Download www.cs.utah.edu

For each computer system generation, there are always applications or workloads for which the main memory size is the major limitation. On the other hand, in many cases, one could...

Julien Dusser, André Seznec

claim paper

Read More »

277

click to vote

HIPEAC
2011
Springer

246views System Software» more HIPEAC 2011»

NoC-aware cache design for multithreaded execution on tiled chip multiprocessors

14 years 7 months ago

Download www.cs.pitt.edu

In chip multiprocessors (CMPs), data accesslatency dependson the memory hierarchy organization, the on-chip interconnect (NoC), and the running workload. Reducing data access late...

Ahmed Abousamra, Alex K. Jones, Rami G. Melhem

claim paper

Read More »

246

Voted

JCPHY
2011

192views more JCPHY 2011»

Fast analysis of molecular dynamics trajectories with graphics processing units - Radial distribution function histogramming

14 years 10 months ago

Download www.ks.uiuc.edu

The calculation of radial distribution functions (RDFs) from molecular dynamics trajectory data is a common and computationally expensive analysis task. The rate limiting step in ...

Benjamin G. Levine, John E. Stone, Axel Kohlmeyer

claim paper

Read More »

238

Voted

PE
2010
Springer

175views Optimization» more PE 2010»

Generalized ERSS tree model: Revisiting working sets

15 years 2 months ago

Download pire.fiu.edu

Accurately characterizing the resource usage of an application at various levels in the memory hierarchy has been a long-standing research problem. Existing characterization studi...

Ricardo Koller, Akshat Verma, Raju Rangaswami

claim paper

Read More »

237

click to vote

IPPS
2010
IEEE

144views Distributed And Parallel Com...» more IPPS 2010»

Restructuring parallel loops to curb false sharing on multicore architectures

15 years 5 months ago

Download www.cs.txstate.edu

The memory hierarchy of most multicore systems contains one or more levels of cache that is shared among multiple cores. The shared-cache architecture presents many opportunities f...

Santosh Sarangkar, Apan Qasem

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers