Search Sciweavers | Sciweavers

182 search results - page 32 / 37

» Scheduling complex streaming applications on the Cell proces...

HPCA
2006
IEEE

107views Distributed And Parallel Com...» more HPCA 2006»

Store vectors for scalable memory dependence prediction and scheduling

16 years 4 months ago

Download www.cc.gatech.edu

Allowing loads to issue out-of-order with respect to earlier unresolved store addresses is very important for extracting parallelism in large-window superscalar processors. Blindl...

Samantika Subramaniam, Gabriel H. Loh

HPCA
2009
IEEE

176views Distributed And Parallel Com...» more HPCA 2009»

Design and implementation of software-managed caches for multicores with local memory

16 years 5 months ago

Download www.multicoreinfo.com

Heterogeneous multicores, such as Cell BE processors and GPGPUs, typically do not have caches for their accelerator cores because coherence traffic, cache misses, and latencies fr...

Sangmin Seo, Jaejin Lee, Zehra Sura

PLDI
2012
ACM

289views Programming Languages» more PLDI 2012»

Adaptive input-aware compilation for graphics engines

13 years 7 months ago

Download cccp.eecs.umich.edu

While graphics processing units (GPUs) provide low-cost and efficient platforms for accelerating high performance computations, the tedious process of performance tuning required...

Mehrzad Samadi, Amir Hormati, Mojtaba Mehrara, Jan...

IJPP
2006

145views more IJPP 2006»

Deterministic Parallel Processing

15 years 4 months ago

Download www.science.uva.nl

Abstract. In order to address the problems faced in the wireless communications domain, picoChip has devised the picoArray™. The picoArray™ is a tiled-processor architecture, co...

Gajinder Panesar, Daniel Towner, Andrew Duller, Al...

ASPLOS
2010
ACM

228views Programming Languages» more ASPLOS 2010»

Micro-pages: increasing DRAM efficiency with locality-aware data placement

15 years 7 months ago

Download www.cs.utah.edu

Power consumption and DRAM latencies are serious concerns in modern chip-multiprocessor (CMP or multi-core) based compute systems. The management of the DRAM row buffer can signif...

Kshitij Sudan, Niladrish Chatterjee, David Nellans...
