Search Sciweavers | Sciweavers

2703 search results - page 374 / 541

» Optimizing memory transactions

ICCD
2004
IEEE

106 views, Hardware

Energy Characterization of Hardware-Based Data Prefetching

16 years 16 days ago

Download www.ecs.umass.edu

This paper evaluates several hardware-based data prefetching techniques from an energy perspective, and explores their energy/performance tradeoffs. We present detailed simulation...

Yao Guo, Saurabh Chheda, Israel Koren, C. Mani Kri...

IPPS
2009
IEEE

133 views, Distributed And Parallel Com...

Exploring the effect of block shapes on the performance of sparse kernels

15 years 10 months ago

Download www.cslab.ece.ntua.gr

In this paper we explore the impact of the block shape on blocked and vectorized versions of the Sparse Matrix-Vector Multiplication (SpMV) kernel and build upon previous work by ...

Vasileios Karakasis, Georgios I. Goumas, Nectarios...

IPPS
2009
IEEE

181 views, Distributed And Parallel Com...

Designing multi-leader-based Allgather algorithms for multi-core clusters

15 years 10 months ago

Download www.cse.ohio-state.edu

The increasing demand for computational cycles is being met by the use of multi-core processors. Having large number of cores per node necessitates multi-core aware designs to ext...

Krishna Chaitanya Kandalla, Hari Subramoni, Gopala...

EUROPAR
2009
Springer

136 views, Distributed And Parallel Com...

PSPIKE: A Parallel Hybrid Sparse Linear System Solver

15 years 10 months ago

Download www.cs.purdue.edu

The availability of large-scale computing platforms comprised of tens of thousands of multicore processors motivates the need for the next generation of highly scalable sparse line...

Murat Manguoglu, Ahmed H. Sameh, Olaf Schenk

ICMCS
2008
IEEE

208 views, Multimedia

Fast computation of general Fourier Transforms on GPUs

15 years 10 months ago

Download research.microsoft.com

We present an implementation of general FFTs for graphics processing units (GPUs). Unlike most existing GPU FFT implementations, we handle both complex and real data of any size t...

Brandon Lloyd, Chas Boyd, Naga K. Govindaraju
