Search Sciweavers | Sciweavers

6 search results - page 1 / 2

» Optimising Memory Bandwidth Use for Matrix-Vector Multiplica...

176

click to vote

ARC
2010
Springer

387views Hardware» more ARC 2010»

Optimising Memory Bandwidth Use for Matrix-Vector Multiplication in Iterative Methods

15 years 10 months ago

Download cas.ee.ic.ac.uk

Computing the solution to a system of linear equations is a fundamental problem in scientiﬁc computing, and its acceleration has drawn wide interest in the FPGA community [1–3]...

David Boland, George A. Constantinides

claim paper

Read More »

154

click to vote

ICPP
2008
IEEE

139views Distributed And Parallel Com...» more ICPP 2008»

Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication Using Index and Value Compression

15 years 9 months ago

Download solar.cslab.ece.ntua.gr

Abstract—The Sparse Matrix-Vector Multiplication kernel exhibits limited potential for taking advantage of modern shared memory architectures due to its large memory bandwidth re...

Kornilios Kourtis, Georgios I. Goumas, Nectarios K...

claim paper

Read More »

159

Voted

ICPP
2009
IEEE

170views Distributed And Parallel Com...» more ICPP 2009»

Perfomance Models for Blocked Sparse Matrix-Vector Multiplication Kernels

15 years 9 months ago

Download www.cslab.ece.ntua.gr

—Sparse Matrix-Vector multiplication (SpMV) is a very challenging computational kernel, since its performance depends greatly on both the input matrix and the underlying architec...

Vasileios Karakasis, Georgios I. Goumas, Nectarios...

claim paper

Read More »

125

Voted

IFL
1999
Springer

108views Formal Methods» more IFL 1999»

Optimising Recursive Functions Yielding Multiple Results in Tuples in a Lazy Functional Language

15 years 7 months ago

Download www.st.cs.ru.nl

Abstract. We discuss a new optimisation for recursive functions yielding multiple results in tuples for lazy functional languages, like Clean and Haskell. This optimisation improve...

John H. G. van Groningen

claim paper

Read More »

204

click to vote

ARC
2012
Springer

317views Hardware» more ARC 2012»

A High Throughput FPGA-Based Implementation of the Lanczos Method for the Symmetric Extremal Eigenvalue Problem

13 years 11 months ago

Download cas.ee.ic.ac.uk

Iterative numerical algorithms with high memory bandwidth requirements but medium-size data sets (matrix size ∼ a few 100s) are highly appropriate for FPGA acceleration. This pap...

Abid Rafique, Nachiket Kapre, George A. Constantin...

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers