Optimising Memory Bandwidth Use for Matrix-Vector Multiplication in Iterative Methods

16 years 2 months ago

Download cas.ee.ic.ac.uk

Computing the solution to a system of linear equations is a fundamental problem in scientiﬁc computing, and its acceleration has drawn wide interest in the FPGA community [1–3]. One class of algorithms to solve these systems, iterative methods, has drawn particular interest, with recent literature showing large performance improvements over general purpose processors (GPPs). In several iterative methods, this performance gain is largely a result of parallelisation of the matrixvector multiplication, an operation that occurs in many applications and hence has also been widely studied on FPGAs [4, 5]. However, whilst the performance of matrix-vector multiplication on FPGAs is generally I/O bound [4], the nature of iterative methods allows the use of onchip memory buﬀers to increase the bandwidth, providing the potential for signiﬁcantly more parallelism [6]. Unfortunately, existing approaches have generally only either been capable of solving large matrices with limited improveme...

David Boland, George A. Constantinides

Real-time Traffic

ARC 2010 | Hardware | Iterative Methods | Large Performance Improvements | Performance |

claim paper

Post Info
More Details (n/a)

Added	18 May 2010
Updated	18 May 2010
Type	Conference
Year	2010
Where	ARC
Authors	David Boland, George A. Constantinides

Comments (0)

Sciweavers

Optimising Memory Bandwidth Use for Matrix-Vector Multiplication in Iterative Methods

ARC 2010 | Hardware | Iterative Methods | Large Performance Improvements | Performance |

Explore & Download

Productivity Tools

Sciweavers