Scheduling of QR Factorization Algorithms on SMP and Multi-Core Architectures

Download userweb.cs.utexas.edu

This paper examines the scalable parallel implementation of QR factorization of a general matrix, targeting SMP and multi-core architectures. Two implementations of algorithms-by-blocks are presented. Each implementation views a block of a matrix as the fundamental unit of data and, likewise, operations over these blocks as the primary unit of computation. The first is a conventional blocked algorithm similar to those included in libFLAME and LAPACK but expressed in a way that allows operations in the so-called critical path of execution to be computed as soon as their dependencies are satisfied. The second algorithm captures a higher degree of parallelism with an approach based on Givens rotations while preserving the performance benefits of algorithms based on blocked Householder transformations. We show that the implementation effort is greatly simplified by expressing the algorithms in code with the FLAME/FLASH API, which allows matrices stored by blocks to be viewed and mana...
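The idea of treating block operations as tasks that run as soon as their dependencies are satisfied can be illustrated with a small sketch. The code below is not the paper's implementation and does not use the FLAME/FLASH API. It assumes the commonly described tiled QR loop structure, and the kernel names (`FACTOR`, `APPLY`, `FACTOR2`, `APPLY2`) and the tile-level read/write sets are hypothetical, chosen only for illustration. It lists the block tasks for an `nt` x `nt` grid of tiles, derives dependencies from the tiles each task touches, and groups tasks into waves that could run concurrently.

```python
from collections import defaultdict


def tile_qr_tasks(nt):
    """List (name, reads, writes) for a tiled QR sweep over nt x nt tiles.

    Kernel names and access sets are illustrative assumptions.
    Every written tile is treated as read-modify-write.
    """
    tasks = []
    for k in range(nt):
        # Factor the diagonal tile.
        tasks.append((("FACTOR", k), set(), {(k, k)}))
        # Apply that factorization to the tiles right of the diagonal.
        for j in range(k + 1, nt):
            tasks.append((("APPLY", k, j), {(k, k)}, {(k, j)}))
        for i in range(k + 1, nt):
            # Annihilate tile (i, k) against the diagonal tile.
            tasks.append((("FACTOR2", i, k), set(), {(k, k), (i, k)}))
            # Update the matching pair of tiles in each trailing column.
            for j in range(k + 1, nt):
                tasks.append((("APPLY2", i, k, j), {(i, k)}, {(k, j), (i, j)}))
    return tasks


def build_dependencies(tasks):
    """Derive dependencies from tile accesses, tracked per whole tile.

    This is conservative: a real runtime might track parts of a tile
    separately and so expose more parallelism.
    """
    last_writer = {}
    readers = defaultdict(list)  # read-only users since the last write
    deps = {}
    for name, reads, writes in tasks:
        d = set()
        for tile in reads | writes:
            if tile in last_writer:
                d.add(last_writer[tile])  # wait for the latest writer
        for tile in writes:
            d.update(readers[tile])  # wait for readers of the old value
        deps[name] = d
        for tile in reads:
            readers[tile].append(name)
        for tile in writes:
            last_writer[tile] = name
            readers[tile] = []
    return deps


def schedule_levels(tasks, deps):
    """Assign each task the earliest wave in which it can run."""
    level = {}
    for name, _, _ in tasks:  # program order is a valid topological order
        level[name] = 1 + max((level[d] for d in deps[name]), default=-1)
    return level


if __name__ == "__main__":
    tasks = tile_qr_tasks(3)
    levels = schedule_levels(tasks, build_dependencies(tasks))
    waves = defaultdict(list)
    for name, lvl in levels.items():
        waves[lvl].append(name)
    for lvl in sorted(waves):
        print(lvl, waves[lvl])
```

Tasks in the same wave have no dependencies on one another, so a scheduler could dispatch them to different cores. The number of waves gives the length of the critical path under these assumed access sets.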

Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Ernie Chan, Robert A. van de Geijn, Field G. Van Zee

Conventional Blocked Algorithm | Distributed And Parallel Computing | PDP 2008 | Scalable Parallel Implementation | So-called Critical Path

» Tile QR factorization with parallel panel processing for multicore architectures

» A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures

» ULE A Modern Scheduler for FreeBSD

» Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Mult...

Post Info

Added	01 Jun 2010
Updated	01 Jun 2010
Type	Conference
Year	2008
Where	PDP
Authors	Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Ernie Chan, Robert A. van de Geijn, Field G. Van Zee
