Tile QR factorization with parallel panel processing for multicore architectures



To exploit the potential of multicore architectures, recent dense linear algebra libraries have used tile algorithms, which consist of scheduling a Directed Acyclic Graph (DAG) of fine-grained tasks, where nodes represent tasks (either the factorization of a panel or the update of a block-column) and edges represent the dependencies among them. Although past approaches already achieve high performance on moderate and large square matrices, they process each panel sequentially, which limits performance when factorizing tall and skinny matrices or small square matrices. We present a new fully asynchronous method for computing a QR factorization on shared-memory multicore architectures that overcomes this bottleneck. Our contribution is to adapt an existing algorithm that performs a panel factorization in parallel (named Communication-Avoiding QR and initially designed for distributed-memory machines) to the context of tile algorithms using asynchronous computations. An experimental stud...
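
The contrast the abstract draws between sequential and parallel panel processing can be illustrated with a small scheduling sketch. It only counts elimination rounds for a panel split into p tiles and performs no numerical QR. The binary reduction tree is an assumption chosen for illustration (the excerpt does not say which tree the authors use), and the function names `flat_tree_rounds` and `binary_tree_rounds` are hypothetical.

```python
def flat_tree_rounds(p):
    """Sequential panel (assumed flat tree): tile 0 eliminates
    tiles 1..p-1 one after another, one elimination per round."""
    return [[(0, i)] for i in range(1, p)]


def binary_tree_rounds(p):
    """Parallel panel (assumed binary tree): surviving tiles are paired up.
    Each pair (a, b) means tile a eliminates tile b; the pairs within one
    round touch disjoint tiles, so they could run concurrently."""
    rounds = []
    stride = 1
    while stride < p:
        pairs = [(i, i + stride) for i in range(0, p - stride, 2 * stride)]
        rounds.append(pairs)
        stride *= 2
    return rounds


if __name__ == "__main__":
    p = 8  # number of tiles in one panel of a tall and skinny matrix
    print("flat tree rounds:  ", len(flat_tree_rounds(p)))
    print("binary tree rounds:", len(binary_tree_rounds(p)))
    for step, pairs in enumerate(binary_tree_rounds(p)):
        print("  round", step, pairs)
```

Under these assumptions, a panel of 8 tiles needs 7 dependent rounds with the flat tree but only 3 with the binary tree, which suggests why a sequential panel becomes the bottleneck when a matrix has many tile rows and few tile columns.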

Bilel Hadri, Hatem Ltaief, Emmanuel Agullo, Jack Dongarra


Distributed And Parallel Computing | IPPS 2010 | Multicore Architectures | Panel Factorization | Tile Algorithms |


» A Fully Empirical Autotuned Dense QR Factorization for Multicore Architectures

» A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures

» Scaling LAPACK panel operations using parallel cache assignment

» Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Mult...

» Parallel Two-Sided Matrix Reduction to Band Bidiagonal Form on Multicore Architectures

Post Info

Added	13 Feb 2011
Updated	13 Feb 2011
Type	Journal
Year	2010
Where	IPPS
Authors	Bilel Hadri, Hatem Ltaief, Emmanuel Agullo, Jack Dongarra


Sciweavers
