A Fully Empirical Autotuned Dense QR Factorization for Multicore Architectures

13 years 5 days ago

Download www.netlib.org

: Tuning numerical libraries has become more diﬃcult over time, as systems get more sophisticated. In particular, modern multicore machines make the behaviour of algorithms hard to forecast and model. In this paper, we tackle the issue of tuning a dense QR factorization on multicore architectures. We show that it is hard to rely on a model, which motivates us to design a fully empirical approach. We exhibit few strong empirical properties that enable us to eﬃciently prune the search space. Our method is automatic, fast and reliable. The tuning process is indeed fully performed at install time in less than one and ten minutes on ﬁve out of seven platforms. We achieve an average performance varying from 97% to 100% of the optimum performance depending on the platform. This work is a basis for autotuning the PLASMA library and enabling easy performance portability across hardware systems. Key-words: Autotuning, empirical tuning, multicore, dense linear algebra, QR factorization ∗ ...

Emmanuel Agullo, Jack Dongarra, Rajib Nath, Stanim

Real-time Traffic

Dense Linear Algebra | Distributed And Parallel Computing | EUROPAR 2011 | Performance Portability | QR Factorization |

claim paper

Post Info
More Details (n/a)

Added	20 Dec 2011
Updated	20 Dec 2011
Type	Journal
Year	2011
Where	EUROPAR
Authors	Emmanuel Agullo, Jack Dongarra, Rajib Nath, Stanimire Tomov

Comments (0)

Sciweavers

A Fully Empirical Autotuned Dense QR Factorization for Multicore Architectures

Dense Linear Algebra | Distributed And Parallel Computing | EUROPAR 2011 | Performance Portability | QR Factorization |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers