PPOPP
2009
ACM

Solving dense linear systems on platforms with multiple hardware accelerators

In a previous paper we showed how the FLAME methods and tools make it possible to compute dense linear algebra operations on a multi-GPU platform with reasonable performance while requiring little programming effort. In this paper we generalize the approach to systems with multiple hardware accelerators, and incorporate software implementations of standard cache/memory coherence techniques from computer architecture to improve performance. Our experimental evaluation on an NVIDIA Tesla S870 platform delivers a peak performance well over 400 GFLOPS.
Added 25 Nov 2009
Updated 25 Nov 2009
Type Conference
Year 2009
Where PPOPP
Authors Enrique S. Quintana-Ortí, Francisco D. Igual, Gregorio Quintana-Ortí, Robert A. van de Geijn