Build to order linear algebra kernels

14 years 9 months ago

Download rintintin.colorado.edu

—The performance bottleneck for many scientiﬁc applications is the cost of memory access inside linear algebra kernels. Tuning such kernels for memory efﬁciency is a complex task that degrades the productivity of computational scientists. Software libraries such as the Basic Linear Algebra Subprograms (BLAS) ameliorate this problem by providing a standard interface for which computer scientists and hardware vendors have created highly-tuned implementations. Scientiﬁc applications often require a sequence of BLAS operations, which presents further opportunities for memory optimization. However, because BLAS are tuned in isolation they do not take advantage of these opportunities. This phenomenon motivated the recent addition of several routines to the BLAS that each perform a sequence operations. Unfortunately, the exact sequence of operations needed in a given situation is highly application dependent, so many more such routines are needed. In this paper we present preliminary ...

Jeremy G. Siek, Ian Karlin, Elizabeth R. Jessup

Real-time Traffic

Basic Linear Algebra | BLAS | Distributed And Parallel Computing | IPPS 2008 | Linear Algebra |

claim paper

Post Info
More Details (n/a)

Added	31 May 2010
Updated	31 May 2010
Type	Conference
Year	2008
Where	IPPS
Authors	Jeremy G. Siek, Ian Karlin, Elizabeth R. Jessup

Comments (0)

Sciweavers

Build to order linear algebra kernels

Basic Linear Algebra | BLAS | Distributed And Parallel Computing | IPPS 2008 | Linear Algebra |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers