Search Sciweavers | Sciweavers

36 search results - page 3 / 8

» LAPACK: a portable linear algebra library for high-performan...

click to vote

ICS
1997
Tsinghua U.

117views Distributed And Parallel Com...» more ICS 1997»

Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology

13 years 11 months ago

Download www.icsi.berkeley.edu

Modern microprocessors can achieve high performance on linear algebra kernels but this currently requires extensive machine-speci c hand tuning. We have developed a methodology wh...

Jeff Bilmes, Krste Asanovic, Chee-Whye Chin, James...

claim paper

Read More »

click to vote

PARA
1995
Springer

174views Applied Computing» more PARA 1995»

A Proposal for a Set of Parallel Basic Linear Algebra Subprograms

13 years 11 months ago

Download phase.hpcc.jp

This paper describes a proposal for a set of Parallel Basic Linear Algebra Subprograms PBLAS. The PBLAS are targeted at distributed vector-vector, matrix-vector and matrixmatrix...

Jaeyoung Choi, Jack Dongarra, Susan Ostrouchov, An...

claim paper

Read More »

click to vote

IPPS
2006
IEEE

216views Distributed And Parallel Com...» more IPPS 2006»

Algorithm-based checkpoint-free fault tolerance for parallel matrix computations on volatile resources

14 years 1 months ago

Download icl.cs.utk.edu

As the desire of scientists to perform ever larger computations drives the size of today’s high performance computers from hundreds, to thousands, and even tens of thousands of ...

Zizhong Chen, Jack Dongarra

claim paper

Read More »

click to vote

SC
2009
ACM

240views Applied Computing» more SC 2009»

Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems

14 years 2 months ago

Download www.cs.utk.edu

This paper presents a dynamic task scheduling approach to executing dense linear algebra algorithms on multicore systems (either shared-memory or distributed-memory). We use a tas...

Fengguang Song, Asim YarKhan, Jack Dongarra

claim paper

Read More »

click to vote

PPOPP
2010
ACM

222views Distributed and Parallel Com...» more PPOPP 2010»

Scaling LAPACK panel operations using parallel cache assignment

14 years 4 months ago

Download www.cs.utsa.edu

In LAPACK many matrix operations are cast as block algorithms which iteratively process a panel using an unblocked algorithm and then update a remainder matrix using the high perf...

Anthony M. Castaldo, R. Clint Whaley

claim paper

Read More »

« Prev « First page 3 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers