Search Sciweavers | Sciweavers

38 search results - page 3 / 8

» Parallel Tiled QR Factorization for Multicore Architectures

ASPLOS
2009
ACM

248 views, Programming Languages

QR decomposition on GPUs

14 years 9 months ago

Download users.ece.gatech.edu

QR decomposition is a computationally intensive linear algebra operation that factors a matrix A into the product of a unitary matrix Q and an upper triangular matrix R. Adaptive sys...

Andrew Kerr, Dan Campbell, Mark Richards

SC
2009
ACM

240 views, Applied Computing

Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems

14 years 3 months ago

Download www.cs.utk.edu

This paper presents a dynamic task scheduling approach to executing dense linear algebra algorithms on multicore systems (either shared-memory or distributed-memory). We use a tas...

Fengguang Song, Asim YarKhan, Jack Dongarra

AAECC
2007
Springer

87 views, Algorithms

Towards an accurate performance modeling of parallel sparse factorization

13 years 8 months ago

Download www-rocq.inria.fr

We present a simulation-based performance model to analyze a parallel sparse LU factorization algorithm on modern cache-based, high-end parallel architectures. We consider supern...

Laura Grigori, Xiaoye S. Li

JEC
2006

88 views

Synchroscalar: Evaluation of an embedded, multi-core architecture for media applications

13 years 8 months ago

Download www.cs.ucsb.edu

We present an overview of the Synchroscalar single-chip, multi-core processor. Through the design of Synchroscalar, we find that high energy efficiency and low complexity can be a...

John Oliver, Ravishankar Rao, Diana Franklin, Fred...

PPOPP
2010
ACM

222 views, Distributed and Parallel Com...

Scaling LAPACK panel operations using parallel cache assignment

14 years 5 months ago

Download www.cs.utsa.edu

In LAPACK many matrix operations are cast as block algorithms which iteratively process a panel using an unblocked algorithm and then update a remainder matrix using the high perf...

Anthony M. Castaldo, R. Clint Whaley
