Search Sciweavers | Sciweavers

24 search results - page 4 / 5

» Computing Rank-Revealing QR Factorizations of Dense Matrices

click to vote

CORR
2010
Springer

153views Education» more CORR 2010»

Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures

13 years 7 months ago

Download vecpar.fe.up.pt

The algorithms in the current sequential numerical linear algebra libraries (e.g. LAPACK) do not parallelize well on multicore architectures. A new family of algorithms, the tile a...

Emmanuel Agullo, Henricus Bouwmeester, Jack Dongar...

claim paper

Read More »

click to vote

EUROPAR
2003
Springer

157views Distributed And Parallel Com...» more EUROPAR 2003»

Improving Performance of Hypermatrix Cholesky Factorization

14 years 18 days ago

Download people.ac.upc.edu

Abstract. This paper shows how a sparse hypermatrix Cholesky factorization can be improved. This is accomplished by means of eﬃcient codes which operate on very small dense matri...

José R. Herrero, Juan J. Navarro

claim paper

Read More »

click to vote

PPOPP
2010
ACM

222views Distributed and Parallel Com...» more PPOPP 2010»

Scaling LAPACK panel operations using parallel cache assignment

14 years 4 months ago

Download www.cs.utsa.edu

In LAPACK many matrix operations are cast as block algorithms which iteratively process a panel using an unblocked algorithm and then update a remainder matrix using the high perf...

Anthony M. Castaldo, R. Clint Whaley

claim paper

Read More »

click to vote

IPPS
2009
IEEE

88views Distributed And Parallel Com...» more IPPS 2009»

Minimizing startup costs for performance-critical threading

14 years 2 months ago

Download www.cs.utsa.edu

—Using the well-known ATLAS and LAPACK dense linear algebra libraries, we demonstrate that the parallel management overhead (PMO) can grow with problem size on even statically sc...

Anthony M. Castaldo, R. Clint Whaley

claim paper

Read More »

click to vote

IPPS
2010
IEEE

117views Distributed And Parallel Com...» more IPPS 2010»

Performance evaluation of concurrent collections on high-performance multicore computing systems

13 years 5 months ago

Download vuduc.org

This paper is the first extensive performance study of a recently proposed parallel programming model, called Concurrent Collections (CnC). In CnC, the programmer expresses her co...

Aparna Chandramowlishwaran, Kathleen Knobe, Richar...

claim paper

Read More »

« Prev « First page 4 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers