Sciweavers

40 search results - page 5 / 8
» Parallel Dense Gauss-Seidel Algorithm on Many-Core Processor...
Sort
View
CORR
2008
Springer
76views Education» more  CORR 2008»
13 years 7 months ago
Communication-avoiding parallel and sequential QR factorizations
We present parallel and sequential dense QR factorization algorithms that are optimized to avoid communication. Some of these are novel, and some extend earlier work. Communicatio...
James Demmel, Laura Grigori, Mark Hoemmen, Julien ...
PLDI
1993
ACM
13 years 11 months ago
Global Optimizations for Parallelism and Locality on Scalable Parallel Machines
Data locality is critical to achievinghigh performance on large-scale parallel machines. Non-local data accesses result in communication that can greatly impact performance. Thus ...
Jennifer-Ann M. Anderson, Monica S. Lam
ISPA
2004
Springer
14 years 23 days ago
Parallel Volume Rendering with Early Ray Termination for Visualizing Large-Scale Datasets
Abstract. This paper presents an efficient parallel algorithm for volume rendering of large-scale datasets. Our algorithm focuses on an optimization technique, namely early ray te...
Manabu Matsui, Fumihiko Ino, Kenichi Hagihara
TPDS
2008
97views more  TPDS 2008»
13 years 7 months ago
Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization
: The STI CELL processor introduces pioneering solutions in processor architecture. At the same time it presents new challenges for the development of numerical algorithms. One is ...
Jakub Kurzak, Alfredo Buttari, Jack Dongarra
ICPP
2008
IEEE
14 years 1 months ago
Thermal Management for 3D Processors via Task Scheduling
A rising horizon in chip fabrication is the 3D integration technology. It stacks two or more dies vertically with a dense, high-speed interface to increase the device density and ...
Xiuyi Zhou, Yi Xu, Yu Du, Youtao Zhang, Jun Yang 0...