Faster matrix-vector multiplication on GeForce 8800GTX

14 years 9 months ago

Download www.nvidia.com

Recently a GPU has acquired programmability to perform general purpose computation fast by running ten thousands of threads concurrently. This paper presents a new algorithm for dense matrix-vector multiplication on NVIDIA CUDA architecture. The experimental results on GeForce 8800GTX show that the proposed algorithm runs maximum 15.69 (resp., 32.88) times faster than the sgemv routine in

N. Fujimoto

Real-time Traffic

Dense Matrix-vector Multiplication | Distributed And Parallel Computing | General Purpose Computation | IPPS 2008 | NVIDIA CUDA Architecture |

claim paper

Post Info
More Details (n/a)

Added	31 May 2010
Updated	31 May 2010
Type	Conference
Year	2008
Where	IPPS
Authors	N. Fujimoto

Comments (0)

Sciweavers

Faster matrix-vector multiplication on GeForce 8800GTX

Dense Matrix-vector Multiplication | Distributed And Parallel Computing | General Purpose Computation | IPPS 2008 | NVIDIA CUDA Architecture |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers