Reconfigurable Sparse Matrix-Vector Multiplication on FPGAs

15 years 2 months ago

Download www.ecs.umass.edu

Cache-based, general purpose CPUs perform at a small fraction of their maximum floating point performance when executing memory-intensive simulations, such as those required for sparse matrix-vector multiplication. This effect is due to the memory bottleneck that is encountered with large arrays that must be stored in dynamic RAM. An FPGA core designed for a target performance that does not unnecessarily exceed the memory imposed bottleneck can be distributed, along with multiple memory interfaces, into a scalable architecture that overcomes the bandwidth limitation of a single interface. Interconnected cores can work together to solve a computing problem and exploit a bandwidth that is the sum of the bandwidth available from all of their connected memory interfaces. This work demonstrates this concept of scalability with two memory interfaces through the use of an available FPGA prototyping platform. It is shown that our reconfigurable approach is scalable as performance roughly doubl...

Russell Tessier, Salma Mirza, J. Blair Perot

Real-time Traffic

ERSA 2010 | FPGAs | Hardware | Maximum Floating Point | Sparse Matrix-vector Multiplication |

claim paper

» Sparse MatrixVector Multiplication for Finite Element Method Matrices on FPGAs

» Sparse MatrixVector Multiplication Design on FPGAs

» Sparse MatrixVector multiplication on FPGAs

Post Info
More Details (n/a)

Added	11 Feb 2011
Updated	11 Feb 2011
Type	Journal
Year	2010
Where	ERSA
Authors	Russell Tessier, Salma Mirza, J. Blair Perot

Comments (0)

Sciweavers

Reconfigurable Sparse Matrix-Vector Multiplication on FPGAs

ERSA 2010 | FPGAs | Hardware | Maximum Floating Point | Sparse Matrix-vector Multiplication |

Explore & Download

Productivity Tools

Sciweavers