Large, high-density FPGAs with high local distributed memory bandwidth surpass the peak floating-point performance of high-end, general-purpose processors. Microprocessors deliver well below their peak floating-point performance on efficient algorithms built around the Sparse Matrix-Vector Multiply (SMVM) kernel; in fact, it is not uncommon for microprocessors to yield only 10–20% of their peak floating-point performance when computing SMVM. We develop and analyze a scalable SMVM implementation on modern FPGAs and show that it can sustain high-throughput, near-peak floating-point performance. For benchmark matrices from