A Vector Caching Scheme for Streaming FPGA SpMV Accelerators

The sparse matrix-vector multiplication (SpMV) kernel is important for many scientific computing applications. Implementing SpMV in a way that best utilizes hardware resources is challenging due to input-dependent memory access patterns. FPGA-based accelerators that buffer the entire irregular-access part in on-chip memory enable highly efficient SpMV implementations, but are limited to smaller matrices due to on-chip memory limits. Conversely, conventional caches can work with large matrices, but cache misses can cause many stalls that decrease efficiency. In this paper, we explore the intersection between these approaches and attempt to combine the strengths of each. We propose a hardware-software caching scheme that exploits preprocessing to enable performant and area-effective SpMV acceleration. Our experiments with a set of large sparse matrices indicate that our scheme can achieve nearly
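To illustrate the "input-dependent memory access patterns" the abstract mentions, here is a minimal software sketch of SpMV. It assumes the matrix is stored in compressed sparse row (CSR) form, which the abstract does not specify, and the names `spmv_csr`, `row_ptr`, `col_idx`, and `values` are hypothetical. It is not the authors' caching scheme; it only shows which read is irregular.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a matrix A stored in CSR form (assumed layout)."""
    y = [0.0] * (len(row_ptr) - 1)
    for row in range(len(y)):
        acc = 0.0
        for k in range(row_ptr[row], row_ptr[row + 1]):
            # values[k] and col_idx[k] are read in order (streaming access).
            # x[col_idx[k]] is read at a position chosen by the matrix
            # contents, so this is the irregular, input-dependent access.
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


# Small 3x3 example matrix:
# [[1, 0, 2],
#  [0, 3, 0],
#  [4, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [1.0, 2.0, 3.0, 4.0, 5.0]
x = [1.0, 2.0, 3.0]
y = spmv_csr(row_ptr, col_idx, values, x)
```

In this sketch, only the reads of `x` jump around in memory. That is the part an accelerator would either hold entirely on chip or serve through a cache, which is the trade-off the abstract describes.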

Yaman Umuroglu, Magnus Jahre

ARC 2015 | Hardware

Post Info

Added	16 Apr 2016
Updated	16 Apr 2016
Type	Journal
Year	2015
Where	ARC
Authors	Yaman Umuroglu, Magnus Jahre
