Efficient Floating-point Based Block LU Decomposition on FPGAs

In this paper, we propose an architecture for floating-point based LU decomposition of large matrices. The architecture is based on the well-known concept of blocking and uses pipelined floating-point units to obtain high throughput. We first analyze the effects of block size and of the deeply pipelined floating-point units on the performance of the architecture. We then compare the performance of our double-precision design with that of a general-purpose processor (GPP) based design. Initial results show that an improvement of up to 23x in total computation time can be achieved. Finally, we analyze the impact of algorithm-level design (by varying block size) on the system-wide energy dissipation and resource usage of our designs.
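The blocking idea mentioned in the abstract can be illustrated in software. The sketch below is not the paper's FPGA architecture; it is a minimal, standard right-looking block LU factorization without pivoting, written in pure Python. The function names `block_lu` and `reconstruct` and the block-size parameter `b` are hypothetical, and the code assumes a square matrix whose leading minors are nonzero (for example, a strictly diagonally dominant matrix), since no pivoting is done.

```python
def block_lu(a, b):
    """Factor the square matrix `a` (list of lists of floats) in place.

    On return, the strict lower triangle holds L (unit diagonal implied)
    and the upper triangle holds U. `b` is the block size.
    Assumption: no pivoting is needed for this matrix.
    """
    n = len(a)
    for k in range(0, n, b):
        e = min(k + b, n)  # end of the current diagonal block

        # Step 1: unblocked LU of the diagonal block A11 = L11 * U11.
        for j in range(k, e):
            for i in range(j + 1, e):
                a[i][j] /= a[j][j]
                for c in range(j + 1, e):
                    a[i][c] -= a[i][j] * a[j][c]

        # Step 2: U12 = L11^{-1} * A12 (forward substitution, unit diagonal).
        for j in range(k, e):
            for i in range(j + 1, e):
                for c in range(e, n):
                    a[i][c] -= a[i][j] * a[j][c]

        # Step 3: L21 = A21 * U11^{-1} (solve against the upper triangle).
        for i in range(e, n):
            for j in range(k, e):
                for p in range(k, j):
                    a[i][j] -= a[i][p] * a[p][j]
                a[i][j] /= a[j][j]

        # Step 4: trailing update A22 -= L21 * U12 (a matrix multiply).
        for i in range(e, n):
            for c in range(e, n):
                s = 0.0
                for p in range(k, e):
                    s += a[i][p] * a[p][c]
                a[i][c] -= s
    return a


def reconstruct(lu):
    """Multiply the packed L and U factors back together (for checking)."""
    n = len(lu)
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            s = 0.0
            for p in range(min(i, j) + 1):
                l_ip = 1.0 if p == i else lu[i][p]  # unit diagonal of L
                s += l_ip * lu[p][j]
            out[i][j] = s
    return out
```

In this formulation most of the arithmetic falls in step 4, a matrix multiplication whose size depends on `b`. That is one plausible reason block size is a natural tuning parameter for a design built from deeply pipelined floating-point units, although the abstract does not state how the proposed architecture maps these steps to hardware.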

Gokul Govindu, Viktor K. Prasanna, Vikash Daga, Sridhar Gangadharpalli, V. Sridhar

Block Size | ERSA 2004 | ERSA 2007 | Floating-point Based LU | Pipelined Floating-point Units

Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	ERSA
Authors	Gokul Govindu, Viktor K. Prasanna, Vikash Daga, Sridhar Gangadharpalli, V. Sridhar
