Improving Memory-System Performance of Sparse Matrix-Vector Multiplication

14 years 2 months ago

Download www.cs.tau.ac.il

Sparse matrix-vector multiplication is an important kernel that often runs ineﬃciently on superscalar RISC processors. This paper describes techniques that increase instruction-level parallelism and improve performance. The techniques include reordering to reduce cache misses originally due to Das et al., blocking to reduce load instructions, and prefetching to prevent multiple load-store units from stalling simultaneously. The techniques improve performance from about 40 Mﬂops (on a well-ordered matrix) to over 100 Mﬂops on a 266 Mﬂops machine.

Sivan Toledo

Real-time Traffic

PPSC 1997 | PPSC 2001 | Sparse Matrix-vector Multiplication | Superscalar Risc Processors | Techniques Improve Performance |

claim paper

Post Info
More Details (n/a)

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	1997
Where	PPSC
Authors	Sivan Toledo

Comments (0)

Sciweavers

Improving Memory-System Performance of Sparse Matrix-Vector Multiplication

PPSC 1997 | PPSC 2001 | Sparse Matrix-vector Multiplication | Superscalar Risc Processors | Techniques Improve Performance |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers