Search Sciweavers | Sciweavers

492 search results - page 75 / 99

» Predictable performance in SMT processors

140

click to vote

ARCS
2008
Springer

136views Software Engineering» more ARCS 2008»

An Optimized ZGEMM Implementation for the Cell BE

15 years 5 months ago

Download www.unixer.de

: The architecture of the IBM Cell BE processor represents a new approach for designing CPUs. The fast execution of legacy software has to stand back in order to achieve very high ...

Timo Schneider, Torsten Hoefler, Simon Wunderlich,...

claim paper

Read More »

142

click to vote

AAECC
2007
Springer

111views Algorithms» more AAECC 2007»

When cache blocking of sparse matrix vector multiply works and why

15 years 3 months ago

Download www.eecs.berkeley.edu

Abstract. We present new performance models and a new, more compact data structure for cache blocking when applied to the sparse matrixvector multiply (SpM×V) operation, y ← y +...

Rajesh Nishtala, Richard W. Vuduc, James Demmel, K...

claim paper

Read More »

123

click to vote

MICRO
2005
IEEE

114views Hardware» more MICRO 2005»

Address-Indexed Memory Disambiguation and Store-to-Load Forwarding

15 years 8 months ago

Download ipa.ece.illinois.edu

This paper describes a scalable, low-complexity alternative to the conventional load/store queue (LSQ) for superscalar processors that execute load and store instructions speculat...

Sam S. Stone, Kevin M. Woley, Matthew I. Frank

claim paper

Read More »

227

click to vote

ASPLOS
2009
ACM

137views Programming Languages» more ASPLOS 2009»

RapidMRC: approximating L2 miss rate curves on commodity systems for online optimizations

16 years 3 months ago

Download www.eecg.toronto.edu

Miss rate curves (MRCs) are useful in a number of contexts. In our research, online L2 cache MRCs enable us to dynamically identify optimal cache sizes when cache-partitioning a s...

David K. Tam, Reza Azimi, Livio Soares, Michael St...

claim paper

Read More »

121

click to vote

MICRO
2008
IEEE

138views Hardware» more MICRO 2008»

Hybrid analytical modeling of pending cache hits, data prefetching, and MSHRs

15 years 9 months ago

Download www.ece.ubc.ca

As the number of transistors integrated on a chip continues to increase, a growing challenge is accurately modeling performance in the early stages of processor design. Analytical...

Xi E. Chen, Tor M. Aamodt

claim paper

Read More »

« Prev « First page 75 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers