Search Sciweavers | Sciweavers

134 search results - page 2 / 27

» Locating cache performance bottlenecks using data profiling

click to vote

MICRO
1997
IEEE

90views Hardware» more MICRO 1997»

ProfileMe: Hardware Support for Instruction-Level Profiling on Out-of-Order Processors

13 years 11 months ago

Download waldspurger.org

Profile data is valuable for identifying performance bottlenecks and guiding optimizations. Periodic sampling of a processor's performance monitoring hardware is an effective...

Jeffrey Dean, James E. Hicks, Carl A. Waldspurger,...

claim paper

Read More »

click to vote

VALUETOOLS
2006
ACM

167views Hardware» more VALUETOOLS 2006»

Detailed cache simulation for detecting bottleneck, miss reason and optimization potentialities

14 years 1 months ago

Download itec.uka.de

Cache locality optimization is an eﬃcient way for reducing the idle time of modern processors in waiting for needed data. This kind of optimization can be achieved either on the...

Jie Tao, Wolfgang Karl

claim paper

Read More »

click to vote

LCTRTS
2007
Springer

161views System Software» more LCTRTS 2007»

Addressing instruction fetch bottlenecks by using an instruction register file

14 years 1 months ago

Download ww2.cs.fsu.edu

The Instruction Register File (IRF) is an architectural extension for providing improved access to frequently occurring instructions. An optimizing compiler can exploit an IRF by ...

Stephen Roderick Hines, Gary S. Tyson, David B. Wh...

claim paper

Read More »

click to vote

ISLPED
2004
ACM

137views Hardware» more ISLPED 2004»

Location cache: a low-power L2 cache system

14 years 1 months ago

Download www.ece.uc.edu

While set-associative caches incur fewer misses than directmapped caches, they typically have slower hit times and higher power consumption, when multiple tag and data banks are p...

Rui Min, Wen-Ben Jone, Yiming Hu

claim paper

Read More »

click to vote

CF
2006
ACM

203views Applied Computing» more CF 2006»

Intermediately executed code is the key to find refactorings that improve temporal data locality

13 years 9 months ago

Download escher.elis.UGent.be

The growing speed gap between memory and processor makes an efficient use of the cache ever more important to reach high performance. One of the most important ways to improve cac...

Kristof Beyls, Erik H. D'Hollander

claim paper

Read More »

« Prev « First page 2 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers