cache misses | Sciweavers

168

HIPC
2007
Springer

112views Distributed And Parallel Com...» more HIPC 2007»

Direct Coherence: Bringing Together Performance and Scalability in Shared-Memory Multiprocessors

15 years 11 months ago

Traditional directory-based cache coherence protocols suﬀer from long-latency cache misses as a consequence of the indirection introduced by the home node, which must be accessed...

Alberto Ros, Manuel E. Acacio, José M. Garc...

claim paper

Read More »

118

click to vote

IPPS
2007
IEEE

116views Distributed And Parallel Com...» more IPPS 2007»

Performance Analysis of a Family of WHT Algorithms

15 years 11 months ago

Download www.cs.drexel.edu

This paper explores the correlation of instruction counts and cache misses to runtime performance for a large family of divide and conquer algorithms to compute the Walsh–Hadama...

Michael Andrews, Jeremy Johnson

claim paper

Read More »

141

click to vote

ICSEA
2007
IEEE

85views Software Engineering» more ICSEA 2007»

A Model for the Effect of Caching on Algorithmic Efficiency in Radix based Sorting

15 years 11 months ago

Download heim.ifi.uio.no

— This paper demonstrates that the algorithmic performance of end user programs may be greatly affected by the two or three level caching scheme of the processor, and we introduc...

Arne Maus, Stein Gjessing

claim paper

Read More »

147

Voted

DATE
2008
IEEE

171views Hardware» more DATE 2008»

Cache Aware Mapping of Streaming Applications on a Multiprocessor System-on-Chip

15 years 11 months ago

Download www.date-conference.com

Efﬁcient use of the memory hierarchy is critical for achieving high performance in a multiprocessor systemon-chip. An external memory that is shared between processors is a bottl...

Arno Moonen, Marco Bekooij, Rene van den Berg, Jef...

claim paper

Read More »

135

click to vote

DATE
2008
IEEE

165views Hardware» more DATE 2008»

Dynamic Round-Robin Task Scheduling to Reduce Cache Misses for Embedded Systems

15 years 11 months ago

Download www.date-conference.com

Modern embedded CPU systems rely on a growing number of software features, but this growth increases the memory footprint and increases the need for efficient instruction and data...

Ken W. Batcher, Robert A. Walker

claim paper

Read More »

150

click to vote

HPCA
2005
IEEE

124views Distributed And Parallel Com...» more HPCA 2005»

Using Virtual Load/Store Queues (VLSQs) to Reduce the Negative Effects of Reordered Memory Instructions

16 years 5 months ago

Download www.ece.umd.edu

The use of large instruction windows coupled with aggressive out-oforder and prefetching capabilities has provided significant improvements in processor performance. In this paper...

Aamer Jaleel, Bruce L. Jacob

claim paper

Read More »

147

click to vote

DAC
2008
ACM

160views Computer Architecture» more DAC 2008»

Latency and bandwidth efficient communication through system customization for embedded multiprocessors

16 years 6 months ago

Download www.ece.umd.edu

We present a cross-layer customization methodology for latency and bandwidth efficient inter-core communication in embedded multiprocessors. The methodology integrates compiler, o...

Chenjie Yu, Peter Petrov

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers