Revisiting Cache Block Superloading

15 years 9 months ago

Download www.csl.cornell.edu

Abstract. Technological advances and increasingly complex and dynamic application behavior argue for revisiting mechanisms that adapt logical cache block size to application characteristics. This approach to bridging the processor/memory performance gap has been studied before, but mostly via trace-driven simulation, looking only at L1 caches. Given changes in hardware/software technology, we revisit the general approach: we propose a transparent, phase-adaptive, low-complexity mechanism for L2 superloading and evaluate it on a full-system simulator for 23 SPEC CPU2000 codes. Targeting L2 beneﬁts instruction and data fetches. We investigate cache blocks of 32-512B, conﬁrming that no ﬁxed size performs well for all applications: diﬀerences range from 5-49% between best and worst ﬁxed block sizes. Our scheme obtains performance similar to the per application best static block size. In a few cases, we minimally decrease performance compared to the best static size, but best size...

Matthew A. Watkins, Sally A. McKee, Lambert Schael

Real-time Traffic

Block Size | Cache Block Size | Cache Blocks | HIPEAC 2009 |

claim paper

Added	25 Jul 2010
Updated	25 Jul 2010
Type	Conference
Year	2009
Where	HIPEAC
Authors	Matthew A. Watkins, Sally A. McKee, Lambert Schaelicke

Sciweavers

Revisiting Cache Block Superloading

Block Size | Cache Block Size | Cache Blocks | HIPEAC 2009 |

Explore & Download

Productivity Tools

Sciweavers