Scalable cache memory design for large-scale SMT architectures

14 years 7 months ago

Download www.cs.utah.edu

The cache hierarchy design in existing SMT and superscalar processors is optimized for latency, but not for bandwidth. The size of the L1 data cache did not scale over the past decade. Instead, larger unified L2 and L3 caches were introduced. This cache hierarchy has a high overhead due to the principle of containment. It also has a complex design to maintain cache coherence across all levels. Furthermore, this cache hierarchy is not suitable for future large-scale SMT processors, which will demand high bandwidth instruction and data caches with a large number of ports. This paper suggests the elimination of the cache hierarchy and replacing it with one-level caches for instruction and data. Multiple instruction caches can be used in parallel to scale the instruction fetch bandwidth and the overall cache capacity. A one-level data cache can be split into a number of block-interleaved cache banks to serve multiple memory requests in parallel. An interconnect is used to connect the data ...

Muhamed F. Mudawar

Real-time Traffic

Cache | Cache Hierarchy | Data Cache | Hardware | WMPI 2004 |

claim paper

Post Info
More Details (n/a)

Added	30 Jun 2010
Updated	30 Jun 2010
Type	Conference
Year	2004
Where	WMPI
Authors	Muhamed F. Mudawar

Comments (0)

Sciweavers

Scalable cache memory design for large-scale SMT architectures

Cache | Cache Hierarchy | Data Cache | Hardware | WMPI 2004 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers