Using supplier locality in power-aware interconnects and caches in chip multiprocessors

14 years 12 days ago

Download www.ece.uvic.ca

Conventional snoopy-based chip multiprocessors take an aggressive approach broadcasting snoop requests to all nodes. In addition each node checks all received requests. This approach reduces the latency of cache to cache transfer misses at the expense of increasing power. In this paper we show that a large portion of interconnect/cache transactions are redundant as many snoop requests miss in the remote nodes. We exploit this inefficiency and introduce power optimization techniques for chip multiprocessors. Our optimizations rely on the observation that in a snoopy-based shared memory system the data supplier can be predicted with high accuracy. Our optimizations reduce power by eliminating unnecessary activity at both the requester and the supplier end of snoop requests. We reduce power as we (a) avoid broadcasting snoop requests to all processors and (b) avoid tag lookup for all nodes and for all requests arriving. In particular, we use supplier locality and introduce the following ...

Ehsan Atoofian, Amirali Baniasadi

Real-time Traffic

Chip Multiprocessors | JSA 2008 | Snoop Requests | Tag Lookup |

claim paper

Post Info
More Details (n/a)

Added	13 Dec 2010
Updated	13 Dec 2010
Type	Journal
Year	2008
Where	JSA
Authors	Ehsan Atoofian, Amirali Baniasadi

Comments (0)

Sciweavers

Using supplier locality in power-aware interconnects and caches in chip multiprocessors

Chip Multiprocessors | JSA 2008 | Snoop Requests | Tag Lookup |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers