Improving support for locality and fine-grain sharing in chip multiprocessors

Both commercial and scientific workloads benefit from concurrency and exhibit data sharing across threads/processes. The resulting sharing patterns are often fine-grain, with the modified cache lines still residing in the writer’s primary cache when accessed. Chip multiprocessors present an opportunity to optimize for fine-grain sharing using direct access to remote processor components through low-latency on-chip interconnects. In this paper, we present Adaptive Replication, Migration, and producer-Consumer Optimization (ARMCO), a coherence protocol that, to the best of our knowledge, is the first to exploit direct access to the L1 caches of remote processors (rather than via coherence mechanisms) in order to support fine-grain sharing. Our goal is to provide support for tightly coupled sharing by recognizing and adapting to common sharing patterns such as migratory, producer-consumer, multiple-reader, and multiple read-write. The protocol places data close to where it is mos...

Hemayet Hossain, Sandhya Dwarkadas, Michael C. Huang

Direct Access | Hardware | IEEEPACT 2008 | Sharing | Sharing Patterns |

Post Info

Added	31 May 2010
Updated	31 May 2010
Type	Conference
Year	2008
Where	IEEEPACT
Authors	Hemayet Hossain, Sandhya Dwarkadas, Michael C. Huang
