Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors

16 years 2 months ago

Download www.eecg.toronto.edu

The major chip manufacturers have all introduced chip multiprocessing (CMP) and simultaneous multithreading (SMT) technology into their processing units. As a result, even low-end computing systems and game consoles have become shared memory multiprocessors with L1 and L2 cache sharing within a chip. Mid- and large-scale systems will have multiple processing chips and hence consist of an SMPCMP-SMT conﬁguration with non-uniform data sharing overheads. Current operating system schedulers are not aware of these new cache organizations, and as a result, distribute threads across processors in a way that causes many unnecessary, long-latency cross-chip cache accesses. In this paper we describe the design and implementation of a scheme to schedule threads based on sharing patterns detected online using features of standard performance monitoring units (PMUs) available in today’s processing units. The primary advantage of using the PMU infrastructure is that it is ﬁne-grained (down to...

David K. Tam, Reza Azimi, Michael Stumm

Real-time Traffic

Cache Line | Cross-chip Cache Accesses | EUROSYS 2007 | Performance Monitoring Units | System Software |

claim paper

Added	10 Mar 2010
Updated	10 Mar 2010
Type	Conference
Year	2007
Where	EUROSYS
Authors	David K. Tam, Reza Azimi, Michael Stumm

Sciweavers

Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors

Cache Line | Cross-chip Cache Accesses | EUROSYS 2007 | Performance Monitoring Units | System Software |

Explore & Download

Productivity Tools

Sciweavers