Memory management in NUMA multicore systems: trapped between cache contention and interconnect overhead

14 years 8 months ago

Download people.inf.ethz.ch

Multiprocessors based on processors with multiple cores usually include a non-uniform memory architecture (NUMA); even current 2-processor systems with 8 cores exhibit non-uniform memory access times. As the cores of a processor share a common cache, the issues of memory management and process mapping must be revisited. We ﬁnd that optimizing only for data locality can counteract the beneﬁts of cache contention avoidance and vice versa. Therefore, system software must take both data locality and cache contention into account to achieve good performance, and memory management cannot be decoupled from process scheduling. We present a detailed analysis of a commercially available NUMA-multicore architecture, the Intel Nehalem. We describe two scheduling algorithms: maximum-local, which optimizes for maximum data locality, and its extension, N-MASS, which reduces data locality to avoid the performance degradation caused by cache contention. N-MASS is ﬁne-tuned to support memory mana...

Zoltan Majo, Thomas R. Gross

Real-time Traffic

Hardware | IWMM 2011 | Memory Architecture | Resource Contention | Uniform Memory Access |

claim paper

Added	15 Sep 2011
Updated	15 Sep 2011
Type	Journal
Year	2011
Where	IWMM
Authors	Zoltan Majo, Thomas R. Gross

Sciweavers

Memory management in NUMA multicore systems: trapped between cache contention and interconnect overhead

Hardware | IWMM 2011 | Memory Architecture | Resource Contention | Uniform Memory Access |

Explore & Download

Productivity Tools

Sciweavers