IWMM
2011
Springer

Memory management in NUMA multicore systems: trapped between cache contention and interconnect overhead

Multiprocessors based on processors with multiple cores usually include a non-uniform memory architecture (NUMA); even current 2-processor systems with 8 cores exhibit non-uniform memory access times. As the cores of a processor share a common cache, the issues of memory management and process mapping must be revisited. We find that optimizing only for data locality can counteract the benefits of cache contention avoidance and vice versa. Therefore, system software must take both data locality and cache contention into account to achieve good performance, and memory management cannot be decoupled from process scheduling. We present a detailed analysis of a commercially available NUMA-multicore architecture, the Intel Nehalem. We describe two scheduling algorithms: maximum-local, which optimizes for maximum data locality, and its extension, N-MASS, which reduces data locality to avoid the performance degradation caused by cache contention. N-MASS is fine-tuned to support memory mana...
Zoltan Majo, Thomas R. Gross
Added: 15 Sep 2011
Updated: 15 Sep 2011
Type: Journal
Year: 2011
Where: IWMM
Authors: Zoltan Majo, Thomas R. Gross