Sciweavers

ISPASS
2010
IEEE

Memphis: Finding and fixing NUMA-related performance problems on multi-core platforms

14 years 6 months ago
Memphis: Finding and fixing NUMA-related performance problems on multi-core platforms
—Until recently, most high-end scientific applications have been immune to performance problems caused by NonUniform Memory Access (NUMA). However, current trends in micro-processor design are pushing NUMA to smaller and smaller scales. This paper examines the current state of NUMA and makes several contributions. First, we summarize the performance problems that NUMA can present for multithreaded applications and describe methods of addressing them. Second, we demonstrate that NUMA can indeed be a significant problem for scientific applications, showing that it can mean the difference between an application scaling perfectly and failing to scale at all. Third, we describe, in increasing order of usefulness, three methods of using hardware performance counters to aid in finding NUMA-related problems. Finally, we introduce Memphis, a data-centric toolset that uses Instruction Based Sampling to help pinpoint problematic memory accesses, and demonstrate how we used it to improve the per...
Collin McCurdy, Jeffrey S. Vetter
Added 17 May 2010
Updated 17 May 2010
Type Conference
Year 2010
Where ISPASS
Authors Collin McCurdy, Jeffrey S. Vetter
Comments (0)