Sciweavers

CCGRID
2008
IEEE

MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics

14 years 6 months ago
MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics
The advances in multicore technology and modern interconnects is rapidly accelerating the number of cores deployed in today’s commodity clusters. A majority of parallel applications written in MPI employ collective operations in their communication kernels. Optimization of these operations on the multicore platforms is the key to obtaining good performance speed-ups. However, designing these operations on the modern multicores is a non-trivial task. Modern multicores such as Intel’s Clovertown and AMD’s Opteron feature various architectural attributes resulting in interesting ramifications. For example, Clovertown deploys shared L2 caches for a pair of cores whereas in Opteron, L2 caches are exclusive to a core. Understanding the impact of these architectures on communication performance is crucial to designing efficient collective algorithms. In this paper, we systematically evaluate these architectures and use these insights to develop efficient collective operations such a...
Amith R. Mamidala, Rahul Kumar, Debraj De, Dhabale
Added 29 May 2010
Updated 29 May 2010
Type Conference
Year 2008
Where CCGRID
Authors Amith R. Mamidala, Rahul Kumar, Debraj De, Dhabaleswar K. Panda
Comments (0)