Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems

With the ever-increasing numbers of cores per node on HPC systems, applications are increasingly using threads to exploit the shared memory within a node, combined with MPI across nodes. Achieving high performance when a large number of concurrent threads make MPI calls is a challenging task for an MPI implementation. We describe the design and implementation of our solution in MPICH2 to achieve high-performance multithreaded communication on the IBM Blue Gene/P. We use a combination of a multichannel-enabled network interface, fine-grained locks, lock-free atomic operations, and specially designed queues to provide a high degree of concurrent access while still maintaining MPI’s message-ordering semantics. We present performance results that demonstrate that our new design improves the multithreaded message rate by a factor of 3.6 compared with the existing implementation on the BG/P. Our solutions are also applicable to other high-end systems that have parallel network access capabi...
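The abstract names fine-grained locks and lock-free atomic operations as building blocks. The following is only a minimal sketch of those two general ideas, not the MPICH2 or Blue Gene/P code: it assumes a hypothetical fixed set of channels, each guarded by its own mutex, plus one atomic counter that hands out sequence numbers without a lock. All names (`channel_t`, `run_senders`, `NUM_CHANNELS`) are invented for illustration.

```c
#include <assert.h>
#include <pthread.h>
#include <stdatomic.h>
#include <stdlib.h>

#define NUM_CHANNELS 4   /* hypothetical number of network channels */

/* One channel with its own lock, so threads that use different channels
 * never contend on the same mutex (fine-grained locking). */
typedef struct {
    pthread_mutex_t lock;
    long enqueued;       /* messages queued on this channel; guarded by lock */
} channel_t;

static channel_t channels[NUM_CHANNELS];
static atomic_long next_seq;   /* global sequence counter, updated lock-free */

typedef struct {
    int id;      /* sender thread index */
    int nmsgs;   /* number of messages this thread "sends" */
} sender_arg_t;

static void *sender(void *p)
{
    sender_arg_t *arg = p;
    /* Assumption: each thread is statically mapped to one channel. */
    channel_t *ch = &channels[arg->id % NUM_CHANNELS];

    for (int i = 0; i < arg->nmsgs; i++) {
        /* Lock-free step: claim a unique sequence number. */
        atomic_fetch_add(&next_seq, 1);

        /* Fine-grained step: only this channel's lock is taken. */
        pthread_mutex_lock(&ch->lock);
        ch->enqueued++;
        pthread_mutex_unlock(&ch->lock);
    }
    return NULL;
}

/* Runs nthreads senders (nthreads > 0) and returns the total number of
 * messages recorded across all channels. */
long run_senders(int nthreads, int nmsgs)
{
    pthread_t *tids = malloc(nthreads * sizeof *tids);
    sender_arg_t *args = malloc(nthreads * sizeof *args);
    assert(tids != NULL && args != NULL);

    atomic_store(&next_seq, 0);
    for (int c = 0; c < NUM_CHANNELS; c++) {
        pthread_mutex_init(&channels[c].lock, NULL);
        channels[c].enqueued = 0;
    }

    for (int t = 0; t < nthreads; t++) {
        args[t].id = t;
        args[t].nmsgs = nmsgs;
        int rc = pthread_create(&tids[t], NULL, sender, &args[t]);
        assert(rc == 0);
    }
    for (int t = 0; t < nthreads; t++)
        pthread_join(tids[t], NULL);

    long total = 0;
    for (int c = 0; c < NUM_CHANNELS; c++) {
        total += channels[c].enqueued;
        pthread_mutex_destroy(&channels[c].lock);
    }

    /* Every claimed sequence number corresponds to one queued message. */
    assert(total == atomic_load(&next_seq));

    free(tids);
    free(args);
    return total;
}
```

Calling `run_senders(8, 1000)` starts eight threads spread over four channels. The per-channel mutexes keep threads on different channels from serializing against each other, while the atomic counter gives every message a unique number that a receiver could, in principle, use to restore ordering. How the actual design maintains MPI ordering is described in the paper itself, not in this sketch.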

Distributed And Parallel Computing | Implementation | Lock-free Atomic Operations | MPI Implementation | PVM 2010 |

Post Info

Added	30 Jan 2011
Updated	30 Jan 2011
Type	Journal
Year	2010
Where	PVM
Authors	Gábor Dózsa, Sameer Kumar, Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Joe Ratterman, Rajeev Thakur
