MPI provides a portable message-passing interface for many parallel execution platforms, but it can lead to inefficiencies on some platforms and for some applications. In this article we show that the performance of both standard and vendor-specific MPI libraries can be improved by an orthogonal organization of the processors in 2D or 3D meshes and by decomposing the collective communication operations into several phases. We describe an adaptive approach with a configuration phase that determines, for a specific execution platform and a specific MPI library, which decomposition leads to the best performance; this may also depend on the number of processors and the size of the messages to be transferred. The decomposition approach has been implemented in the form of a library extension that is called for each activation of a collective MPI operation. This has the advantage that neither the application programs nor the MPI library need to be changed, while leading to significant performance improvements.
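To make the orthogonal decomposition concrete, the following is a minimal sketch (plain Python rather than MPI, with illustrative mesh dimensions and a hypothetical row-major rank mapping; the paper's actual library extension operates on real MPI communicators): a broadcast on p = rows × cols processors is split into a row phase followed by a column phase.

```python
# Sketch of a two-phase broadcast over an orthogonal 2D processor mesh.
# Assumption (not from the paper): ranks are mapped row-major onto the mesh.

def mesh_coords(rank, cols):
    """Map a linear rank to (row, col) coordinates in a rows x cols mesh."""
    return rank // cols, rank % cols

def two_phase_bcast(root, rows, cols):
    """Return the ranks holding the message after each phase of the
    decomposed broadcast."""
    root_row, _root_col = mesh_coords(root, cols)
    # Phase 1: broadcast within the root's row, so one rank per column
    # holds the message afterwards.
    phase1 = {root_row * cols + c for c in range(cols)}
    # Phase 2: each phase-1 holder broadcasts within its own column,
    # reaching every rank in the mesh.
    phase2 = {r * cols + c for c in range(cols) for r in range(rows)}
    return phase1, phase2

phase1, phase2 = two_phase_bcast(root=0, rows=3, cols=4)
print(sorted(phase1))  # → [0, 1, 2, 3]  (ranks reached after the row phase)
print(len(phase2))     # → 12            (all ranks reached after the column phase)
```

In an MPI implementation, each phase corresponds to a broadcast on a sub-communicator (e.g., one obtained via `MPI_Comm_split` per row and per column), which is exactly what allows the extension library to reuse the underlying library's collectives without modifying them.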
O. Hartmann, Matthias Kühnemann, Thomas Rauber