Collective operations in NEC's high-performance MPI libraries

14 years 5 months ago

Download www.cecs.uci.edu

We give an overview of the algorithms and implementations in the high-performance MPI libraries MPI/SX and MPI/ES of some of the most important collective operations of MPI (the Message Passing Interface). The infrastructure of MPI/SX makes it easy to incorporate new algorithms and algorithms for common special cases (e.g. a single SX node, or a single MPI process per SX node). Algorithms that are among the best known are employed, and special hardware features of the SX architecture and Internode Crossbar Switch (IXS) are exploited wherever possible. We discuss in more detail the implementation of MPI Barrier, MPI Bcast, the MPI reduction collectives, MPI Alltoall, and the gather/scatter collectives. Performance ﬁgures and comparisons to straightforward algorithms are given for a large SX-8 system, and for the Earth Simulator. The measurements show excellent absolute performance, and demonstrate the scalability of MPI/SX and MPI/ES to systems with large numbers of nodes.

Hubert Ritzdorf, Jesper Larsson Träff

Real-time Traffic

Algorithms | Distributed And Parallel Computing | IPPS 2006 | Single Mpi Process | SX Node |

claim paper

Post Info
More Details (n/a)

Added	12 Jun 2010
Updated	12 Jun 2010
Type	Conference
Year	2006
Where	IPPS
Authors	Hubert Ritzdorf, Jesper Larsson Träff

Comments (0)

Sciweavers

Collective operations in NEC's high-performance MPI libraries

Algorithms | Distributed And Parallel Computing | IPPS 2006 | Single Mpi Process | SX Node |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers