Optimization of MPI collective communication on BlueGene/L systems

16 years 28 days ago

Download www.cs.brown.edu

BlueGene/L is currently the world’s fastest supercomputer. It consists of a large number of low power dual-processor compute nodes interconnected by high speed torus and collective networks. Because compute nodes do not have shared memory, MPI is the the natural programming model for this machine. The BlueGene/L MPI library is a port of MPICH2. In this paper we discuss the implementation of MPI collectives on BlueGene/L. The MPICH2 implementation of MPI collectives is based on point-to-point communication primitives. This turns out to be suboptimal for a number of reasons. Machine-optimized MPI collectives are necessary to harness the performance of BlueGene/L. We discuss these optimized MPI collectives, describing the algorithms and presenting performance results measured with targeted micro-benchmarks on real BlueGene/L hardware with up to 4096 compute nodes. Categories and Subject Descriptors C.4 [Computer Systems Organization]: Performance

George Almási, Philip Heidelberger, Charles

Real-time Traffic

Compute Nodes | ICS 2005 | Machine-optimized MPI Collectives | MPI Collectives | Theoretical Computer Science |

claim paper

» Astronomical realtime streaming signal processing on a Blue GeneL supercomputer

» Parallel genomic sequencesearch on a massively parallel system

» Link Shutdown Opportunities During Collective Communications in 3D Torus Nets

» Collective communication on architectures that support simultaneous communication over mul...

» Open Issues in MPI Implementation

» A tracedriven emulation framework to predict scalability of large clusters in presence of ...

» Performance characterization of molecular dynamics techniques for biomolecular simulations

» Topologyaware task mapping for reducing communication contention on large parallel machine...

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ICS
Authors	George Almási, Philip Heidelberger, Charles Archer, Xavier Martorell, C. Christopher Erway, José E. Moreira, Burkhard D. Steinmacher-Burow, Yili Zheng

Comments (0)

Sciweavers

Optimization of MPI collective communication on BlueGene/L systems

Compute Nodes | ICS 2005 | Machine-optimized MPI Collectives | MPI Collectives | Theoretical Computer Science |

Explore & Download

Productivity Tools

Sciweavers