Using Multirail Networks in High-Performance Clusters

15 years 5 months ago

Download hpc.pnl.gov

Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault tolerance of current high-performance parallel computers. In this paper we present and analyze various algorithms to allocate multiple communication rails, including static and dynamic allocation schemes. An analytical lower bound on the number of rails required for static rail allocation is shown. We also present an extensive experimental comparison of the behavior of various algorithms in terms of bandwidth and latency. We show that striping messages over multiple rails can substantially reduce network latency, depending on average message size, network load, and allocation scheme. The compared methods include a static rail allocation, a basic round-robin rail allocation, a local-dynamic allocation based on local knowledge, and a dynamic rail allocation that reserves both communication endpoints of a message before sending it. The last method is show...

Salvador Coll, Eitan Frachtenberg, Fabrizio Petrin

Real-time Traffic

Allocation Scheme | CLUSTER 2001 | Distributed And Parallel Computing | Rail Allocation | Static Rail Allocation |

claim paper

» A multicoreenabled multirail communication engine

» Toward Understanding Soft Faults in High Performance Cluster Networks

» Using CMT in SCTPBased MPI to Exploit Multiple Interfaces in Cluster Nodes

» Predictive models for bandwidth sharing in high performance clusters

» Telegraphos HighPerformance Networking for Parallel Processing on Workstation Clusters

» Madeleine II a Portable and Efficient Communication Library for HighPerformance Cluster Co...

» Contextaware address translation for high performance SMP cluster system

» High Performance Relay Mechanism for MPI Communication Libraries Run on Multiple Private I...

Post Info
More Details (n/a)

Added	23 Aug 2010
Updated	23 Aug 2010
Type	Conference
Year	2001
Where	CLUSTER
Authors	Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits

Comments (0)

Sciweavers

Using Multirail Networks in High-Performance Clusters

Allocation Scheme | CLUSTER 2001 | Distributed And Parallel Computing | Rail Allocation | Static Rail Allocation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers