A Fully Adaptive Fault-Tolerant Routing Methodology Based on Intermediate Nodes

16 years 26 days ago

Download www.disca.upv.es

Massively parallel computing systems are being built with thousands of nodes. Because of the high number of components, it is critical to keep these systems running even in the presence of failures. Interconnection networks play a key-role in these systems, and this paper proposes a fault-tolerant routing methodology for use in such networks. The methodology supports any minimal routing function (including fully adaptive routing), does not degrade performance in the absence of faults, does not disable any healthy node, and is easy to implement both in meshes and tori. In order to avoid network failures, the methodology uses a simple mechanism: for some source-destination pairs, packets are forwarded to the destination node through a set of intermediate nodes (without being ejected from the network). The methodology is shown to tolerate a large number of faults (e.g., ﬁve/nine faults when using two/three intermediate nodes in a 3D torus). Furthermore, the methodology oﬀers a graciou...

Nils Agne Nordbotten, María Engracia G&oacu

Real-time Traffic

Fault-tolerant Routing Methodology | Intermediate Nodes | Methodology | NPC 2004 |

claim paper

» Fault Tolerant Network on Chip Switching With Graceful Performance Degradation

Post Info
More Details (n/a)

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	NPC
Authors	Nils Agne Nordbotten, María Engracia Gómez, Jose Flich, Pedro López, Antonio Robles, Tor Skeie, Olav Lysne, José Duato

Comments (0)

Sciweavers

A Fully Adaptive Fault-Tolerant Routing Methodology Based on Intermediate Nodes

Fault-tolerant Routing Methodology | Intermediate Nodes | Methodology | NPC 2004 |

Explore & Download

Productivity Tools

Sciweavers