Sciweavers

SC
2005
ACM

An Application-Based Performance Characterization of the Columbia Supercluster

14 years 5 months ago
An Application-Based Performance Characterization of the Columbia Supercluster
Columbia is a 10,240-processor supercluster consisting of 20 Altix nodes with 512 processors each, and currently ranked as one of the fastest computers in the world. In this paper, we present the performance characteristics of Columbia obtained on up to four computing nodes interconnected via the InfiniBand and/or NUMAlink4 communication fabrics. We evaluate floatingpoint performance, memory bandwidth, message passing communication speeds, and compilers using a subset of the HPC Challenge benchmarks, and some of the NAS Parallel Benchmarks including the multi-zone versions. We present detailed performance results for three scientific applications of interest to NASA, one from molecular dynamics, and two from computational fluid dynamics. Our results show that both the NUMAlink4 and InfiniBand interconnects hold promise for multi-node application scaling to at least 2048 processors.
Rupak Biswas, M. Jahed Djomehri, Robert Hood, Haoq
Added 26 Jun 2010
Updated 26 Jun 2010
Type Conference
Year 2005
Where SC
Authors Rupak Biswas, M. Jahed Djomehri, Robert Hood, Haoqiang Jin, Cetin C. Kiris, Subhash Saini
Comments (0)