Sciweavers

SC
2004
ACM

Analysis and Performance Results of a Molecular Modeling Application on Merrimac

14 years 5 months ago
Analysis and Performance Results of a Molecular Modeling Application on Merrimac
The Merrimac supercomputer uses stream processors and a highradix network to achieve high performance at low cost and low power. The stream architecture matches the capabilities of modern semiconductor technology with compute-intensive parallel applications. We present a detailed case study of porting the GROMACS molecular-dynamics force calculation to Merrimac. The characteristics of the architecture which stress locality, parallelism, and decoupling of memory operations and computation, allow for high performance of compiler optimized code. The rich set of hardware memory operations and the ample computation bandwidth of the Merrimac processor present a wide range of algorithmic trade-offs and optimizations which may be generalized to several scientific computing domains. We use a cycle-accurate hardware simulator to analyze the performance bottlenecks of the various implementations and to measure application run-time. A comparison with the highly optimized GROMACS code, tuned for ...
Mattan Erez, Jung Ho Ahn, Ankit Garg, William J. D
Added 30 Jun 2010
Updated 30 Jun 2010
Type Conference
Year 2004
Where SC
Authors Mattan Erez, Jung Ho Ahn, Ankit Garg, William J. Dally, Eric Darve
Comments (0)