Sciweavers

112 search results - page 14 / 23
» Self-consistent MPI Performance Requirements
ICPADS
2010
IEEE
15 years 25 days ago
Effective Performance Measurement at Petascale Using IPM
As supercomputers are being built from an ever increasing number of processing elements, the effort required to achieve a substantial fraction of the system peak performance is con...
Karl Fürlinger, Nicholas J. Wright, David Ski...
SC
2005
ACM
15 years 8 months ago
Performance Modeling and Tuning Strategies of Mixed Mode Collective Communications
On SMP clusters, mixed mode collective MPI communications, which use shared memory communications within SMP nodes and point-to-point communications between SMP nodes, are more effi...
Meng-Shiou Wu, Ricky A. Kendall, Kyle Wright, Zhao...
ICPP
2008
IEEE
15 years 9 months ago
Scalable Dynamic Load Balancing Using UPC
An asynchronous work-stealing implementation of dynamic load balance is implemented using Unified Parallel C (UPC) and evaluated using the Unbalanced Tree Search (UTS) benchmark ...
Stephen Olivier, Jan Prins
ICS
2009
Tsinghua U.
15 years 9 months ago
MPI-aware compiler optimizations for improving communication-computation overlap
Several existing compiler transformations can help improve communication-computation overlap in MPI applications. However, traditional compilers treat calls to the MPI library as ...
Anthony Danalis, Lori L. Pollock, D. Martin Swany,...
ICPP
1996
IEEE
15 years 7 months ago
Parallel Implementation of Cone Beam Tomography
Three dimensional computed tomography is a computationally intensive procedure, requiring large amounts of RAM and processing power. Parallel methods for two dimension...
David A. Reimann, Vipin Chaudhary, Michael J. Flyn...