Sciweavers

270 search results - page 5 / 54
» A decomposition approach for optimizing the performance of M...
Sort
View
PDCN
2004
13 years 10 months ago
A tunable collective communication framework on a cluster of SMPS
In this paper we investigate a tunable MPI collective communications library on a cluster of SMPs. Most tunable collective communications libraries select optimal algorithms for i...
Meng-Shiou Wu, Ricky A. Kendall, Srinivas Aluru
LCN
2002
IEEE
14 years 1 months ago
Improving Processor Availability in the MPI Implementation for the ASCI/Red Supercomputer
This paper describes how a portable benchmark suite that measures the ability of an MPI implementation to overlap computation and communication can be used to discover and diagnos...
Ron Brightwell, William Lawry, Arthur B. Maccabe, ...
IPPS
2002
IEEE
14 years 1 months ago
Effective Cross-Platform, Multilevel Parallelism via Dynamic Adaptive Execution
This paper presents preliminary efforts to develop compilation and execution environments that achieve performance portability of multilevel parallelization on hierarchical archit...
Walden Ko, Mark N. Yankelevsky, Dimitrios S. Nikol...
ICCCN
2008
IEEE
14 years 3 months ago
Instrumentation and Analysis of MPI Queue Times on the SeaStar High-Performance Network
—Understanding the communication behavior and network resource usage of parallel applications is critical to achieving high performance and scalability on systems with tens of th...
Ron Brightwell, Kevin T. Pedretti, Kurt B. Ferreir...
TPDS
2002
126views more  TPDS 2002»
13 years 8 months ago
P-3PC: A Point-to-Point Communication Model for Automatic and Optimal Decomposition of Regular Domain Problems
One of the most fundamental problems automatic parallelization tools are confronted with is to find an optimal domain decomposition for a given application. For regular domain prob...
Frank J. Seinstra, Dennis Koelma