Sciweavers

2932 search results - page 29 / 587
» Optimizing Memory System Performance for Communication in Pa...
Sort
View
PVM
2010
Springer
13 years 6 months ago
Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems
With the ever-increasing numbers of cores per node on HPC systems, applications are increasingly using threads to exploit the shared memory within a node, combined with MPI across ...
Gábor Dózsa, Sameer Kumar, Pavan Bal...
HPCC
2007
Springer
14 years 2 months ago
Parallel Performance Prediction for Multigrid Codes on Distributed Memory Architectures
We propose a model for describing the parallel performance of multigrid software on distributed memory architectures. The goal of the model is to allow reliable predictions to be m...
Giuseppe Romanazzi, Peter K. Jimack
PPOPP
1997
ACM
13 years 11 months ago
Shared Memory Performance Profiling
This paper describes a new approach to finding performance bottlenecks in shared-memory parallel programs and its embodiment in the Paradyn Parallel Performance Tools running with...
Zhichen Xu, James R. Larus, Barton P. Miller
PACT
2001
Springer
14 years 10 days ago
Optimizing Metacomputing with Communication-Computation Overlap
In the framework of distributed object systems, this paper presents the concepts and an implementation of an overlapping mechanism between communication and computation. This mecha...
Françoise Baude, Denis Caromel, Nathalie Fu...
IPPS
2007
IEEE
14 years 2 months ago
Evaluation of Remote Memory Access Communication on the Cray XT3
This paper evaluates remote memory access (RMA) communication capabilities and performance on the Cray XT3. We discuss properties of the network hardware and Portals networking so...
Vinod Tipparaju, Andriy Kot, Jarek Nieplocha, Moni...