Sciweavers

2932 search results - page 110 / 587
» Optimizing Memory System Performance for Communication in Pa...
Sort
View
JPDC
2006
106views more  JPDC 2006»
13 years 9 months ago
Performance characteristics of the multi-zone NAS parallel benchmarks
We describe a new suite of computational benchmarks that models applications featuring multiple levels of parallelism. Such parallelism is often available in realistic flow comput...
Haoqiang Jin, Rob F. Van der Wijngaart
CCGRID
2008
IEEE
14 years 3 months ago
Experiences with Fine-Grained Distributed Supercomputing on a 10G Testbed
This paper shows how lightpath-based networks can allow challenging, fine-grained parallel supercomputing applications to be run on a grid, using parallel retrograde analysis on ...
Kees Verstoep, Jason Maassen, Henri E. Bal, John W...
IPPS
2005
IEEE
14 years 2 months ago
Optimal Mapping of a Parallel Application Processes onto Heterogeneous Platform
The paper is devoted to analysis of a strategy of computation distribution on heterogeneous parallel systems. According to this strategy processes of parallel program are distribu...
Alexey Kalinov, Sergey Klimov
PARA
2004
Springer
14 years 2 months ago
Improving the Performance of Large-Scale Unstructured PDE Applications
Abstract. This paper investigates two types of overhead due to duplicated local computations, which are frequently encountered in the parallel software of overlapping domain decomp...
Xing Cai
ICPP
1994
IEEE
14 years 1 months ago
Cachier: A Tool for Automatically Inserting CICO Annotations
Shared memory in a parallel computer provides prowith the valuable abstraction of a shared address space--through which any part of a computation can access any datum. Although un...
Trishul M. Chilimbi, James R. Larus