Sciweavers

420 search results - page 38 / 84
» Scalable Parallel Programming with CUDA
Sort
View
ASPLOS
2010
ACM
14 years 3 months ago
Speculative parallelization using software multi-threaded transactions
With the right techniques, multicore architectures may be able to continue the exponential performance trend that elevated the performance of applications of all types for decades...
Arun Raman, Hanjun Kim, Thomas R. Mason, Thomas B....
HPCC
2010
Springer
13 years 8 months ago
Parallel Computational Modelling of Inelastic Neutron Scattering in Multi-node and Multi-core Architectures
Abstract--This paper examines the initial parallel implementation of SCATTER, a computationally intensive inelastic neutron scattering routine with polycrystalline averaging capabi...
Michael T. Garba, Horacio González-Vé...
SCI
1999
Springer
14 years 22 days ago
Shared Memory Parallelization of the GROMOS96 Molecular Dynamics Code
This paper describes the parallelization of a commercial molecular dynamics simulation code, GROMOS96, on a SCI (Scalable Coherent Interface) interconnected PC cluster. The underly...
Marcus Dormanns
ARCS
2006
Springer
14 years 7 days ago
Large Event Traces in Parallel Performance Analysis
: A powerful and widely-used method for analyzing the performance behavior of parallel programs is event tracing. When an application is traced, performancerelevant events, such as...
Felix Wolf, Felix Freitag, Bernd Mohr, Shirley Moo...
HPDC
2010
IEEE
13 years 9 months ago
Scalability of communicators and groups in MPI
As the number of cores inside compute clusters continues to grow, the scalability of MPI (Message Passing Interface) is important to ensure that programs can continue to execute o...
Humaira Kamal, Seyed M. Mirtaheri, Alan Wagner