Sciweavers

420 search results - page 23 / 84
» Scalable Parallel Programming with CUDA
ICPP
1997
IEEE
14 years 20 days ago
Automatic Partitioning of Data and Computations on Scalable Shared Memory Multiprocessors
This paper describes an algorithm for deriving data and computation partitions on scalable shared memory multiprocessors. The algorithm establishes affinity relationshi...
Sudarsan Tandri, Tarek S. Abdelrahman
SC
2000
ACM
14 years 25 days ago
Scalable Molecular Dynamics for Large Biomolecular Systems
We present an optimized parallelization scheme for molecular dynamics simulations of large biomolecular systems, implemented in the production-quality molecular dynamics program N...
Robert Brunner, James C. Phillips, Laxmikant V. Ka...
IJHPCA
2007
13 years 8 months ago
A Composition Environment for MPI Programs
While MPI is the most common mechanism for expressing parallelism, MPI programs are not composable by using current MPI process managers or parallel shells. We introduce MPISH2, an...
Narayan Desai, Ewing L. Lusk, Rick Bradshaw
IPPS
1998
IEEE
14 years 1 days ago
Performance Range Comparison via Crossing Point Analysis
Parallel programming is elusive. The relative performance of different parallel implementations varies with machine architecture, system and problem size. How to compare different i...
Xian-He Sun
IPPS
2007
IEEE
14 years 2 months ago
Improving Scalability of OpenMP Applications on Multi-core Systems Using Large Page Support
Modern multi-core architectures have become popular because of the limitations of deep pipelines and heating and power concerns. Some of these multi-core architectures such as the...
Ranjit Noronha, Dhabaleswar K. Panda