Sciweavers

IPPS
2010
IEEE
13 years 4 months ago
Restructuring parallel loops to curb false sharing on multicore architectures
The memory hierarchy of most multicore systems contains one or more levels of cache that is shared among multiple cores. The shared-cache architecture presents many opportunities f...
Santosh Sarangkar, Apan Qasem
IPPS
2010
IEEE
13 years 4 months ago
Overlapping computation and communication: Barrier algorithms and ConnectX-2 CORE-Direct capabilities
Abstract--This paper explores the computation and communication overlap capabilities enabled by the new CORE-Direct hardware capabilities introduced in the InfiniBand (IB) Host Cha...
Richard L. Graham, Stephen W. Poole, Pavel Shamis,...
IPPS
2010
IEEE
13 years 4 months ago
Out-of-core distribution sort in the FG programming environment
We describe the implementation of an out-of-core, distribution-based sorting program on a cluster using FG, a multithreaded programming framework. FG mitigates latency from disk-I/...
Priya Natarajan, Thomas H. Cormen, Elena Riccio St...
IPPS
2010
IEEE
13 years 4 months ago
A parallel Particle swarm optimization algorithm for option pricing
Hari Prasain, Girish K. Jha, Parimala Thulasiraman...
IPPS
2010
IEEE
13 years 4 months ago
Fast binding site mapping using GPUs and CUDA
Bharat Sukhwani, Martin C. Herbordt
21
Voted
IPPS
2010
IEEE
13 years 4 months ago
Hybrid MPI/Pthreads parallelization of the RAxML phylogenetics code
Abstract--A hybrid MPI/Pthreads parallelization was implemented in the RAxML phylogenetics code. New MPI code was added to the existing Pthreads production code to exploit parallel...
Wayne Pfeiffer, Alexandros Stamatakis