Distributed and Parallel Computing

154

IPPS
2010
IEEE

104views Distributed And Parallel Com...» more IPPS 2010»

Performance and energy optimization of concurrent pipelined applications

15 years 3 months ago

In this paper, we study the problem of finding optimal mappings for several independent but concurrent workflow applications, in order to optimize performance-related criteria tog...

Anne Benoit, Paul Renaud-Goud, Yves Robert

claim paper

Read More »

180

click to vote

IPPS
2010
IEEE

103views Distributed And Parallel Com...» more IPPS 2010»

Varying bandwidth resource allocation problem with bag constraints

15 years 3 months ago

Download www.cse.iitd.ernet.in

We consider the problem of scheduling jobs on a pool of machines. Each job requires multiple machines on which it executes in parallel. For each job, the input specifies release ti...

Venkatesan T. Chakaravarthy, Vinayaka Pandit, Yogi...

claim paper

Read More »

164

click to vote

IPPS
2010
IEEE

175views Distributed And Parallel Com...» more IPPS 2010»

Consistency in hindsight: A fully decentralized STM algorithm

15 years 3 months ago

Download www.so.in.tum.de

Abstract--Software transactional memory (STM) algorithms often rely on centralized components to achieve atomicity, isolation and consistency. In a distributed setting, centralized...

Annette Bieniusa, Thomas Fuhrmann

claim paper

Read More »

101

click to vote

IPPS
2010
IEEE

94views Distributed And Parallel Com...» more IPPS 2010»

Direct self-consistent field computations on GPU clusters

15 years 3 months ago

Download www.ipdps.org

Guochun Shi, Volodymyr V. Kindratenko, Ivan S. Ufi...

claim paper

Read More »

190

click to vote

IPPS
2010
IEEE

203views Distributed And Parallel Com...» more IPPS 2010»

Tile QR factorization with parallel panel processing for multicore architectures

15 years 3 months ago

Download icl.cs.utk.edu

To exploit the potential of multicore architectures, recent dense linear algebra libraries have used tile algorithms, which consist in scheduling a Directed Acyclic Graph (DAG) of...

Bilel Hadri, Hatem Ltaief, Emmanuel Agullo, Jack D...

claim paper

Read More »

123

click to vote

IPPS
2010
IEEE

161views Distributed And Parallel Com...» more IPPS 2010»

Optimal loop unrolling for GPGPU programs

15 years 3 months ago

Download etd.ohiolink.edu

Giridhar Sreenivasa Murthy, Mahesh Ravishankar, Mu...

claim paper

Read More »

208

click to vote

IPPS
2010
IEEE

209views Distributed And Parallel Com...» more IPPS 2010»

Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs

15 years 3 months ago

Download gcl.cis.udel.edu

The advent of general purpose graphics processing units (GPGPU's) brings about a whole new platform for running numerically intensive applications at high speeds. Their multi-...

Michela Taufer, Omar Padron, Philip Saponaro, Sand...

claim paper

Read More »

133

click to vote

IPPS
2010
IEEE

105views Distributed And Parallel Com...» more IPPS 2010»

Linpack evaluation on a supercomputer with heterogeneous accelerators

15 years 3 months ago

Download matsu-www.is.titech.ac.jp

We report Linpack benchmark results on the TSUBAME supercomputer, a large scale heterogeneous system equipped with NVIDIA Tesla GPUs and ClearSpeed SIMD accelerators. With all of 1...

Toshio Endo, Akira Nukada, Satoshi Matsuoka, Naoya...

claim paper

Read More »

158

click to vote

IPPS
2010
IEEE

154views Distributed And Parallel Com...» more IPPS 2010»

Dynamic analysis of the relay cache-coherence protocol for distributed transactional memory

15 years 3 months ago

Download www.real-time.ece.vt.edu

Transactional memory is an alternative programming model for managing contention in accessing shared in-memory data objects. Distributed transactional memory (TM) promises to alle...

Bo Zhang, Binoy Ravindran

claim paper

Read More »

118

click to vote

IPPS
2010
IEEE

118views Distributed And Parallel Com...» more IPPS 2010»

Dynamic fractional resource scheduling for HPC workloads

15 years 3 months ago

$Dynamic fractional resource scheduling for HPC workloads$