IPPS 2010 | Sciweavers

108

click to vote

IPPS
2010
IEEE

94views Distributed And Parallel Com...» more IPPS 2010»

Direct self-consistent field computations on GPU clusters

15 years 4 months ago

Download www.ipdps.org

Guochun Shi, Volodymyr V. Kindratenko, Ivan S. Ufi...

claim paper

Read More »

218

click to vote

IPPS
2010
IEEE

203views Distributed And Parallel Com...» more IPPS 2010»

Tile QR factorization with parallel panel processing for multicore architectures

15 years 4 months ago

Download icl.cs.utk.edu

To exploit the potential of multicore architectures, recent dense linear algebra libraries have used tile algorithms, which consist in scheduling a Directed Acyclic Graph (DAG) of...

Bilel Hadri, Hatem Ltaief, Emmanuel Agullo, Jack D...

claim paper

Read More »

136

click to vote

IPPS
2010
IEEE

161views Distributed And Parallel Com...» more IPPS 2010»

Optimal loop unrolling for GPGPU programs

15 years 4 months ago

Download etd.ohiolink.edu

Giridhar Sreenivasa Murthy, Mahesh Ravishankar, Mu...

claim paper

Read More »

237

click to vote

IPPS
2010
IEEE

209views Distributed And Parallel Com...» more IPPS 2010»

Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs

15 years 4 months ago

Download gcl.cis.udel.edu

The advent of general purpose graphics processing units (GPGPU's) brings about a whole new platform for running numerically intensive applications at high speeds. Their multi-...

Michela Taufer, Omar Padron, Philip Saponaro, Sand...

claim paper

Read More »

152

click to vote

IPPS
2010
IEEE

105views Distributed And Parallel Com...» more IPPS 2010»

Linpack evaluation on a supercomputer with heterogeneous accelerators

15 years 4 months ago

Download matsu-www.is.titech.ac.jp

We report Linpack benchmark results on the TSUBAME supercomputer, a large scale heterogeneous system equipped with NVIDIA Tesla GPUs and ClearSpeed SIMD accelerators. With all of 1...

Toshio Endo, Akira Nukada, Satoshi Matsuoka, Naoya...

claim paper

Read More »

180

click to vote

IPPS
2010
IEEE

154views Distributed And Parallel Com...» more IPPS 2010»

Dynamic analysis of the relay cache-coherence protocol for distributed transactional memory

15 years 4 months ago

Download www.real-time.ece.vt.edu

Transactional memory is an alternative programming model for managing contention in accessing shared in-memory data objects. Distributed transactional memory (TM) promises to alle...

Bo Zhang, Binoy Ravindran

claim paper

Read More »

135

Voted

IPPS
2010
IEEE

118views Distributed And Parallel Com...» more IPPS 2010»

Dynamic fractional resource scheduling for HPC workloads

15 years 4 months ago

$Dynamic fractional resource scheduling for HPC workloads$

Download graal.ens-lyon.fr

Mark Stillwell, Frédéric Vivien, Hen...

claim paper

Read More »

194

click to vote

IPPS
2010
IEEE

174views Distributed And Parallel Com...» more IPPS 2010»

Fine-grained QoS scheduling for PCM-based main memory systems

15 years 4 months ago

Download www.cs.pitt.edu

With wide adoption of chip multiprocessors (CMPs) in modern computers, there is an increasing demand for large capacity main memory systems. The emerging PCM (Phase Change Memory) ...

Ping Zhou, Yu Du, Youtao Zhang, Jun Yang 0002

claim paper

Read More »

175

click to vote

IPPS
2010
IEEE

132views Distributed And Parallel Com...» more IPPS 2010»

Broadcasting on large scale heterogeneous platforms under the bounded multi-port model

15 years 4 months ago

Download hal.inria.fr

We consider the problem of broadcasting a large message in a large scale distributed platform. The message must be sent from a source node, with the help of the receiving peers whi...

Olivier Beaumont, Lionel Eyraud-Dubois, Shailesh K...

claim paper

Read More »

156

click to vote

IPPS
2010
IEEE

117views Distributed And Parallel Com...» more IPPS 2010»

Performance evaluation of concurrent collections on high-performance multicore computing systems

15 years 4 months ago

Download vuduc.org

This paper is the first extensive performance study of a recently proposed parallel programming model, called Concurrent Collections (CnC). In CnC, the programmer expresses her co...

Aparna Chandramowlishwaran, Kathleen Knobe, Richar...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers