Distributed and Parallel Computing

PPOPP
2015
ACM

Exploiting communication concurrency on high performance computing systems

10 years 2 months ago

Although logically available, applications may not exploit enough instantaneous communication concurrency to maximize hardware utilization on HPC systems. This is exacerbated in h...

Nicholas Chaimov, Khaled Z. Ibrahim, Samuel Willia...

PPOPP
2015
ACM

GPU-SM: shared memory multi-GPU programming

10 years 2 months ago

Discrete GPUs in modern multi-GPU systems can transparently access each other’s memories through the PCIe interconnect. Future systems will improve this capability by including ...

Javier Cabezas, Marc Jordà, Isaac Gelado, N...

PPOPP
2015
ACM

Rethinking the parallelization of random-restart hill climbing: a case study in optimizing a 2-opt TSP solver for GPU execution

10 years 2 months ago

Molly A. O'Neil, Martin Burtscher

PPOPP
2015
ACM

Barrier elision for production parallel programs

10 years 2 months ago

Large scientific code bases are often composed of several layers of runtime libraries, implemented in multiple programming languages. In such situation, programmers often choose ...

Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushi...

PPOPP
2015
ACM

The SprayList: a scalable relaxed priority queue

10 years 2 months ago

High-performance concurrent priority queues are essential for applications such as task scheduling and discrete event simulation. Unfortunately, even the best performing implement...

Dan Alistarh, Justin Kopinsky, Jerry Li, Nir Shavi...

PPOPP
2015
ACM

A performance study of Java garbage collectors on multicore architectures

10 years 2 months ago

In the last few years, managed runtime environments such as the Java Virtual Machine (JVM) have been increasingly used on large-scale multicore servers. The garbage collector (GC) repre...

Maria Carpen Amarie, Patrick Marlier, Pascal Felbe...

PPOPP
2015
ACM

NUMA-aware graph-structured analytics

10 years 2 months ago

Graph-structured analytics has been widely adopted in a number of big data applications such as social computation, web-search and recommendation systems. Though much prior resear...

Kaiyuan Zhang, Rong Chen, Haibo Chen

PPOPP
2015
ACM

Optimization of asynchronous graph processing on GPU with hybrid coloring model

10 years 2 months ago

Modern GPUs have been widely used to accelerate the graph processing for complicated computational problems regarding graph theory. Many parallel graph algorithms adopt the asynch...

Xuanhua Shi, Junling Liang, Sheng Di, Bingsheng He...

PPOPP
2015
ACM

Section based program analysis to reduce overhead of detecting unsynchronized thread communication

10 years 2 months ago

Madan Das, Gabriel Southern, Jose Renau

PPOPP
2015
ACM

Stochastic gradient descent on GPUs

10 years 2 months ago

Irregular algorithms such as Stochastic Gradient Descent (SGD) can benefit from the massive parallelism available on GPUs. However, unlike in data-parallel algorithms, synchroniz...

Rashid Kaleem, Sreepathi Pai, Keshav Pingali
