PPOPP 2015 | Sciweavers

21

PPOPP
2015
ACM

23views Distributed and Parallel Com...» more PPOPP 2015»

Exploiting communication concurrency on high performance computing systems

8 years 9 months ago

Although logically available, applications may not exploit enough instantaneous communication concurrency to maximize hardware utilization on HPC systems. This is exacerbated in h...

Nicholas Chaimov, Khaled Z. Ibrahim, Samuel Willia...

claim paper

Read More »

22

click to vote

PPOPP
2015
ACM

8views Distributed and Parallel Com...» more PPOPP 2015»

GPU-SM: shared memory multi-GPU programming

8 years 9 months ago

Download impact.crhc.illinois.edu

Discrete GPUs in modern multi-GPU systems can transparently access each other’s memories through the PCIe interconnect. Future systems will improve this capability by including ...

Javier Cabezas, Marc Jordà, Isaac Gelado, N...

claim paper

Read More »

27

click to vote

PPOPP
2015
ACM

10views Distributed and Parallel Com...» more PPOPP 2015»

Rethinking the parallelization of random-restart hill climbing: a case study in optimizing a 2-opt TSP solver for GPU execution

8 years 9 months ago

Download cs.txstate.edu

Molly A. O'Neil, Martin Burtscher

claim paper

Read More »

33

click to vote

PPOPP
2015
ACM

17views Distributed and Parallel Com...» more PPOPP 2015»

Barrier elision for production parallel programs

8 years 9 months ago

Download srl.cs.berkeley.edu

Large scientiﬁc code bases are often composed of several layers of runtime libraries, implemented in multiple programming languages. In such situation, programmers often choose ...

Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushi...

claim paper

Read More »

28

click to vote

PPOPP
2015
ACM

7views Distributed and Parallel Com...» more PPOPP 2015»

The SprayList: a scalable relaxed priority queue

8 years 9 months ago

Download www.mit.edu

High-performance concurrent priority queues are essential for applications such as task scheduling and discrete event simulation. Unfortunately, even the best performing implement...

Dan Alistarh, Justin Kopinsky, Jerry Li, Nir Shavi...

claim paper

Read More »

24

click to vote

PPOPP
2015
ACM

7views Distributed and Parallel Com...» more PPOPP 2015»

A performance study of Java garbage collectors on multicore architectures

8 years 9 months ago

Download www-public.tem-tsp.eu

In the last few years, managed runtime environments such as the Java Virtual Machine (JVM) are increasingly used on large-scale multicore servers. The garbage collector (GC) repre...

Maria Carpen Amarie, Patrick Marlier, Pascal Felbe...

claim paper

Read More »

21

click to vote

PPOPP
2015
ACM

17views Distributed and Parallel Com...» more PPOPP 2015»

NUMA-aware graph-structured analytics

8 years 9 months ago

Download 202.120.40.188

Graph-structured analytics has been widely adopted in a number of big data applications such as social computation, web-search and recommendation systems. Though much prior resear...

Kaiyuan Zhang, Rong Chen, Haibo Chen

claim paper

Read More »

26

click to vote

PPOPP
2015
ACM

21views Distributed and Parallel Com...» more PPOPP 2015»

Optimization of asynchronous graph processing on GPU with hybrid coloring model

8 years 9 months ago

Download grid.hust.edu.cn

Modern GPUs have been widely used to accelerate the graph processing for complicated computational problems regarding graph theory. Many parallel graph algorithms adopt the asynch...

Xuanhua Shi, Junling Liang, Sheng Di, Bingsheng He...

claim paper

Read More »

22

click to vote

PPOPP
2015
ACM

5views Distributed and Parallel Com...» more PPOPP 2015»

Section based program analysis to reduce overhead of detecting unsynchronized thread communication

8 years 9 months ago

Download masc.soe.ucsc.edu

Madan Das, Gabriel Southern, Jose Renau

claim paper

Read More »

28

click to vote

PPOPP
2015
ACM

7views Distributed and Parallel Com...» more PPOPP 2015»

Stochastic gradient descent on GPUs

8 years 9 months ago

Download www.cs.utexas.edu

Irregular algorithms such as Stochastic Gradient Descent (SGD) can beneﬁt from the massive parallelism available on GPUs. However, unlike in data-parallel algorithms, synchroniz...

Rashid Kaleem, Sreepathi Pai, Keshav Pingali

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers