Sciweavers

19

PPOPP
2015
ACM

17views Distributed and Parallel Com...» more PPOPP 2015»

Barrier elision for production parallel programs

8 years 6 months ago

Large scientiﬁc code bases are often composed of several layers of runtime libraries, implemented in multiple programming languages. In such situation, programmers often choose ...

Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushi...

claim paper

Read More »

18

click to vote

PPOPP
2015
ACM

7views Distributed and Parallel Com...» more PPOPP 2015»

The SprayList: a scalable relaxed priority queue

8 years 6 months ago

Download www.mit.edu

High-performance concurrent priority queues are essential for applications such as task scheduling and discrete event simulation. Unfortunately, even the best performing implement...

Dan Alistarh, Justin Kopinsky, Jerry Li, Nir Shavi...

claim paper

Read More »

18

click to vote

PPOPP
2015
ACM

7views Distributed and Parallel Com...» more PPOPP 2015»

A performance study of Java garbage collectors on multicore architectures

8 years 6 months ago

Download www-public.tem-tsp.eu

In the last few years, managed runtime environments such as the Java Virtual Machine (JVM) are increasingly used on large-scale multicore servers. The garbage collector (GC) repre...

Maria Carpen Amarie, Patrick Marlier, Pascal Felbe...

claim paper

Read More »

15

click to vote

PPOPP
2015
ACM

17views Distributed and Parallel Com...» more PPOPP 2015»

NUMA-aware graph-structured analytics

8 years 6 months ago

Download 202.120.40.188

Graph-structured analytics has been widely adopted in a number of big data applications such as social computation, web-search and recommendation systems. Though much prior resear...

Kaiyuan Zhang, Rong Chen, Haibo Chen

claim paper

Read More »

19

click to vote

PPOPP
2015
ACM

21views Distributed and Parallel Com...» more PPOPP 2015»

Optimization of asynchronous graph processing on GPU with hybrid coloring model

8 years 6 months ago

Download grid.hust.edu.cn

Modern GPUs have been widely used to accelerate the graph processing for complicated computational problems regarding graph theory. Many parallel graph algorithms adopt the asynch...

Xuanhua Shi, Junling Liang, Sheng Di, Bingsheng He...

claim paper

Read More »

17

click to vote

PPOPP
2015
ACM

5views Distributed and Parallel Com...» more PPOPP 2015»

Section based program analysis to reduce overhead of detecting unsynchronized thread communication

8 years 6 months ago

Download masc.soe.ucsc.edu

Madan Das, Gabriel Southern, Jose Renau

claim paper

Read More »

20

click to vote

PPOPP
2015
ACM

7views Distributed and Parallel Com...» more PPOPP 2015»

Stochastic gradient descent on GPUs

8 years 6 months ago

Download www.cs.utexas.edu

Irregular algorithms such as Stochastic Gradient Descent (SGD) can beneﬁt from the massive parallelism available on GPUs. However, unlike in data-parallel algorithms, synchroniz...

Rashid Kaleem, Sreepathi Pai, Keshav Pingali

claim paper

Read More »

18

click to vote

PPOPP
2015
ACM

16views Distributed and Parallel Com...» more PPOPP 2015»

Supporting multiple accelerators in high-level programming models

8 years 6 months ago

Download rosecompiler.org

Computational accelerators, such as manycore NVIDIA GPUs, Intel Xeon Phi and FPGAs, are becoming common in workstations, servers and supercomputers for scientiﬁc and engineering...

Yonghong Yan 0001, Pei-Hung Lin, Chunhua Liao, Bro...

claim paper

Read More »

18

click to vote

PPOPP
2015
ACM

11views Distributed and Parallel Com...» more PPOPP 2015»

Adaptive GPU cache bypassing

8 years 6 months ago

Download www.computermachines.org

Modern graphics processing units (GPUs) include hardwarecontrolled caches to reduce bandwidth requirements and energy consumption. However, current GPU cache hierarchies are ine�...

Yingying Tian, Sooraj Puthoor, Joseph L. Greathous...

claim paper

Read More »

22

click to vote

PPOPP
2015
ACM

4views Distributed and Parallel Com...» more PPOPP 2015»

Predicate RCU: an RCU for scalable concurrent updates

8 years 6 months ago

Download www.cs.technion.ac.il

Read-copy update (RCU) is a shared memory synchronization mechanism with scalable synchronization-free reads that nevertheless execute correctly with concurrent updates. To guaran...

Maya Arbel, Adam Morrison

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers