Computer Science | Sciweavers

74

PPOPP
2015
ACM

4views Distributed and Parallel Com...» more PPOPP 2015»

Predicate RCU: an RCU for scalable concurrent updates

9 years 10 months ago

Read-copy update (RCU) is a shared memory synchronization mechanism with scalable synchronization-free reads that nevertheless execute correctly with concurrent updates. To guaran...

Maya Arbel, Adam Morrison

claim paper

Read More »

81

click to vote

PPOPP
2015
ACM

16views Distributed and Parallel Com...» more PPOPP 2015»

A collection-oriented programming model for performance portability

9 years 10 months ago

Download www.cs.utah.edu

This paper describes Surge, a collection-oriented programming model that enables programmers to compose parallel computations using nested high-level data collections and operator...

Saurav Muralidharan, Michael Garland, Bryan C. Cat...

claim paper

Read More »

69

click to vote

PPOPP
2015
ACM

9views Distributed and Parallel Com...» more PPOPP 2015»

Diagnosing the causes and severity of one-sided message contention

9 years 10 months ago

Download sc14.supercomputing.org

Nathan R. Tallent, Abhinav Vishnu, Hubertus van Da...

claim paper

Read More »

82

click to vote

PPOPP
2015
ACM

8views Distributed and Parallel Com...» more PPOPP 2015»

Optimization for performance and energy for batched matrix computations on GPUs

9 years 10 months ago

Download www.netlib.org

As modern hardware keeps evolving, an increasingly eﬀective approach to develop energy eﬃcient and high-performance solvers is to design them to work on many small size indepe...

Azzam Haidar, Tingxing Dong, Piotr Luszczek, Stani...

claim paper

Read More »

69

click to vote

PPOPP
2015
ACM

7views Distributed and Parallel Com...» more PPOPP 2015»

Effects of source-code optimizations on GPU performance and energy consumption

9 years 10 months ago

Download cs.txstate.edu

This paper studies the effects of source-code optimizations on the performance, power draw, and energy consumption of a modern compute GPU. We evaluate 128 versions of two n-body ...

Jared Coplin, Martin Burtscher

claim paper

Read More »

67

click to vote

PPOPP
2015
ACM

13views Distributed and Parallel Com...» more PPOPP 2015»

More than you ever wanted to know about synchronization: synchrobench, measuring the impact of the synchronization on concurrent

9 years 10 months ago

Download sydney.edu.au

In this paper, we present the most extensive comparison of synchronization techniques. We evaluate 5 different synchronization techniques through a series of 31 data structure alg...

Vincent Gramoli

claim paper

Read More »

76

click to vote

PPOPP
2015
ACM

5views Distributed and Parallel Com...» more PPOPP 2015»

SYNC or ASYNC: time to fuse for distributed graph-parallel computation

9 years 10 months ago

Download ipads.se.sjtu.edu.cn

Large-scale graph-structured computation usually exhibits iterative and convergence-oriented computing nature, where input data is computed iteratively until a convergence conditi...

Chenning Xie, Rong Chen, Haibing Guan, Binyu Zang,...

claim paper

Read More »

73

click to vote

PPOPP
2015
ACM

8views Distributed and Parallel Com...» more PPOPP 2015»

A library for portable and composable data locality optimizations for NUMA systems

9 years 10 months ago

Download e-collection.library.ethz.ch

Many recent multiprocessor systems are realized with a nonuniform memory architecture (NUMA) and accesses to remote memory locations take more time than local memory accesses. Opt...

Zoltan Majo, Thomas R. Gross

claim paper

Read More »

84

click to vote

PPOPP
2015
ACM

15views Distributed and Parallel Com...» more PPOPP 2015»

Automatic scalable atomicity via semantic locking

9 years 10 months ago

Download labs.yahoo.com

In this paper, we consider concurrent programs in which the shared nsists of instances of linearizable ADTs (abstract data types). We present an automated approach to concurrency ...

Guy Golan-Gueta, G. Ramalingam, Mooly Sagiv, Eran ...

claim paper

Read More »

85

click to vote

PPOPP
2015
ACM

25views Distributed and Parallel Com...» more PPOPP 2015»

RaftLib: a C++ template library for high performance stream parallel processing

9 years 10 months ago

Download www.cs.wustl.edu

Stream processing or data-ﬂow programming is a compute paradigm that has been around for decades in many forms yet has failed garner the same attention as other mainstream langu...

Jonathan C. Beard, Peng Li, Roger D. Chamberlain

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers