Search Sciweavers | Sciweavers

50

PPOPP
2009
ACM

358views Distributed and Parallel Com...» more PPOPP 2009»

OpenMP to GPGPU: a compiler framework for automatic translation and optimization

14 years 10 months ago

GPGPUs have recently emerged as powerful vehicles for generalpurpose high-performance computing. Although a new Compute Unified Device Architecture (CUDA) programming model from N...

Seyong Lee, Seung-Jai Min, Rudolf Eigenmann

claim paper

Read More »

69

click to vote

SIGMOD
2009
ACM

136views Database» more SIGMOD 2009»

A comparison of approaches to large-scale data analysis

14 years 10 months ago

Download database.cs.brown.edu

There is currently considerable enthusiasm around the MapReduce (MR) paradigm for large-scale data analysis [17]. Although the basic control flow of this framework has existed in ...

Andrew Pavlo, Erik Paulson, Alexander Rasin, Danie...

claim paper

Read More »

34

click to vote

SIGMOD
2010
ACM

207views Database» more SIGMOD 2010»

Automatic contention detection and amelioration for data-intensive operations

14 years 2 months ago

Download www.cs.columbia.edu

To take full advantage of the parallelism oﬀered by a multicore machine, one must write parallel code. Writing parallel code is diﬃcult. Even when one writes correct code, the...

John Cieslewicz, Kenneth A. Ross, Kyoho Satsumi, Y...

claim paper

Read More »

37

click to vote

VLSISP
2008

173views more VLSISP 2008»

Fast Bit Gather, Bit Scatter and Bit Permutation Instructions for Commodity Microprocessors

13 years 9 months ago

Download palms.ee.princeton.edu

Advanced bit manipulation operations are not efficiently supported by commodity word-oriented microprocessors. Programming tricks are typically devised to shorten the long sequence...

Yedidya Hilewitz, Ruby B. Lee

claim paper

Read More »

29

click to vote

EUROPAR
2005
Springer

136views Distributed And Parallel Com...» more EUROPAR 2005»

PerfMiner: Cluster-Wide Collection, Storage and Presentation of Application Level Hardware Performance Data

14 years 3 months ago

Download www.cs.utk.edu

Abstract. We present PerfMiner, a system for the transparent collection, storage and presentation of thread-level hardware performance data across an entire cluster. Every sub-proc...

Philip Mucci, Daniel Ahlin, Johan Danielsson, Per ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers