Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

181

CC
2012
Springer

250views System Software» more CC 2012»

Improving Performance of OpenCL on CPUs

14 years 25 days ago

Improving Performance of OpenCL on CPUs

Download www.cdl.uni-saarland.de

Abstract. Data-parallel languages like OpenCL and CUDA are an important means to exploit the computational power of today’s computing devices. In this paper, we deal with two aspects of implementing such languages on CPUs: First, we present a static analysis and an accompanying optimization to exclude code regions from control-ﬂow to dataﬂow conversion, which is the commonly used technique to leverage vector instruction sets. Second, we present a novel technique to implement barrier synchronization. We evaluate our techniques in a custom OpenCL CPU driver which is compared to itself in diﬀerent conﬁgurations and to proprietary implementations by AMD and Intel. We achieve an average

Ralf Karrenberg, Sebastian Hack

Real-time Traffic

CC 2012 | Parallel Languages | Proprietary Implementations | System Software | Vector Instruction |

claim paper

Related Content

» Efficient compilation of finegrained SPMDthreaded programs for multicore CPUs

» Programming Massively Parallel Architectures using MARTE a Case Study

» Sponge portable stream programming on graphics engines

» GRace a lowoverhead mechanism for detecting data races in GPU programs

» Speculative execution on multiGPU systems

» FPGAs vs CPUs trends in peak floatingpoint performance

» The Scalable Heterogeneous Computing SHOC benchmark suite

» A Parallel SPH Implementation on MultiCore CPUs

» Coordinating the use of GPU and CPU for improving performance of compute intensive applica...

Post Info
More Details (n/a)

Added	20 Apr 2012
Updated	20 Apr 2012
Type	Journal
Year	2012
Where	CC
Authors	Ralf Karrenberg, Sebastian Hack

Comments (0)