Search Sciweavers | Sciweavers

33 search results - page 4 / 7

» A memory optimization technique for software-managed scratch...

click to vote

PC
2010

190views Management» more PC 2010»

High-performance cone beam reconstruction using CUDA compatible GPUs

13 years 5 months ago

Download www-hagi.ist.osaka-u.ac.jp

Compute uniﬁed device architecture (CUDA) is a software development platform that allows us to run C-like programs on the nVIDIA graphics processing unit (GPU). This paper prese...

Yusuke Okitsu, Fumihiko Ino, Kenichi Hagihara

claim paper

Read More »

click to vote

CC
2008
Springer

144views System Software» more CC 2008»

Control Flow Emulation on Tiled SIMD Architectures

13 years 9 months ago

Download plg.uwaterloo.ca

Heterogeneous multi-core and streaming architectures such as the GPU, Cell, ClearSpeed, and Imagine processors have better power/ performance ratios and memory bandwidth than tradi...

Ghulam Lashari, Ondrej Lhoták, Michael McCo...

claim paper

Read More »

click to vote

IPPS
2010
IEEE

134views Distributed And Parallel Com...» more IPPS 2010»

Optimization of linked list prefix computations on multithreaded GPUs using CUDA

13 years 5 months ago

Download www.umiacs.umd.edu

We present a number of optimization techniques to compute prefix sums on linked lists and implement them on multithreaded GPUs using CUDA. Prefix computations on linked structures ...

Zheng Wei, Joseph JáJá

claim paper

Read More »

click to vote

CCGRID
2011
IEEE

256views Distributed And Parallel Com...» more CCGRID 2011»

Small Discrete Fourier Transforms on GPUs

12 years 11 months ago

Download www.cs.fsu.edu

– Efficient implementations of the Discrete Fourier Transform (DFT) for GPUs provide good performance with large data sizes, but are not competitive with CPU code for small data ...

S. Mitra, A. Srinivasan

claim paper

Read More »

click to vote

ICS
2009
Tsinghua U.

144views Distributed And Parallel Com...» more ICS 2009»

Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs

14 years 2 months ago

Download www.cs.virginia.edu

Iterative stencil loops (ISLs) are used in many applications and tiling is a well-known technique to localize their computation. When ISLs are tiled across a parallel architecture...

Jiayuan Meng, Kevin Skadron

claim paper

Read More »

« Prev « First page 4 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers