Improving Cache Locality for Ray Casting with CUDA

14 years 2 months ago

Download www-hagi.ist.osaka-u.ac.jp

Abstract: In this paper, we present an acceleration method for texture-based ray casting on the compute uniﬁed device architecture (CUDA) compatible graphics processing unit (GPU). Since ray casting is a memory-intensive application, our method increases the hit rate of the texture cache during rendering. To achieve this, our method dynamically selects the width and height of thread blocks (TBs) such that each warp, which is a series of 32 threads simultaneously processed on the GPU, can achieve high data locality for speciﬁc viewpoints. The objective of this selection is to allow every warp rather than every thread to access data with a small stride, because the GPU executes multiple threads at the same time. In experiments using a GeForce GTX 480 card (i.e., the latest Fermi architecture), we ﬁnd that the speedup of our method ranges

Yuki Sugimoto, Fumihiko Ino, Kenichi Hagihara

Real-time Traffic

Acceleration Method | ARCS 2012 | Device Architecture | Graphics Processing Unit | Software Engineering |

claim paper

» Rendering complex scenes with memorycoherent ray tracing

» CacheEfficient Layouts of Bounding Volume Hierarchies

Post Info
More Details (n/a)

Added	19 Apr 2012
Updated	19 Apr 2012
Type	Journal
Year	2012
Where	ARCS
Authors	Yuki Sugimoto, Fumihiko Ino, Kenichi Hagihara

Comments (0)

Sciweavers

Improving Cache Locality for Ray Casting with CUDA

Acceleration Method | ARCS 2012 | Device Architecture | Graphics Processing Unit | Software Engineering |

Explore & Download

Productivity Tools

Sciweavers