Optimizing the use of GPU memory in applications with large data sets


Abstract: With general-purpose programmable GPUs becoming increasingly popular, automated tools are needed to bridge the gap between the performance achievable on highly parallel architectures and the performance that applications require. In this paper, we concentrate on improving GPU memory management for applications with large and intermediate data sets that do not fit completely in GPU memory. For such applications, the movement of the excess data to and from CPU memory must be carefully managed. In particular, we focus on solving the joint task scheduling and data transfer scheduling problem posed in [1], and propose an algorithm that gives close-to-optimal results (as measured against running simulated annealing overnight) in terms of the amount of data transferred for image processing benchmarks such as edge detection and Convolutional Neural Networks. Our results enable a reduction of up to 30

Nadathur Satish, Narayanan Sundaram, Kurt Keutzer


Distributed And Parallel Computing | GPU Memory | GPU Memory Management | HIPC 2009 | Purpose Programmable Gpus |


Post Info

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Journal
Year	2009
Where	HIPC
Authors	Nadathur Satish, Narayanan Sundaram, Kurt Keutzer
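
The problem stated in the abstract, choosing a task order and a transfer schedule together so that little data moves between CPU and GPU memory, can be illustrated with a small simulation. The sketch below is not the algorithm proposed in the paper. It assumes a hypothetical five-task pipeline, abstract size units, a GPU capacity of 8 units, and a simple "evict the item whose next read is furthest away" rule. The names Task and transferred_units are illustrative, and items that no later task reads are assumed to be discardable.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    name: str
    inputs: tuple   # data items read by the task
    outputs: tuple  # data items written by the task


def transferred_units(order, sizes, capacity, on_host):
    """Run tasks in `order`; return the total size copied between CPU and GPU."""
    resident = set()      # items currently held in GPU memory
    host = set(on_host)   # items with a valid copy in CPU memory
    moved = 0

    def next_use(item, start):
        # Index of the next task at or after `start` that reads `item`.
        for i in range(start, len(order)):
            if item in order[i].inputs:
                return i
        return len(order)  # never read again

    for step, task in enumerate(order):
        needed = set(task.inputs) | set(task.outputs)
        if sum(sizes[d] for d in needed) > capacity:
            raise ValueError(f"task {task.name} cannot fit in GPU memory")
        for item in task.inputs + task.outputs:
            if item in resident:
                continue
            while sum(sizes[d] for d in resident) + sizes[item] > capacity:
                # Assumed policy: evict the item whose next read is furthest away.
                victim = max(sorted(resident - needed),
                             key=lambda d: next_use(d, step + 1))
                if next_use(victim, step + 1) < len(order) and victim not in host:
                    host.add(victim)       # still needed: GPU -> CPU copy
                    moved += sizes[victim]
                resident.remove(victim)
            if item in task.inputs:
                if item not in host:
                    raise ValueError(f"{item} is read before it is produced")
                moved += sizes[item]       # CPU -> GPU copy
            resident.add(item)             # outputs are allocated without a copy
    return moved


# Hypothetical pipeline: two branches computed from one image, then combined.
sizes = {"img": 4, "a": 3, "b": 3, "c": 3, "d": 3, "out": 2}
A = Task("A", ("img",), ("a",))
B = Task("B", ("img",), ("b",))
C = Task("C", ("a",), ("c",))
D = Task("D", ("b",), ("d",))
E = Task("E", ("c", "d"), ("out",))

breadth_first = (A, B, C, D, E)
depth_first = (A, C, B, D, E)

for label, order in (("breadth-first", breadth_first), ("depth-first", depth_first)):
    print(label, transferred_units(order, sizes, capacity=8, on_host={"img"}))
```

Under these assumptions the breadth-first order moves 22 units and the depth-first order moves 14, even though both run the same tasks with the same memory limit. This is why the task order and the transfer schedule are treated as one joint problem.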
