Adaptive and transparent cache bypassing for GPUs



In the last decade, GPUs have been widely adopted for general-purpose applications. To capture on-chip locality for these applications, modern GPUs have integrated a multilevel cache hierarchy in an attempt to reduce the amount and latency of the massive and sometimes irregular memory accesses. However, inferior performance is frequently attained due to serious congestion in the caches resulting from the huge number of concurrent threads. In this paper, we propose a novel compile-time framework for adaptive and transparent cache bypassing on GPUs. It uses a simple yet effective approach to control the bypass degree to match the size of applications’ runtime footprints. We validate the design on seven GPU platforms that cover all existing GPU generations using 16 applications from widely used GPU benchmarks. Experiments show that our design can significantly mitigate the negative impact due to small cache sizes and improve the overall performance. We analyze the performance a...


Applied Computing | SC 2015 |


Post Info

Added	17 Apr 2016
Updated	17 Apr 2016
Type	Journal
Year	2015
Where	SC


Sciweavers
