Many-Thread Aware Prefetching Mechanisms for GPGPU Applications

13 years 11 months ago

Download comparch.gatech.edu

Abstract-- We consider the problem of how to improve memory latency tolerance in massively multithreaded GPGPUs when the thread-level parallelism of an application is not sufficient to hide memory latency. One solution used in conventional CPU systems is prefetching, both in hardware and software. However, we show that straightforwardly applying such mechanisms to GPGPU systems does not deliver the expected performance benefits and can in fact hurt performance when not used judiciously. This paper proposes new hardware and software prefetching mechanisms tailored to GPGPU systems, which we refer to as many-thread aware prefetching (MT-prefetching) mechanisms. Our software MT-prefetching mechanism, called interthread prefetching, exploits the existence of common memory access behavior among fine-grained threads. For hardware MTprefetching, we describe a scalable prefetcher training algorithm along with a hardware-based inter-thread prefetching mechanism. In some cases, blindly applying ...

Jaekyu Lee, Nagesh B. Lakshminarayana, Hyesoon Kim

Real-time Traffic

Hardware | Memory Latency | MICRO 2010 | Performance |

claim paper

Post Info
More Details (n/a)

Added	14 Feb 2011
Updated	14 Feb 2011
Type	Journal
Year	2010
Where	MICRO
Authors	Jaekyu Lee, Nagesh B. Lakshminarayana, Hyesoon Kim, Richard W. Vuduc

Comments (0)

Sciweavers

Many-Thread Aware Prefetching Mechanisms for GPGPU Applications

Hardware | Memory Latency | MICRO 2010 | Performance |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers