Chimera: Collaborative Preemption for Multitasking on a Shared GPU

9 years 10 months ago

Download cccp.eecs.umich.edu

The demand for multitasking on graphics processing units (GPUs) is constantly increasing as they have become one of the default components on modern computer systems along with traditional processors (CPUs). Preemptive multitasking on CPUs has been primarily supported through context switching. However, the same preemption strategy incurs substantial overhead due to the large context in GPUs. The overhead comes in two dimensions: a preempting kernel suffers from a long preemption latency, and the system throughput is wasted during the switch. Without precise control over the large preemption overhead, multitasking on GPUs has little use for applications with strict latency requirements. In this paper, we propose Chimera, a collaborative preemption approach that can precisely control the overhead for multitasking on GPUs. Chimera ﬁrst introduces streaming multiprocessor (SM) ﬂushing, which can instantly preempt an SM by detecting and exploiting idempotent execution. Chimera utilize...

Jason Jong Kyu Park, Yongjun Park, Scott A. Mahlke

Real-time Traffic

ASPLOS 2015 | Programming Languages |

claim paper

Post Info
More Details (n/a)

Added	16 Apr 2016
Updated	16 Apr 2016
Type	Journal
Year	2015
Where	ASPLOS
Authors	Jason Jong Kyu Park, Yongjun Park, Scott A. Mahlke

Comments (0)

Sciweavers

Chimera: Collaborative Preemption for Multitasking on a Shared GPU

ASPLOS 2015 | Programming Languages |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers