Exploring Fine-Grained Task-Based Execution on Multi-GPU Systems



Using multi-GPU systems, including GPU clusters, is gaining popularity in scientific computing. However, when using multiple GPUs concurrently, the conventional data parallel GPU programming paradigms, e.g., CUDA, cannot satisfactorily address certain issues, such as load balancing, GPU resource utilization, overlapping fine-grained computation with communication, etc. In this paper, we present a fine-grained task-based execution framework for multi-GPU systems. By scheduling finer-grained tasks than what is supported in the conventional CUDA programming method among multiple GPUs, and allowing concurrent task execution on a single GPU, our framework provides means for solving the above issues and efficiently utilizing multi-GPU systems. Experiments with a molecular dynamics application show that, for nonuniform distributed workload, the solutions based on our framework achieve good load balance, and considerable performance improvement over other solutions based on the standard C...

Long Chen, Oreste Villa, Guang R. Gao


CLUSTER 2011 | Concurrent Task | Distributed And Parallel Computing | Programming Methodologies | Programming Paradigms


Added	18 Dec 2011
Updated	18 Dec 2011
Type	Journal
Year	2011
Where	CLUSTER
Authors	Long Chen, Oreste Villa, Guang R. Gao

