A scalable framework for heterogeneous GPU-based clusters

13 years 9 months ago

Download web.eecs.utk.edu

GPU-based heterogeneous clusters continue to draw attention from vendors and HPC users due to their high energy efﬁciency and much improved single-node computational performance, however, there is little parallel software available that can utilize all CPU cores and all GPUs on the heterogeneous system efﬁciently. On a heterogeneous cluster, the performance of a GPU (or a compute node) increases in a much faster rate than the performance of the PCI-Express connection (or the interconnection network) such that communication eventually becomes the bottleneck of the entire system. To overcome the bottleneck, we developed a multi-level partitioning and distribution method that guarantees a near-optimal communication volume. We have also extended heterogeneous tile algorithms to work on distributed-memory GPU clusters. Our main idea is to execute a serial program and generate hybrid-size tasks, and follow a dataﬂow programming model to ﬁre the tasks on different compute nodes. We t...

Fengguang Song, Jack Dongarra

Real-time Traffic