Sciweavers

PPOPP
2015
ACM

Exploiting communication concurrency on high performance computing systems

8 years 7 months ago
Exploiting communication concurrency on high performance computing systems
Although logically available, applications may not exploit enough instantaneous communication concurrency to maximize hardware utilization on HPC systems. This is exacerbated in hybrid programming models such as SPMD+OpenMP. We present the design of a“multi-threaded”runtime able to transparently increase the instantaneous network concurrency and to provide near saturation bandwidth, independent of the application configuration and dynamic behavior. The runtime forwards communication requests from application level tasks to multiple communication servers. Our techniques alleviate the need for spatial and temporal application level message concurrency optimizations. Experimental results show improved message throughput and bandwidth by as much as 150% for 4KB bytes messages on InfiniBand and by as much as 120% for 4KB byte messages on Cray Aries. For more complex operations such as all-to-all collectives, we observe as much as 30% speedup. This translates into 23% speedup on 12,28...
Nicholas Chaimov, Khaled Z. Ibrahim, Samuel Willia
Added 16 Apr 2016
Updated 16 Apr 2016
Type Journal
Year 2015
Where PPOPP
Authors Nicholas Chaimov, Khaled Z. Ibrahim, Samuel Williams, Costin Iancu
Comments (0)