Sciweavers

IEEEPACT
2007
IEEE

Performance Portable Optimizations for Loops Containing Communication Operations

14 years 6 months ago
Performance Portable Optimizations for Loops Containing Communication Operations
Effective use of communication networks is critical to the performance and scalability of parallel applications. Partitioned Global Address Space languages like UPC bring the promise of performance and programmer productivity. Studies of well-tuned programs have suggested that PGAS languages are effective at utilizing modern networks because their one-sided communication is a good match to the underlying network hardware. An open question is whether the manual optimizations required to achieve good performance can be performed automatically by the compiler in a performance portable manner. In this paper we present a compiler and runtime optimization framework for loops containing communication operations. Our framework performs compile time message vectorization and strip-mining and defers until runtime the selection of the actual communication operations. At runtime, the communication requirements of the program are analyzed, and communication is instantiated and scheduled based on...
Costin Iancu, Wei Chen, Katherine A. Yelick
Added 03 Jun 2010
Updated 03 Jun 2010
Type Conference
Year 2007
Where IEEEPACT
Authors Costin Iancu, Wei Chen, Katherine A. Yelick
Comments (0)