Sciweavers

EUROPAR
2011
Springer

Model-Driven Tile Size Selection for DOACROSS Loops on GPUs

13 years 5 days ago
Model-Driven Tile Size Selection for DOACROSS Loops on GPUs
DOALL loops are tiled to exploit DOALL parallelism and data locality on GPUs. In contrast, due to loop-carried dependences, DOACROSS loops must be skewed first in order to make tiling legal and exploit wavefront parallelism across the tiles and within a tile. Thus, tile size selection, which is performance-critical, becomes more complex for DOACROSS loops than DOALL loops on GPUs. This paper presents a model-driven approach to automating this process. Validation using 1D, 2D and 3D SOR solvers shows that our framework can find the tile sizes for these representative DOACROSS loops to achieve performances close to the best observed for a range of problem sizes tested.
Peng Di, Jingling Xue
Added 20 Dec 2011
Updated 20 Dec 2011
Type Journal
Year 2011
Where EUROPAR
Authors Peng Di, Jingling Xue
Comments (0)