Model-Driven Tile Size Selection for DOACROSS Loops on GPUs

14 years 3 months ago

Download www.cse.unsw.edu.au

DOALL loops are tiled to exploit DOALL parallelism and data locality on GPUs. In contrast, due to loop-carried dependences, DOACROSS loops must be skewed ﬁrst in order to make tiling legal and exploit wavefront parallelism across the tiles and within a tile. Thus, tile size selection, which is performance-critical, becomes more complex for DOACROSS loops than DOALL loops on GPUs. This paper presents a model-driven approach to automating this process. Validation using 1D, 2D and 3D SOR solvers shows that our framework can ﬁnd the tile sizes for these representative DOACROSS loops to achieve performances close to the best observed for a range of problem sizes tested.

Peng Di, Jingling Xue

Real-time Traffic

Distributed And Parallel Computing | Doacross Loops | EUROPAR 2011 | Tile Size | Tile Sizes |

claim paper

Added	20 Dec 2011
Updated	20 Dec 2011
Type	Journal
Year	2011
Where	EUROPAR
Authors	Peng Di, Jingling Xue

Sciweavers

Model-Driven Tile Size Selection for DOACROSS Loops on GPUs

Distributed And Parallel Computing | Doacross Loops | EUROPAR 2011 | Tile Size | Tile Sizes |

Explore & Download

Productivity Tools

Sciweavers