CAPRI: Prediction of compaction-adequacy for handling control-divergence in GPGPU architectures

Download lph.ece.utexas.edu

Wide SIMD-based GPUs have evolved into a promising platform for running general-purpose workloads. Current programmable GPUs allow even code with irregular control to execute well on their SIMD pipelines. To do this, each SIMD lane is considered to execute a logical thread where hardware ensures that control flow is accurate by automatically applying masked execution. The masked execution, however, often degrades performance because the issue slots of masked lanes are wasted. This degradation can be mitigated by dynamically compacting multiple unmasked threads into a single SIMD unit. This paper proposes a fundamentally new approach to branch compaction that avoids the unnecessary synchronization required by previous techniques and that only stalls threads that are likely to benefit from compaction. Our technique is based on the compaction-adequacy predictor (CAPRI). CAPRI dynamically identifies the compaction-effectiveness of a branch and only stalls threads that are predicted to b...
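
The idea of compaction adequacy in the abstract can be made concrete with a small model. The sketch below is illustrative only and is not the paper's implementation: it assumes a 4-lane SIMD width, assumes that a compacted thread must stay in its original lane, and uses a hypothetical one-bit per-branch history (`OneBitAdequacyPredictor`) to stand in for a predictor. All names and parameters are assumptions introduced for this example.

```python
from typing import Dict, List

WARP_WIDTH = 4  # assumed SIMD width, kept small for readability

Mask = List[bool]  # one entry per lane: True if the thread is active


def slots_without_compaction(masks: List[Mask]) -> int:
    """Every SIMD unit with at least one active lane uses a full issue slot."""
    return sum(1 for mask in masks if any(mask))


def slots_with_compaction(masks: List[Mask]) -> int:
    """Assumption: threads keep their home lane when merged, so the lane
    with the most active threads sets the number of compacted units."""
    if not masks:
        return 0
    per_lane = [sum(mask[lane] for mask in masks) for lane in range(WARP_WIDTH)]
    return max(per_lane)


def compaction_helps(masks: List[Mask]) -> bool:
    """Compaction is 'adequate' here if it saves at least one issue slot."""
    return slots_with_compaction(masks) < slots_without_compaction(masks)


class OneBitAdequacyPredictor:
    """Hypothetical predictor: remembers, per branch address, whether
    compaction saved issue slots the last time that branch diverged."""

    def __init__(self, default: bool = True) -> None:
        self.table: Dict[int, bool] = {}
        self.default = default

    def predict(self, branch_pc: int) -> bool:
        # True means "stall and wait for compaction at this branch".
        return self.table.get(branch_pc, self.default)

    def update(self, branch_pc: int, masks: List[Mask]) -> None:
        self.table[branch_pc] = compaction_helps(masks)


# Active threads sit in different lanes: two units merge into one.
complementary = [[True, False, True, False], [False, True, False, True]]
# Active threads collide in lane 0: nothing can be merged.
conflicting = [[True, False, False, False], [True, False, False, False]]

predictor = OneBitAdequacyPredictor()
predictor.update(0x40, complementary)
predictor.update(0x80, conflicting)
```

In this model, `predictor.predict(0x40)` stays true because merging saved an issue slot, while `predictor.predict(0x80)` becomes false, so threads reaching that branch would not be stalled to wait for a compaction that cannot help.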

Minsoo Rhu, Mattan Erez

Baseline Design | Hardware | ISCA 2012 | Prediction Accuracy | Unnecessary Synchronization |

Post Info

Added	28 Sep 2012
Updated	28 Sep 2012
Type	Journal
Year	2012
Where	ISCA
Authors	Minsoo Rhu, Mattan Erez
