Improving SIMT Efficiency of Global Rendering Algorithms with Architectural Support for Dynamic Micro-Kernels

14 years 4 months ago

Download home.engineering.iastate.edu

Wide Single Instruction, Multiple Thread (SIMT) architectures often require a static allocation of thread groups that are executed in lockstep throughout the entire application kernel. Individual thread branching is supported by executing all control flow paths for threads in a thread group and only committing the results of threads on the current control path. While convergence algorithms are used to maximize processor efficiency during branching operations, applications requiring complex control flow often result in low processor efficiency due to the length and quantity of control paths. Global rendering algorithms are an example of a class of application that can be accelerated using a large number of independent parallel threads that each require complex control flow, resulting in comparatively low efficiency on SIMT processors. To improve processor utilization for global rendering algorithms, we introduce a SIMT architecture that allows for threads to be created dynamically at ru...

Michael Steffen, Joseph Zambreno

Real-time Traffic

Global Rendering Algorithms | Hardware | MICRO 2010 | Thread Group | Threads |

claim paper

Post Info
More Details (n/a)

Added	14 Feb 2011
Updated	14 Feb 2011
Type	Journal
Year	2010
Where	MICRO
Authors	Michael Steffen, Joseph Zambreno

Comments (0)

Sciweavers

Improving SIMT Efficiency of Global Rendering Algorithms with Architectural Support for Dynamic Micro-Kernels

Global Rendering Algorithms | Hardware | MICRO 2010 | Thread Group | Threads |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers