Sciweavers

30 search results - page 1 / 6
» Performance scalability of decoupled software pipelining
Sort
View
TACO
2008
74views more  TACO 2008»
13 years 10 months ago
Performance scalability of decoupled software pipelining
Ram Rangan, Neil Vachharajani, Guilherme Ottoni, D...
CGO
2008
IEEE
14 years 5 months ago
Parallel-stage decoupled software pipelining
In recent years, the microprocessor industry has embraced chip multiprocessors (CMPs), also known as multi-core architectures, as the dominant design paradigm. For existing and ne...
Easwaran Raman, Guilherme Ottoni, Arun Raman, Matt...
CGO
2010
IEEE
14 years 5 months ago
Decoupled software pipelining creates parallelization opportunities
Decoupled Software Pipelining (DSWP) is one approach to automatically extract threads from loops. It partitions loops into long-running threads that communicate in a pipelined man...
Jialu Huang, Arun Raman, Thomas B. Jablin, Yun Zha...
MICRO
2005
IEEE
136views Hardware» more  MICRO 2005»
14 years 4 months ago
Automatic Thread Extraction with Decoupled Software Pipelining
Until recently, a steadily rising clock rate and other uniprocessor microarchitectural improvements could be relied upon to consistently deliver increasing performance for a wide ...
Guilherme Ottoni, Ram Rangan, Adam Stoler, David I...
EGH
2011
Springer
12 years 10 months ago
High-Performance Software Rasterization on GPUs
In this paper, we implement an efficient, completely software-based graphics pipeline on a GPU. Unlike previous approaches, we obey ordering constraints imposed by current graphi...
Samuli Laine, Tero Karras