Efficient Pipelining of Nested Loops: Unroll-and-Squash



The size and complexity of current custom VLSI have forced the use of high-level programming languages to describe hardware, and of compiler and synthesis technology to map abstract designs into silicon. Since streaming data processing in DSP applications is typically described by loop constructs in a high-level language, loops are the most critical portions of the hardware description, and special techniques are developed to synthesize them optimally. In this paper, we introduce a new method for mapping and pipelining nested loops efficiently into hardware. It achieves fine-grain parallelism even on inner loops with strong intra- and inter-iteration data dependences and, by sharing resources economically, improves performance at the expense of a small amount of additional area. We implemented the transformation within the Nimble Compiler environment and evaluated its performance on several signal-processing benchmarks. The method achieves up to 2x improvement in area efficiency compared to the best...
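The abstract does not spell out the transformation, so the following is only a rough software sketch of the general idea the title suggests, under stated assumptions: the outer loop is unrolled by a factor of 2, and the two resulting data sets are "squashed" into a single copy of the inner loop body, split into two stages that work on different data sets in the same step. The functions `stage1` and `stage2`, the unroll factor, and the two-stage split are hypothetical choices for illustration, not the authors' implementation or anything taken from the Nimble Compiler.

```python
def stage1(a, j):
    # Hypothetical first half of the inner loop body.
    return (a * 3 + j) % 1000003


def stage2(b, j):
    # Hypothetical second half of the inner loop body.
    return (b ^ (b >> 3)) + j


def original(xs, n):
    # Reference nested loop: the inner loop carries a dependence on `a`,
    # so its iterations cannot simply be overlapped.
    out = []
    for x in xs:
        a = x
        for j in range(n):
            b = stage1(a, j)
            a = stage2(b, j)
        out.append(a)
    return out


def unroll_and_squash_by_2(xs, n):
    # Assumes len(xs) is even and n >= 1.
    # Two outer iterations share ONE copy of each stage. In every step,
    # stage2 works on one data set while stage1 works on the other.
    out = []
    for i in range(0, len(xs), 2):
        a = [xs[i], xs[i + 1]]  # values waiting to enter stage1
        b = [None, None]        # values waiting to enter stage2
        b[0] = stage1(a[0], 0)  # prolog: fill the pipeline
        for t in range(1, 2 * n):
            s2, j2 = (t - 1) % 2, (t - 1) // 2  # data set and index in stage2
            s1, j1 = t % 2, t // 2              # data set and index in stage1
            new_a = stage2(b[s2], j2)
            new_b = stage1(a[s1], j1)
            a[s2], b[s1] = new_a, new_b         # both stages advance together
        a[1] = stage2(b[1], n - 1)  # epilog: drain the pipeline
        out.extend(a)
    return out
```

In this sketch each stage is instantiated once and is busy in every step of the steady state, which is the kind of resource sharing the abstract refers to; a plain unroll-and-jam by 2 would instead duplicate both stages.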

Darin Petkov, Randolph E. Harr, Saman P. Amarasinghe


Current Custom VLSI | Data-dependent Inner Loops | Distributed And Parallel Computing | IPPS 2002 | Synthesis Technology


» Low Complexity Design of High Speed Parallel Decision Feedback Equalizers

» Automatic memory partitioning and scheduling for throughput and power optimization

Post Info

Added	15 Jul 2010
Updated	15 Jul 2010
Type	Conference
Year	2002
Where	IPPS
Authors	Darin Petkov, Randolph E. Harr, Saman P. Amarasinghe
