Automatic Synthesis of Customized Local Memories for Multicluster Application Accelerators



Distributed local memories, or scratchpads, have been shown to effectively reduce cost and power consumption of application-specific accelerators while maintaining performance. The design of the local memory organization must take several factors into account, including the memory bandwidth and size requirements of the program and the distribution of program data among the memories. In addition, when register structures and function units in the accelerator are clustered, the effects of intercluster communication should be taken into account. This work proposes a technique to synthesize the local memory architecture of a clustered accelerator using a phase-ordered approach. First, the dataflow graph is pre-partitioned to define a performance-centric grouping of the operations. Second, memory synthesis is performed by combining multiple data structures into a set of physical memories that minimizes cost while maintaining a performance threshold. Finally, post-partitioning is performed ...
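The memory synthesis step described above (combining data structures into physical memories at minimum cost while holding a performance threshold) can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the paper's algorithm: the names `Array`, `Memory`, `memory_cost`, and `synthesize`, the cost model (a fixed overhead per physical memory plus a per-word cost), and the performance threshold (a target initiation interval `ii` that bounds the accesses each memory can serve per loop iteration) are all hypothetical. The packing strategy is a plain first-fit-decreasing heuristic chosen for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class Array:
    """A program data structure (hypothetical model)."""
    name: str
    words: int      # storage size in words
    accesses: int   # memory accesses per loop iteration


@dataclass
class Memory:
    """A physical local memory holding one or more arrays."""
    arrays: list = field(default_factory=list)

    @property
    def words(self):
        return sum(a.words for a in self.arrays)

    @property
    def accesses(self):
        return sum(a.accesses for a in self.arrays)


def memory_cost(mem, fixed=100, per_word=1):
    # Assumed cost model: fixed overhead per memory plus a per-word cost.
    return fixed + per_word * mem.words


def synthesize(arrays, ii, ports=1):
    """Pack arrays into as few memories as the heuristic finds.

    Assumed performance threshold: a memory with `ports` ports can serve
    at most ii * ports accesses per loop iteration without raising the
    initiation interval above `ii`.
    """
    capacity = ii * ports
    memories = []
    # First-fit decreasing: place the most bandwidth-hungry arrays first.
    for arr in sorted(arrays, key=lambda a: a.accesses, reverse=True):
        if arr.accesses > capacity:
            raise ValueError(f"{arr.name} alone exceeds the bandwidth bound")
        for mem in memories:
            if mem.accesses + arr.accesses <= capacity:
                mem.arrays.append(arr)
                break
        else:
            # No existing memory has spare bandwidth; allocate a new one.
            memories.append(Memory([arr]))
    return memories


if __name__ == "__main__":
    demo = [Array("A", 64, 3), Array("B", 128, 2),
            Array("C", 32, 1), Array("D", 256, 2)]
    for i, mem in enumerate(synthesize(demo, ii=4)):
        names = [a.name for a in mem.arrays]
        print(i, names, mem.accesses, memory_cost(mem))
```

Under this assumed cost model, sharing a physical memory saves the fixed overhead of each memory that is eliminated, which is why the heuristic merges arrays whenever the bandwidth bound allows. The sketch ignores clustering and intercluster communication, which the abstract says the pre- and post-partitioning phases handle.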

Manjunath Kudlur, Kevin Fan, Michael L. Chu, Scott A. Mahlke


ASAP 2004 | Hardware | Local Memory | Local Memory Organization | Memory Organization


Post Info

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2004
Where	ASAP
Authors	Manjunath Kudlur, Kevin Fan, Michael L. Chu, Scott A. Mahlke
