Search Sciweavers | Sciweavers

43 search results - page 6 / 9

» Optimizing Nested Loops with Iterational and Instructional R...

click to vote

CASES
2007
ACM

81views System Software» more CASES 2007»

An efficient framework for dynamic reconfiguration of instruction-set customization

13 years 11 months ago

Download www.comp.nus.edu.sg

We present an efficient framework for dynamic reconfiguration of application-specific custom instructions. A key component of this framework is an iterative algorithm for temporal...

Huynh Phung Huynh, Joon Edward Sim, Tulika Mitra

claim paper

Read More »

click to vote

HPCA
2004
IEEE

143views Distributed And Parallel Com...» more HPCA 2004»

Creating Converged Trace Schedules Using String Matching

14 years 7 months ago

Download cseweb.ucsd.edu

This paper focuses on generating efficient software pipelined schedules for in-order machines, which we call Converged Trace Schedules. For a candidate loop, we form a string of t...

Satish Narayanasamy, Yuanfang Hu, Suleyman Sair, B...

claim paper

Read More »

click to vote

PPOPP
2006
ACM

133views Distributed And Parallel Com...» more PPOPP 2006»

Optimizing irregular shared-memory applications for distributed-memory systems

14 years 1 months ago

Download www.ecn.purdue.edu

In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming to distributed-memory platforms by automatic translation of OpenMP programs to ...

Ayon Basumallik, Rudolf Eigenmann

claim paper

Read More »

click to vote

ICPPW
2006
IEEE

108views Distributed And Parallel Com...» more ICPPW 2006»

Towards a Source Level Compiler: Source Level Modulo Scheduling

14 years 1 months ago

Download cs.haifa.ac.il

Modulo scheduling is a major optimization of high performance compilers wherein The body of a loop is replaced by an overlapping of instructions from diﬀerent iterations. Hence ...

Yosi Ben-Asher, Danny Meisler

claim paper

Read More »

click to vote

SC
1992
ACM

111views Applied Computing» more SC 1992»

Compiler Code Transformations for Superscalar-Based High Performance Systems

13 years 11 months ago

Download impact.crhc.illinois.edu

Exploiting parallelism at both the multiprocessor level and the instruction level is an e ective means for supercomputers to achieve high-performance. The amount of instruction-le...

Scott A. Mahlke, William Y. Chen, John C. Gyllenha...

claim paper

Read More »

« Prev « First page 6 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers