Search Sciweavers | Sciweavers

84 search results - page 14 / 17

» Loop Distribution and Fusion with Timing and Code Size Optim...

135

click to vote

CASES
2006
ACM

146views System Software» more CASES 2006»

Adapting compilation techniques to enhance the packing of instructions into registers

15 years 9 months ago

Download ww2.cs.fsu.edu

The architectural design of embedded systems is becoming increasingly idiosyncratic to meet varying constraints regarding energy consumption, code size, and execution time. Tradit...

Stephen Hines, David B. Whalley, Gary S. Tyson

claim paper

Read More »

142

click to vote

IPPS
2009
IEEE

142views Distributed And Parallel Com...» more IPPS 2009»

Annotation-based empirical performance tuning using Orio

15 years 10 months ago

Download www.mcs.anl.gov

In many scientiﬁc applications, signiﬁcant time is spent tuning codes for a particular highperformance architecture. Tuning approaches range from the relatively nonintrusive (...

Albert Hartono, Boyana Norris, Ponnuswamy Sadayapp...

claim paper

Read More »

126

Voted

ICPPW
2005
IEEE

101views Distributed And Parallel Com...» more ICPPW 2005»

Speculative Parallel Threading Architecture and Compilation

15 years 9 months ago

Download people.apache.org

Thread-level speculation is a technique that brings thread-level parallelism beyond the data-flow limit by executing a piece of code ahead of time speculatively before all its inp...

Xiao-Feng Li, Zhao-Hui Du, Chen Yang, Chu-Cheow Li...

claim paper

Read More »

130

Voted

SPAA
2003
ACM

135views Distributed And Parallel Com...» more SPAA 2003»

Performance comparison of MPI and three openMP programming styles on shared memory multiprocessors

15 years 8 months ago

Download www.lri.fr

When using a shared memory multiprocessor, the programmer faces the selection of the portable programming model which will deliver the best performance. Even if he restricts his c...

Géraud Krawezik

claim paper

Read More »

149

click to vote

VALUETOOLS
2006
ACM

167views Hardware» more VALUETOOLS 2006»

Detailed cache simulation for detecting bottleneck, miss reason and optimization potentialities

15 years 9 months ago

Download itec.uka.de

Cache locality optimization is an eﬃcient way for reducing the idle time of modern processors in waiting for needed data. This kind of optimization can be achieved either on the...

Jie Tao, Wolfgang Karl

claim paper

Read More »

« Prev « First page 14 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers