Sciweavers

CGO
2008
IEEE
14 years 6 months ago
Modulo scheduling for highly customized datapaths to increase hardware reusability
In the embedded domain, custom hardware in the form of ASICs is often used to implement critical parts of applications when performance and energy efficiency goals cannot be met ...
Kevin Fan, Hyunchul Park, Manjunath Kudlur, Scott ...
CGO
2008
IEEE
14 years 6 months ago
Program optimization space pruning for a multithreaded gpu
Program optimization for highly-parallel systems has historically been considered an art, with experts doing much of the per
Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone...
CGO
2008
IEEE
14 years 6 months ago
Near-optimal instruction selection on dags
David Ryan Koes, Seth Copen Goldstein
CGO
2008
IEEE
14 years 6 months ago
Prediction and trace compression of data access addresses through nested loop recognition
This paper describes an algorithm that takes a trace (i.e., a sequence of numbers or vectors of numbers) as input, and from that produces a sequence of loop nests that, when run, ...
Alain Ketterlin, Philippe Clauss
CGO
2008
IEEE
14 years 6 months ago
Automatic array inlining in java virtual machines
Array inlining expands the concepts of object inlining to arrays. Groups of objects and arrays that reference each other are placed consecutively in memory so that their relative ...
Christian Wimmer, Hanspeter Mössenböck
CGO
2008
IEEE
14 years 6 months ago
Fault-safe code motion for type-safe languages
Brian R. Murphy, Vijay Menon, Florian T. Schneider...
CGO
2008
IEEE
14 years 6 months ago
Spice: speculative parallel iteration chunk execution
The recent trend in the processor industry of packing multiple processor cores in a chip has increased the importance of automatic techniques for extracting thread level paralleli...
Easwaran Raman, Neil Vachharajani, Ram Rangan, Dav...
CGO
2008
IEEE
14 years 6 months ago
Cole: compiler optimization level exploration
Modern compilers implement a large number of optimizations which all interact in complex ways, and which all have a different impact on code quality, compilation time, code size,...
Kenneth Hoste, Lieven Eeckhout
CGO
2008
IEEE
14 years 6 months ago
Compiling for vector-thread architectures
Vector-thread (VT) architectures exploit multiple forms of parallelism simultaneously. This paper describes a compiler for the Scale VT architecture, which takes advantage of the ...
Mark Hampton, Krste Asanovic