Sciweavers

778 search results - page 30 / 156
» Efficient Code Generation for Automatic Parallelization and ...
Sort
View
AMC
2006
80views more  AMC 2006»
13 years 9 months ago
Parallel preconditioned conjugate gradient optimization of the Rayleigh quotient for the solution of sparse eigenproblems
A parallel algorithm based on the multidimensional minimization of the Rayleigh quotient is proposed to evaluate the leftmost eigenpairs of the generalized symmetric positive defi...
Luca Bergamaschi, Angeles Martinez, Giorgio Pini
CORR
2006
Springer
98views Education» more  CORR 2006»
13 years 9 months ago
Enabling user-driven Checkpointing strategies in Reverse-mode Automatic Differentiation
This paper presents a new functionality of the Automatic Differentiation (AD) Tool tapenade. tapenade generates adjoint codes which are widely used for optimization or inverse prob...
Laurent Hascoët, Mauricio Araya-Polo
SC
2004
ACM
14 years 2 months ago
Rating Compiler Optimizations for Automatic Performance Tuning
To achieve maximum performance gains through compiler optimization, most automatic performance tuning systems use a feed-back directed approach to rate the code versions generated...
Zhelong Pan, Rudolf Eigenmann
MICRO
1992
IEEE
133views Hardware» more  MICRO 1992»
14 years 1 months ago
Code generation schema for modulo scheduled loops
Software pipelining is an important instruction scheduling technique for efficiently overlapping successive iterations of loops and executing them in parallel. Modulo scheduling i...
B. Ramakrishna Rau, Michael S. Schlansker, Parthas...
EGH
2005
Springer
14 years 2 months ago
Optimal automatic multi-pass shader partitioning by dynamic programming
Complex shaders must be partitioned into multiple passes to execute on GPUs with limited hardware resources. Automatic partitioning gives rise to an NP-hard scheduling problem tha...
Alan Heirich