Sciweavers

115 search results - page 4 / 23
» Fusion of Loops for Parallelism and Locality
Sort
View
MICRO
2000
IEEE
176views Hardware» more  MICRO 2000»
13 years 6 months ago
An Advanced Optimizer for the IA-64 Architecture
level of abstraction, compared with the program representation for scalar optimizations. For example, loop unrolling and loop unrolland-jam transformations exploit the large regist...
Rakesh Krishnaiyer, Dattatraya Kulkarni, Daniel M....
IPPS
2003
IEEE
14 years 8 days ago
Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints
The accurate modeling of the electronic structure of atoms and molecules involves computationally intensive tensor contractions involving large multi-dimensional arrays. The effi...
Daniel Cociorva, Xiaoyang Gao, Sandhya Krishnan, G...
MFCS
2000
Springer
13 years 10 months ago
Explicit Fusions
We introduce explicit fusions of names. To `fuse' two names is to declare that they may be used interchangeably. An explicit fusion is one that can exist in parallel with som...
Philippa Gardner, Lucian Wischik
DCOSS
2011
Springer
12 years 6 months ago
A distributed information fusion method for localization based on Pareto optimization
—To overcome the limitations of specific positioning techniques for mobile wireless nodes and achieve a high accuracy, the fusion of heterogeneous sensor information is an appea...
Alessio De Angelis, Carlo Fischione
CASES
2001
ACM
13 years 10 months ago
Combined partitioning and data padding for scheduling multiple loop nests
With the widening performance gap between processors and main memory, efficient memory accessing behavior is necessary for good program performance. Loop partition is an effective...
Zhong Wang, Edwin Hsing-Mean Sha, Xiaobo Hu