Sciweavers

350 search results - page 8 / 70
» Performance Analysis and Optimization of Parallel Scientific...
Sort
View
117
Voted
CLUSTER
2008
IEEE
15 years 10 months ago
OpenMP-centric performance analysis of hybrid applications
—Several performance analysis tools support hybrid applications. Most originated as MPI profiling or tracing tools and OpenMP capabilities were added to extend the performance a...
Karl Fürlinger, Shirley Moore
126
Voted
PPL
2008
264views more  PPL 2008»
15 years 3 months ago
A Performance Evaluation of the Nehalem Quad-Core Processor for Scientific Computing
In this work we present an initial performance evaluation of Intel's latest, secondgeneration quad-core processor, Nehalem, and provide a comparison to first-generation AMD a...
Kevin J. Barker, Kei Davis, Adolfy Hoisie, Darren ...
130
Voted
CLUSTER
2004
IEEE
15 years 7 months ago
Predicting memory-access cost based on data-access patterns
Improving memory performance at software level is more effective in reducing the rapidly expanding gap between processor and memory performance. Loop transformations (e.g. loop un...
Surendra Byna, Xian-He Sun, William Gropp, Rajeev ...
144
Voted
CASES
2006
ACM
15 years 7 months ago
Efficient architectures through application clustering and architectural heterogeneity
Customizing architectures for particular applications is a promising approach to yield highly energy-efficient designs for embedded systems. This work explores the benefits of arc...
Lukasz Strozek, David Brooks
170
Voted
CCGRID
2008
IEEE
15 years 5 months ago
A Probabilistic Model to Analyse Workflow Performance on Production Grids
Production grids are complex and highly variable systems whose behavior is not well understood and difficult to anticipate. The goal of this study is to estimate the impact of the ...
Tristan Glatard, Johan Montagnat, Xavier Pennec