Efficient performance tuning of parallel programs is often hard. In this paper we describe an approach that uses a uni-processor execution of a multithreaded program as reference ...
High-performance computers currently under construction, such as IBM’s Blue Gene/L, consisting of large numbers (64K) of low-cost processing elements with relatively small local...
Ed Upchurch, Paul L. Springer, Maciej Brodowicz, S...
Performance prediction across platforms is increasingly important as developers can choose from a wide range of execution platforms. The main challenge remains to perform accurate...
This paper presents an overview of the Environment for Memory Performance Studies (EMPS). EMPS is a framework that allows different data-gathering and simulation tools to be composed to...
Jeffrey K. Hollingsworth, Allan Snavely, Simone Sb...
Predicting sequential execution blocks of a large-scale parallel application is an essential part of accurate prediction of the overall performance of the application. Whe...
Gengbin Zheng, Gagan Gupta, Eric J. Bohm, Isaac Do...