A scalable approach to performance analysis of MPI applications is presented that includes automated source code instrumentation, low-overhead generation of profile and trace data...
Shirley Moore, Felix Wolf, Jack Dongarra, Sameer S...
With the availability of multi-core processors and large-scale computing clusters, the study of parallel algorithms has been revived throughout the industry. We present a portfolio...
Christoph M. Wintersteiger, Leonardo Mendonç...
We present parallel algorithms for building decision-tree classifiers on shared-memory multiprocessor (SMP) systems. The proposed algorithms span the gamut of data and task parall...
Mohammed Javeed Zaki, Ching-Tien Ho, Rakesh Agrawa...
One approach to distributed parallel programming is to utilize self-migrating threads. Computations can be distributed first, and parallelized second. The first step produces a ...
Lei Pan, Lubomir Bic, Michael B. Dillencourt, Ming...
The performance of HPF codes is influenced by the characteristics of the parallel system and by the efficiency of the compilation system. Performance analysis has to take in...
Maria Calzarossa, Luisa Massari, Alessandro P. Mer...