Sciweavers

Efficient Execution of Parallel Applications in Multiprogram...
IEEEINTERACT
2003
IEEE
14 years 21 days ago
Procedure Cloning and Integration for Converting Parallelism from Coarse to Fine Grain
This paper introduces a method for improving program run-time performance by gathering work in an application and executing it efficiently in an integrated thread. Our methods ext...
Won So, Alexander G. Dean
ASPLOS
2009
ACM
14 years 8 months ago
DMP: deterministic shared memory multiprocessing
Current shared memory multicore and multiprocessor systems are nondeterministic. Each time these systems execute a multithreaded application, even if supplied with the same input,...
Joseph Devietti, Brandon Lucia, Luis Ceze, Mark Os...
IPPS
2009
IEEE
14 years 2 months ago
Application profiling on Cell-based clusters
In this paper, we present a methodology for profiling parallel applications executing on the IBM PowerXCell 8i (commonly referred to as the “Cell” processor). Specifically, we...
Hikmet Dursun, Kevin J. Barker, Darren J. Kerbyson...
HPDC
1996
IEEE
13 years 11 months ago
Modeling the Effects of Contention on the Performance of Heterogeneous Applications
Fast networks have made it possible to coordinate distributed heterogeneous CPU, memory, and storage resources to provide a powerful platform for executing high-performance applic...
Silvia M. Figueira, Francine Berman
CONCURRENCY
2010
13 years 5 months ago
The Scalasca performance toolset architecture
SCALASCA is a performance toolset that has been specifically designed to analyze parallel application execution behavior on large-scale systems. It offers an incremental performan...
Markus Geimer, Felix Wolf, Brian J. N. Wylie, Erik...