Sciweavers

403 search results for "On Using Incremental Profiling for the Performance Analysis ..."
BIBE
2007
IEEE
208 views, Bioinformatics
13 years 11 months ago
The GPU on biomedical image processing for color and phenotype analysis
The computational power and memory bandwidth of graphics processing units (GPUs) have turned them into attractive platforms for general-purpose applications. In this paper, we expl...
Antonio Ruiz, Manuel Ujaldon, Jose Antonio Andrade...
SIGMETRICS
2010
ACM
201 views, Hardware
14 years 9 days ago
Transparent, lightweight application execution replay on commodity multiprocessor operating systems
We present S, the first system to provide transparent, lowoverhead application record-replay and the ability to go live from replayed execution. S i...
Oren Laadan, Nicolas Viennot, Jason Nieh
PVM
2010
Springer
13 years 5 months ago
Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems
With the ever-increasing numbers of cores per node on HPC systems, applications are increasingly using threads to exploit the shared memory within a node, combined with MPI across ...
Gábor Dózsa, Sameer Kumar, Pavan Bal...
ISCA
2009
IEEE
146 views, Hardware
14 years 2 months ago
Multi-execution: multicore caching for data-similar executions
While microprocessor designers turn to multicore architectures to sustain performance expectations, the dramatic increase in parallelism of such architectures will put substantial...
Susmit Biswas, Diana Franklin, Alan Savage, Ryan D...
IWOMP
2011
Springer
12 years 10 months ago
A Runtime Implementation of OpenMP Tasks
Many task-based programming models have been developed and refined in recent years to support application development for shared memory platforms. Asynchronous tasks are a powerfu...
James LaGrone, Ayodunni Aribuki, Cody Addison, Bar...