Sciweavers

572 search results - page 57 / 115
» A Performance Prediction Methodology for Data-dependent Para...
Sort
View
ISCAPDCS
2001
13 years 10 months ago
Branch Prediction of Conditional Nested Loops through an Address Queue
-Multi-dimensional applications, such as image processing and seismic analysis, usually require the optimized performance obtained from instruction-level parallelism. The critical ...
Zhigang Jin, Nelson L. Passos, Virgil Andronache
ICS
1994
Tsinghua U.
14 years 29 days ago
Evaluating automatic parallelization for efficient execution on shared-memory multiprocessors
We present a parallel code generation algorithm for complete applications and a new experimental methodology that tests the efficacy of our approach. The algorithm optimizes for d...
Kathryn S. McKinley
EUROPAR
2009
Springer
14 years 3 months ago
Adaptive Parallel Householder Bidiagonalization
With the increasing use of large image and video archives and high-resolution multimedia data streams in many of today’s research and application areas, there is a growing need f...
Fangbin Liu, Frank J. Seinstra
EUROPAR
2010
Springer
13 years 10 months ago
Multithreaded Geant4: Semi-automatic Transformation into Scalable Thread-Parallel Software
This work presents an application case study. Geant4 is a 750,000 line toolkit first designed in the mid-1990s and originally intended only for sequential computation. Intel's...
Xin Dong 0004, Gene Cooperman, John Apostolakis
ICPADS
2006
IEEE
14 years 2 months ago
Critical Path and Area Based Scheduling of Parallel Task Graphs on Heterogeneous Platforms
While most parallel task graphs scheduling research has been done in the context of single homogeneous clusters, heterogeneous platforms have become prevalent and are extremely at...
Tchimou N'Takpé, Frédéric Sut...