Sciweavers

» Parallelizing post-placement timing optimization
MST
1998
13 years 8 months ago
An Experimental Analysis of Parallel
We have developed a methodology for predicting the performance of parallel algorithms on real parallel machines. The methodology consists of two steps. First, we characterize a mac...
Guy E. Blelloch, Charles E. Leiserson
CLUSTER
2008
IEEE
14 years 3 months ago
A comparison of search heuristics for empirical code optimization
This paper describes the application of various search techniques to the problem of automatic empirical code optimization. The search process is a critical aspect of auto-tuning...
Keith Seymour, Haihang You, Jack Dongarra
VIS
2005
IEEE
14 years 10 months ago
OpenGL Multipipe SDK: A Toolkit for Scalable Parallel Rendering
We describe OpenGL Multipipe SDK (MPK), a toolkit for scalable parallel rendering based on OpenGL. MPK provides a uniform application programming interface (API) to manage scalabl...
Praveen Bhaniramka, Philippe C. D. Robert, Stefan ...
PVM
2007
Springer
14 years 3 months ago
Optimization of Collective Communications in HeteroMPI
HeteroMPI is an extension of MPI designed for high performance computing on heterogeneous networks of computers. The recent new feature of HeteroMPI is the optimized vers...
Alexey L. Lastovetsky, Maureen O'Flynn, Vladimir R...
ISCA
2008
IEEE
14 years 3 months ago
Parallelism-Aware Batch Scheduling: Enhancing both Performance and Fairness of Shared DRAM Systems
In a chip-multiprocessor (CMP) system, the DRAM system is shared among cores. In a shared DRAM system, requests from a thread can not only delay requests from other threads by cau...
Onur Mutlu, Thomas Moscibroda