Sciweavers

540 search results - page 78 / 108
» Scheduling dynamic parallelism on accelerators
Sort
View
ICPADS
2002
IEEE
14 years 2 months ago
Adaptive Matrix Multiplication in Heterogeneous Environments
In this paper, an adaptive matrix multiplication algorithm for dynamic heterogeneous environments is developed and evaluated. Unlike the state-of-the-art approaches, where load ba...
Bo Hong, Viktor K. Prasanna
IPPS
2007
IEEE
14 years 4 months ago
Decomposing Partial Order Execution Graphs to Improve Message Race Detection
In message-passing parallel applications, messages are not delivered in a strict order. In most applications, the computation results and the set of messages produced during the e...
Basile Schaeli, Sebastian Gerlach, Roger D. Hersch
ISCA
2008
IEEE
137views Hardware» more  ISCA 2008»
14 years 4 months ago
Self-Optimizing Memory Controllers: A Reinforcement Learning Approach
Efficiently utilizing off-chip DRAM bandwidth is a critical issue in designing cost-effective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...
Engin Ipek, Onur Mutlu, José F. Martí...
FGCS
2007
78views more  FGCS 2007»
13 years 9 months ago
Promoting performance and separation of concerns for data mining applications on the grid
Grid Computing brought the promise of making high-performance computing cheaper and more easily available than traditional supercomputing platforms. Such a promise was very well r...
Vasco Furtado, Francisco Flávio de Souza, W...
IJPP
2010
156views more  IJPP 2010»
13 years 7 months ago
ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures
Exploiting the full computational power of current hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform ar...
François Broquedis, Nathalie Furmento, Bric...