Dynamic Task and Data Placement over NUMA Architectures: An OpenMP Runtime Perspective

Abstract. Exploiting the full computational power of current hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform architecture so as to avoid memory access penalties. Directive-based programming languages such as OpenMP provide programmers with an easy way to structure the parallelism of their application and to transmit this information to the runtime system. Our runtime, which is based on a multi-level thread scheduler combined with a NUMA-aware memory manager, converts this information into “scheduling hints” to solve thread/memory affinity issues. It enables dynamic load distribution guided by application structure and hardware topology, thus helping to achieve performance portability. First experiments show that mixed solutions (migrating threads and data) outperform next-touch-based data distribution policies and open possibilities for new optimizations.
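The thread/memory affinity problem named in the abstract can be illustrated with plain OpenMP. The sketch below is not the authors' runtime and does not use its scheduling hints or a next-touch policy. It assumes an operating system with a first-touch page placement policy (the Linux default), under which a page is placed on the NUMA node of the thread that first writes it. The function names `alloc_first_touch` and `sum_local` are hypothetical, chosen for this illustration only.

```c
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical helper: allocate n doubles and initialize them in parallel.
 * Assumption: under a first-touch policy, each page ends up on the NUMA
 * node of the thread that writes it first, so a parallel initialization
 * spreads the array across the nodes running the threads. */
double *alloc_first_touch(size_t n)
{
    double *a = malloc(n * sizeof *a);
    if (a == NULL)
        return NULL;

    /* Static schedule: thread t initializes one contiguous chunk. */
    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n; i++)
        a[i] = 1.0;

    return a;
}

/* Hypothetical helper: sum the array with the same static schedule, so
 * each thread mostly reads the chunk it initialized (and therefore,
 * under the assumption above, mostly node-local memory). */
double sum_local(const double *a, size_t n)
{
    double total = 0.0;

    #pragma omp parallel for schedule(static) reduction(+:total)
    for (long i = 0; i < (long)n; i++)
        total += a[i];

    return total;
}
```

With GCC or Clang the pragmas take effect when compiling with `-fopenmp`; without that flag they are ignored and the code runs serially with the same result. Matching the schedule of the initialization and compute loops only helps while threads stay on the cores where they first touched the data, which is the kind of static arrangement that the dynamic thread and data migration described in the abstract is meant to go beyond.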

Current Hierarchical Multiprocessor | IWOMP 2009 | Memory Access Penalties | OpenMP Provide Programmers | Programming Languages

Post Info

Added	26 Jul 2010
Updated	26 Jul 2010
Type	Conference
Year	2009
Where	IWOMP
Authors	François Broquedis, Nathalie Furmento, Brice Goglin, Raymond Namyst, Pierre-André Wacrenier
