Sciweavers

JUCS
2000

Compiler Generated Multithreading to Alleviate Memory Latency

13 years 11 months ago
Compiler Generated Multithreading to Alleviate Memory Latency
: Since the era of vector and pipelined computing, the computational speed is limited by the memory access time. Faster caches and more cache levels are used to bridge the growing gap between the memory and processor speeds. With the advent of multithreaded processors, it becomes feasible to concurrently fetch data and compute in two cooperating threads. A technique is presented to generate these threads at compile time, taking into account the characteristics of both the program and the underlying architecture. The results have been evaluated for an explicitly parallel processor. With a number of common programs the data-fetch thread allows to continue the computation without cache miss stalls. Key Words: data locality, multithreading, run-time data relocation, compiler optimization, cache optimization, prefetching, tiling
Kristof Beyls, Erik H. D'Hollander
Added 19 Dec 2010
Updated 19 Dec 2010
Type Journal
Year 2000
Where JUCS
Authors Kristof Beyls, Erik H. D'Hollander
Comments (0)