The Memory Bandwidth Bottleneck and its Amelioration by a Compiler

14 years 12 months ago

Download www.cs.rochester.edu

As the speed gap between CPU and memory widens, memory hierarchy has become the primary factor limiting program performance. Until now, the principal focus of hardware and software innovations has been overcoming latency. However, the advent of latency tolerance techniques such as non-blocking cache and software prefetching begins the process of trading bandwidth for latency by overlapping and pipelining memory transfers. Since actual latency is the inverse of the consumed bandwidth, memory latency cannot be fully tolerated without inﬁnite bandwidth. This perspective has led us to two questions. Do current machines provide sufﬁcient data bandwidth? If not, can a program be restructured to consume less bandwidth? This paper answers these questions in two parts. The ﬁrst part deﬁnes a new bandwidth-based performance model and demonstrates the serious performance bottleneck due to the lack of memory bandwidth. The second part describes a new set of compiler optimizations for redu...

Chen Ding, Ken Kennedy

Real-time Traffic

Distributed And Parallel Computing | IPPS 2000 | Latency Tolerance Techniques | Memory Latency | Sufﬁcient Data Bandwidth |

claim paper

Post Info
More Details (n/a)

Added	31 Jul 2010
Updated	31 Jul 2010
Type	Conference
Year	2000
Where	IPPS
Authors	Chen Ding, Ken Kennedy

Comments (0)

Sciweavers

The Memory Bandwidth Bottleneck and its Amelioration by a Compiler

Distributed And Parallel Computing | IPPS 2000 | Latency Tolerance Techniques | Memory Latency | Sufﬁcient Data Bandwidth |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers