Latency-tolerant software pipelining in a production compiler

In this paper we investigate the benefit of scheduling non-critical loads for a higher latency during software pipelining. "Non-critical" denotes those loads that have sufficient slack in the cyclic data dependence graph so that increasing the scheduling distance to their first use can only increase the number of stages of the software pipeline, but should not increase the length of the individual stages, the initiation interval (II). The associated cost is in many cases negligible, but the memory stall reduction due to improved latency coverage and load clustering in the schedule can be considerable. We first analyze benefit and cost in theory and then present how we have implemented latency-tolerant pipelining experimentally in the Intel Itanium® product compiler. A key component of the technique is the preselection of likely long-latency loads that is integrated into prefetching heuristics in the high-level optimizer. Only when applied selectively based on these pre...
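The notion of a non-critical load can be illustrated with a small sketch. The Python below is not the paper's implementation. It is a simplified model that assumes only recurrence constraints matter (resource constraints are ignored) and that every dependence edge carries a latency and an iteration distance. All names (`rec_mii`, `is_noncritical`, the example graph) are hypothetical. A load counts as non-critical here if raising the latency on its outgoing edges leaves the recurrence-constrained minimum II unchanged.

```python
from typing import List, Tuple

# (source op, destination op, latency in cycles, iteration distance)
Edge = Tuple[str, str, int, int]


def has_positive_cycle(nodes: List[str], edges: List[Edge], ii: int) -> bool:
    """True if some dependence cycle cannot be satisfied at this II.

    Each edge gets the weight latency - ii * distance. A cycle with positive
    total weight means the recurrence needs more than ii cycles per iteration.
    Detection uses Bellman-Ford style longest-path relaxation.
    """
    dist = {n: 0 for n in nodes}
    for _ in range(len(nodes)):
        changed = False
        for src, dst, lat, d in edges:
            w = lat - ii * d
            if dist[src] + w > dist[dst]:
                dist[dst] = dist[src] + w
                changed = True
        if not changed:
            return False  # converged, so no positive cycle exists
    return True  # still relaxing after |V| passes: positive cycle


def rec_mii(nodes: List[str], edges: List[Edge]) -> int:
    """Smallest II >= 1 that satisfies all recurrences (resources ignored)."""
    upper = max(1, sum(lat for _, _, lat, _ in edges))
    for ii in range(1, upper + 1):
        if not has_positive_cycle(nodes, edges, ii):
            return ii
    raise ValueError("dependence cycle with zero iteration distance")


def is_noncritical(nodes: List[str], edges: List[Edge],
                   load: str, long_latency: int) -> bool:
    """Assumed criterion: scheduling `load` for `long_latency` keeps RecMII."""
    stretched = [
        (s, t, max(lat, long_latency) if s == load else lat, d)
        for s, t, lat, d in edges
    ]
    return rec_mii(nodes, stretched) == rec_mii(nodes, edges)


if __name__ == "__main__":
    # Hypothetical loop: acc += a[i] (load "ld" feeds a reduction) and
    # p = p->next (load "ldp" feeds itself in the next iteration).
    nodes = ["ld", "add", "st", "ldp"]
    edges = [
        ("ld", "add", 1, 0),
        ("add", "add", 1, 1),
        ("add", "st", 1, 0),
        ("ldp", "ldp", 1, 1),
    ]
    for load in ("ld", "ldp"):
        print(load, is_noncritical(nodes, edges, load, long_latency=10))
```

In this toy graph the array load `ld` lies on no recurrence, so a longer scheduled latency only lengthens the path to its first use, which corresponds to additional pipeline stages. The pointer-chasing load `ldp` sits on a recurrence, so the same latency increase would raise the II.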

Sebastian Winkel, Rakesh Krishnaiyer, Robyn Sampson

CGO 2008 | Cyclic Data Dependence | Likely Long-latency Loads | Non-critical Loads | Software Engineering

Post Info

Added	29 May 2010
Updated	29 May 2010
Type	Conference
Year	2008
Where	CGO
Authors	Sebastian Winkel, Rakesh Krishnaiyer, Robyn Sampson
