Floating-Point Fused Multiply-Add: Reduced Latency for Floating-Point Addition

14 years 6 months ago

Download arith.polito.it

In this paper we propose an architecture for the computation of the double—precision ﬂoating—point multiply—add fused (MAF) operation A + (B × C) that permits to compute the ﬂoating—point addition with lower latency than ﬂoating—point multiplication and MAF. While previous MAF architectures compute the three operations with the same latency, the proposed architecture permits to skip the ﬁrst pipeline stages, those related with the multiplication B ×C, in case of an addition. For instance, for a MAF unit pipelined into three or ﬁve stages, the latency of the ﬂoating—point addition is reduced to two or three cycles, respectively. To achieve the latency reduction for ﬂoating-point addition, the alignment shifter, which in previous organizations is in parallel with the multiplication, is moved so that the multiplication can be bypassed. To avoid that this modiﬁcation increases the critical path, a double-datapath organization is used, in which the alignment a...

Javier D. Bruguera, Tomás Lang

Real-time Traffic

Applied Computing | ARITH 2005 | Lower Latency | ﬁrst Pipeline Stages | ﬂoating—point Addition |

claim paper

Post Info
More Details (n/a)

Added	24 Jun 2010
Updated	24 Jun 2010
Type	Conference
Year	2005
Where	ARITH
Authors	Javier D. Bruguera, Tomás Lang

Comments (0)

Sciweavers

Floating-Point Fused Multiply-Add: Reduced Latency for Floating-Point Addition

Applied Computing | ARITH 2005 | Lower Latency | ﬁrst Pipeline Stages | ﬂoating—point Addition |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers