In binary, normalized numbers are represented as $\pm 1.xxxxxxx_2 \times 2^{yyyy}$.

Why normalization?

Floating-Point: IEEE 754-1985 standard → Single precision (32-bit), Double precision (64-bit)

image.png

(1 + fraction) is called the significand

image.png

Compared to Fixed-Point Numbers, Floating point number system can cover a much more extensive range!

image.png

Floating-Point Addition:

image.png

在现代流水线处理器中,单精度浮点加法的典型延迟(Latency)通常为4个时钟周期

image.png

image.png

image.png