Dynamic loss-scaler state for mixed-precision training.
Tracks the current scale factor and the number of consecutive
successful (non-overflow) steps. On overflow the scale is halved;
after growth_interval successful steps it doubles.
Summary
Functions
Create a new loss scaler.