stages (to be assembled into 2/3-ish stage pipeline) * negation * normalization (to either of *s or non-*s precision) * rounding * packing to final fp format (has optional shift for denormal f32-in-f64)