OpenPower detects tininess before rounding, whereas many widely used architectures (x86 among them) detect tininess after rounding. I think we should add a flag that lets OpenPower switch between the two modes. This is part of running x86 FP operations efficiently on OpenPower (e.g. through QEMU).
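To make the difference concrete, here is a rough software model of the two detection rules. It is only a sketch, not OpenPower or QEMU code: the function names are made up for illustration, and it assumes binary32 parameters (24-bit precision, minimum normal exponent -126) and round-to-nearest-even.

```python
from fractions import Fraction

# Assumed format parameters (IEEE 754 binary32); illustrative only.
P = 24                      # significand precision in bits
EMIN = -126                 # minimum normal exponent
MIN_NORMAL = Fraction(2) ** EMIN


def round_unbounded(x: Fraction, p: int = P) -> Fraction:
    """Round x to p significant bits (ties to even), with no exponent limit."""
    if x == 0:
        return x
    a = abs(x)
    # Find e such that 2**e <= a < 2**(e + 1).
    e = a.numerator.bit_length() - a.denominator.bit_length()
    if Fraction(2) ** e > a:
        e -= 1
    ulp = Fraction(2) ** (e - p + 1)
    n = round(a / ulp)      # round() on a Fraction rounds half to even
    r = n * ulp
    return r if x > 0 else -r


def tiny_before_rounding(x: Fraction) -> bool:
    """Tiny if the exact nonzero result is below the smallest normal."""
    return x != 0 and abs(x) < MIN_NORMAL


def tiny_after_rounding(x: Fraction) -> bool:
    """Tiny if the result rounded with unbounded exponent is still below it."""
    return x != 0 and abs(round_unbounded(x)) < MIN_NORMAL


# An exact result just under the smallest normal number: it rounds up to
# MIN_NORMAL, so the two rules disagree.
x = MIN_NORMAL - Fraction(2) ** -152
print(tiny_before_rounding(x), tiny_after_rounding(x))   # True False

# The largest binary32 subnormal: both rules agree that it is tiny.
y = MIN_NORMAL - Fraction(2) ** -149
print(tiny_before_rounding(y), tiny_after_rounding(y))   # True True
```

The two rules only disagree in a narrow band just below the smallest normal number, where the exact result is tiny but rounds up to the smallest normal. That band is where a guest expecting one rule and a host implementing the other would see different underflow behaviour.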