Bit permute double word needs to be optimized after the October 2020 tapeout (see Claire Wolf's rvb_bextdep: https://github.com/riscv/riscv-bitmanip/tree/master/verilog/rvb_bextdep).