I think we should support 128-bit elements symmetrically to 64/32/16/8-bit operations by just having the 128-bit vsx instruction be the base instruction that gets prefixed, this would mean subvl=1 has 128-bit subvectors, subvl=2 has 256-bit subvectors, 3 for 384-bit, and 4 for 512-bit. this is because it doesn't make sense to have an instruction for a 64-bit half of a intrinsically 128-bit operation such as f128 add.
conceptually i like it. especially the symmetry and being able to apply sub-vectoring to 128-bit operations. it's waaay advanced and in the future though :)
(In reply to Luke Kenneth Casson Leighton from comment #1) > conceptually i like it. especially the symmetry and being able to apply > sub-vectoring to 128-bit operations. it's waaay advanced and in the future > though :) maybe not that far in the future, since aes is a 128-bit operation that we will want to support :)