AVX128: vpshuflw/vpshufhw can be improved

Similar to #3784, the current implementation just uses a simple vinselement loop rather than anything smarter, so it can be improved dramatically.
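
For reference, here is a minimal standalone sketch of what the two instructions compute per 128-bit lane, and of one possible alternative to the per-element loop: turning the immediate into a 16-byte index table and doing a single byte permute (the kind of operation a table-lookup instruction provides). This is plain C++ and not project code; `Vec128`, `ShuffleWordsLoop`, `MakeByteIndices`, and `PermuteBytes` are hypothetical names made up for illustration, and whether a table lookup is actually the best lowering here is an assumption, not something measured.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <cstring>

// One 128-bit lane as raw bytes (hypothetical helper type).
using Vec128 = std::array<uint8_t, 16>;

// Reference semantics of vpshuflw (high == false) / vpshufhw (high == true)
// for one lane, written as one element insert per iteration.
Vec128 ShuffleWordsLoop(const Vec128& src, uint8_t imm8, bool high) {
  Vec128 dst = src;                            // the other 64-bit half passes through
  const std::size_t base = high ? 8 : 0;       // byte offset of the shuffled half
  for (std::size_t i = 0; i < 4; ++i) {
    const std::size_t sel = (imm8 >> (2 * i)) & 0b11;  // 2-bit selector for word i
    std::memcpy(&dst[base + 2 * i], &src[base + 2 * sel], 2);
  }
  return dst;
}

// Expand the immediate into 16 byte indices. The immediate is known at
// translation time, so this table could be computed once per instruction.
Vec128 MakeByteIndices(uint8_t imm8, bool high) {
  Vec128 idx;
  for (std::size_t b = 0; b < 16; ++b) {
    idx[b] = static_cast<uint8_t>(b);          // identity for the untouched half
  }
  const std::size_t base = high ? 8 : 0;
  for (std::size_t i = 0; i < 4; ++i) {
    const std::size_t sel = (imm8 >> (2 * i)) & 0b11;
    idx[base + 2 * i] = static_cast<uint8_t>(base + 2 * sel);
    idx[base + 2 * i + 1] = static_cast<uint8_t>(base + 2 * sel + 1);
  }
  return idx;
}

// Scalar stand-in for a single byte-permute (table lookup) operation.
Vec128 PermuteBytes(const Vec128& src, const Vec128& idx) {
  Vec128 dst;
  for (std::size_t b = 0; b < 16; ++b) {
    dst[b] = src[idx[b]];
  }
  return dst;
}
```

`PermuteBytes(src, MakeByteIndices(imm8, high))` produces the same lane as `ShuffleWordsLoop(src, imm8, high)` for every immediate, so the four inserts collapse into one permute plus a constant that depends only on the immediate.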