On Mon, 6 Nov 2023 18:33:46 GMT, Sandhya Viswanathan <sviswanat...@openjdk.org> wrote:
> This is not a masked operation so every lane of dst will be written through > pinsrw/pinsrb. An vpxor before is not required. xor here clears the intermediate vector after each iteration, this is eventually ORs with destination. Checkout line https://github.com/openjdk/jdk/pull/16354/files/86783403c453d329e33d94f787a5709ec35f7099#diff-318d0e76b9a97e8cf8936be1de34e52735c4d947a77cac38babbbf9a081d16fcR1644 https://github.com/openjdk/jdk/pull/16354/files/86783403c453d329e33d94f787a5709ec35f7099#diff-318d0e76b9a97e8cf8936be1de34e52735c4d947a77cac38babbbf9a081d16fcR1651 ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16354#discussion_r1384225640