Hello!
> So what I'm confused about is in the original output template operand
> 0 is duplicated. In the new template operand 1 is duplicated.
>
> Presumably what you're trying to accomplish is avoiding a false read
> on operand 0 (the destination)? Can you please confirm?
> Knowing that shoul
This is the same patch I posted a few days ago, slightly modified following
Uros' recommendation.
The patch fixes false dependencies for the vmovss, vmovsd, vrcpss, vrsqrtss,
vsqrtss and vsqrtsd instructions.
Tested on x86-64/Linux: no new test failures, and some SPEC 2006/2017
performance gains.
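
As a rough illustration of where the false dependency comes from (a sketch
only, not part of the patch): these scalar instructions write just the low
element of the destination and take the remaining elements from another
register, so the result has to wait for that register even when only the
scalar value is wanted. The review comment quoted above notes that the old
output template duplicated operand 0 and the new one duplicates operand 1.
The example below uses the SSE intrinsic _mm_sqrt_ss to make the pass-through
of the upper lanes visible; the helper names sqrt_low_lane and scalar_sqrt are
made up for this illustration.

```c
#include <immintrin.h>

/* Hypothetical helper (not from the patch): square root of the low lane
   only.  The upper three lanes of v are passed through unchanged, so the
   instruction depends on the register that supplies them.  */
__m128 sqrt_low_lane(__m128 v)
{
    return _mm_sqrt_ss(v);
}

/* Hypothetical helper: scalar square root through the same instruction.
   _mm_set_ss zeroes the upper lanes, so only the low lane carries data.  */
float scalar_sqrt(float x)
{
    return _mm_cvtss_f32(_mm_sqrt_ss(_mm_set_ss(x)));
}
```

In scalar code only the low lane matters, so any dependency on the register
that provides the upper lanes is a false one.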
2018-05-04
Please let me know if something is wrong here and should be changed.
--
Alexander Nesterovskiy
falsedep