https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88494

--- Comment #8 from vekumar at gcc dot gnu.org ---
I tested mdbx before and after the revision Richard pointed out.
On my Ryzen box there is a ~4% regression.

Although "vblendvps" is a fast-path instruction and can execute in pipe 0 or 1,
it competes with the vcmpccsd, fma and mul instructions that also execute on
pipe 0 or 1. It looks to me like the regression is due to the added dependency
and port pressure.
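
For illustration, here is a minimal sketch of the kind of scalar select this
tuning affects. The function name and arguments are hypothetical and are not
taken from mdbx; whether GCC actually emits a compare plus blend for it depends
on the GCC version, the optimization level and the -march/-mtune settings.

```c
/* Hypothetical example, not code from mdbx. */
double select_scalar(double a, double b, double x, double y)
{
    /* A scalar conditional like this is a candidate for if-conversion:
       instead of a branch, the compiler may emit a scalar compare that
       produces a mask, followed by a blendv that picks x or y. The blend
       then depends on the compare result and shares pipes with it. */
    return a < b ? x : y;
}
```

Compiling this with something like "gcc -O2 -mavx -S" and comparing the
assembly with and without the tuning is one way to check which form is
generated.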

We need to benchmark with large applications such as SPEC and then decide
whether to enable the X86_TUNE_SCALAR_FLOAT_BLENDV tuning for Ryzen or not. On
BDVER4 no blendvps instructions were generated and no regression was seen.
