https://gcc.gnu.org/bugzilla/show_bug.cgi?id=125876

--- Comment #12 from vekumar at gcc dot gnu.org ---
(In reply to Hongtao Liu from comment #11)
> (In reply to vekumar from comment #9)
> > (In reply to Sarvesh Chandra from comment #8)
> > > Sure Richard, would an RFC to gcc-patches be apt for this issue?
> > 
> > Can u check if latency wise vunpcklpd is better than vmovddup for register
> > variants, based on that you can post patch against this PR.
> 
> According to Agner Fog's table, vmovddup is same as vunpcklpd for thougput
> and latency on Intel platform, but looks like Zen4/zen5 has better throuput
> for vunpcklpd.

Yes, on Zen5 vunpcklpd has better throughput compared to vmovddup.

Reply via email to