[Bug target/94863] Failure to use blendps over mov when possible

2024-04-13 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94863 --- Comment #4 from Andrew Pinski --- (In reply to Gabriel Ravier from comment #1) > Note: The given outputs for LLVM and GCC are when compiling with `-O3 > -msse4.1` I think you have the oppsite meaning with respect to `-msse4.1` here. They

[Bug target/94863] Failure to use blendps over mov when possible

2021-04-25 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94863 Andrew Pinski changed: What|Removed |Added Severity|normal |enhancement

[Bug target/94863] Failure to use blendps over mov when possible

2020-04-30 Thread gabravier at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94863 --- Comment #3 from Gabriel Ravier --- For binary size, the `movsd` takes 4 bytes and the `blendps` takes 6 bytes The port allocations for the instructions are as such (same formatting as for the throughputs) : Wolfdale: p5, p015 Nehalem: p5,

[Bug target/94863] Failure to use blendps over mov when possible

2020-04-30 Thread rguenth at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94863 --- Comment #2 from Richard Biener --- throughputs put aside - how's port allocation and latency figures? That said, GCC usually sides on the smaller insn encoding variant when latency isn't different - we're usually not looking at throughput