https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94863
--- Comment #4 from Andrew Pinski ---
(In reply to Gabriel Ravier from comment #1)
> Note: The given outputs for LLVM and GCC are when compiling with `-O3
> -msse4.1`
I think you have the opposite meaning with respect to `-msse4.1` here. They
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94863
Andrew Pinski changed:
What    |Removed |Added
Severity|normal  |enhancement
--- Comment #3 from Gabriel Ravier ---
For binary size, the `movsd` takes 4 bytes and the `blendps` takes 6 bytes.
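As a rough check of those sizes, the sketch below counts the bytes of hand-assembled register-to-register encodings. The operands (`xmm0`, `xmm1`) and the `0x03` blend mask are assumptions picked for illustration, not taken from the compiler output in this report; the helper name `encoded_size` is likewise hypothetical.

```python
# Hand-assembled encodings for one illustrative register pair.
# Assumption: operands are xmm0 (destination) and xmm1 (source); the
# blend mask 0x03 selects the low two floats (low 64 bits) from the source.

# movsd xmm0, xmm1  ->  F2 0F 10 /r, ModRM = 0xC1
MOVSD_XMM0_XMM1 = bytes([0xF2, 0x0F, 0x10, 0xC1])

# blendps xmm0, xmm1, 3  ->  66 0F 3A 0C /r ib, ModRM = 0xC1, imm8 = 0x03
BLENDPS_XMM0_XMM1_3 = bytes([0x66, 0x0F, 0x3A, 0x0C, 0xC1, 0x03])


def encoded_size(encoding: bytes) -> int:
    """Return the instruction length in bytes."""
    return len(encoding)


if __name__ == "__main__":
    for name, enc in (
        ("movsd xmm0, xmm1", MOVSD_XMM0_XMM1),
        ("blendps xmm0, xmm1, 3", BLENDPS_XMM0_XMM1_3),
    ):
        print(f"{name}: {encoded_size(enc)} bytes ({enc.hex()})")
```

The two extra bytes of `blendps` come from the three-byte opcode (`0F 3A 0C`) and the trailing immediate. Operands that need a REX prefix (for example `xmm8` and up) would add a byte to either form.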
The port allocations for the instructions are as follows (same formatting as for
the throughputs):
Wolfdale: p5, p015
Nehalem: p5,
--- Comment #2 from Richard Biener ---
Throughputs aside, how are the port allocation and latency figures? That said,
GCC usually sides with the smaller insn encoding variant when latency isn't
different; we're usually not looking at throughput