29/09/2025 18:28, Shreesh Adiga:
> On Wed, Sep 24, 2025 at 8:28 PM Thomas Monjalon <[email protected]> wrote:
>
> > Hello,
> >
> > 16/07/2025 12:34, Shreesh Adiga:
> > > Replace the clearing of lower 32 bits of XMM register with blend of
> > > zero register.
> > > Replace the clearing of upper 64 bits of XMM register with
> > > _mm_move_epi64.
> > > Clang is able to optimize away the AND + memory operand with the
> > > above sequence, however GCC is still emitting the code for AND with
> > > memory operands which is being explicitly eliminated here.
> > >
> > > Additionally replace the 48 byte crc_xmm_shift_tab with the contents of
> > > shf_table which is 32 bytes, achieving the same functionality.
> > >
> > > Signed-off-by: Shreesh Adiga <[email protected]>
> >
> > Sorry I'm not following.
> > Please could you start with defining the goal of this patch?
> > Is it a code simplification or a performance optimization?
>
> It is intended to be a minor performance optimization.

Please could you give some performance numbers in the commit log?

