On Mon, Aug 26, 2024 at 06:54:58PM +0000, Amonson, Paul D wrote: >> And this still shows the ~14% regression in your original post? > > At the small buffer sizes the margin of error or "noise" is larger, > 7-11%. My average could be just bad luck. It will take me a while to > re-setup for full data collection runs but I can try it again if you > like.
IMHO that would be useful to establish the current state of the patch set from a performance standpoint, especially since you've added code intended to mitigate the regression. +#define COMP_CRC32C_SMALL(crc, data, len) \ + ((crc) = pg_comp_crc32c_sse42((crc), (data), (len))) My interpretation of Andres's upthread suggestion is that we'd add the length check within the macro instead of introducing a separate one. We'd expect the compiler to optimize out comparisons for small lengths known at compile time and always call the existing implementation (which may still involve a function pointer in most cases). -- nathan