Hi John, 

> Further, we verified upthread that Intel's current and near-future product
> line includes server chips (some with over 100 cores, so not exactly low-end)
> that don't support AVX-512 at all. I have no idea how common they will be, but
> they will certainly be found in cloud datacenters somewhere. Shouldn't we have
> an answer for them as well?

I just submitted a patch that improves the SSE4.2 version using the source you
referenced. See:
 
https://www.postgresql.org/message-id/PH8PR11MB82869FF741DFA4E9A029FF13FBF72%40PH8PR11MB8286.namprd11.prod.outlook.com
 
> I know you had extended time off work, but I've already shared my findings and
> explained my reasoning [2]. The title of the paper is "Fast CRC Computation for
> iSCSI Polynomial Using CRC32 Instruction", so unsurprisingly it does improve
> the SSE42 version. With a few dozen lines of code, I can get ~3x speedup on
> page-sized inputs. At the very least we want to use this technique on Arm [3],
> and the only blocker now is the question regarding the patents. I'm interested
> to hear the response on this.

I'm still figuring out the patent question and will respond as soon as I can.

Thanks,
Raghuveer
