On Tuesday, 25 April 2017 at 09:09:00 UTC, Ola Fosheim Grøstad wrote:
On Monday, 24 April 2017 at 17:48:50 UTC, Stefan Koch wrote:
[...]
Oh, ok. AFAIK the decoding of indexing modes into micro-ops
(the real instructions the CPU executes internally, not the
architectural op-codes) has no effect on the caching system.
Picking compact addressing modes may, however, shrink the
generated code, which reduces instruction-cache pressure and
speeds up the decoding of op-codes into micro-ops.
If you want to improve cache loads you have to consider when to
use the "prefetch" instructions, but the effect (positive or
negative) varies greatly between CPU generations, so you
basically need to target each CPU generation individually.
That's probably too much work to be worthwhile: it usually
doesn't pay off until you work on large datasets, and then you
usually have to be careful about partitioning the data into
cache-friendly working sets. Probably not so easy to do from a
JIT.
You'll probably get a decent performance boost without worrying
too much about caching in the first implementation anyway. Any
gains in that area could be obliterated by the next CPU
generation... :-/
It's already the case. Intel and AMD (especially for Ryzen)
have strongly discouraged the use of prefetch instructions
since at least Core 2 and Athlon 64. The icache cost rarely
pays off, and the hints very often break the hardware
auto-prefetcher by wasting memory bandwidth.