If you show me an example I'd be obliged.

Henry Rich

On 4/9/2022 10:19 PM, Elijah Stone wrote:
Duff's device is a technique for loop unrolling which involves jumping into the middle of the unrolled loop.  Overlapping access is an alternative technique, also useful for loop unrolling (as well as for arranging aligned access), which does a small amount of redundant work but saves branches.  Though overlapping access is slightly less general, it tends to perform better on modern hardware.
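
To make the contrast concrete, here is a minimal sketch in C of both techniques applied to a plain byte copy. It is an illustration only, not code taken from JE: the function names copy_duff and copy_overlap and the block size of 8 are assumptions chosen for the example, and the overlapping version assumes the source and destination buffers do not overlap each other.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Duff's device: copy n bytes with the loop unrolled 8x.
   The switch jumps into the middle of the unrolled body to
   handle the n % 8 leftover bytes on the first pass. */
void copy_duff(unsigned char *dst, const unsigned char *src, size_t n)
{
    if (n == 0) return;
    size_t passes = (n + 7) / 8;          /* total trips through the body */
    switch (n % 8) {
    case 0: do { *dst++ = *src++;
    case 7:      *dst++ = *src++;
    case 6:      *dst++ = *src++;
    case 5:      *dst++ = *src++;
    case 4:      *dst++ = *src++;
    case 3:      *dst++ = *src++;
    case 2:      *dst++ = *src++;
    case 1:      *dst++ = *src++;
            } while (--passes > 0);
    }
}

/* Overlapping access: copy n bytes in whole 8-byte blocks.
   The last block is anchored to the END of the buffer, so it may
   re-copy up to 7 bytes that an earlier block already wrote.
   That redundant work replaces the remainder-handling branches.
   Less general than the above: requires n >= 8 (hypothetical
   precondition for this sketch) and non-overlapping src/dst. */
void copy_overlap(unsigned char *dst, const unsigned char *src, size_t n)
{
    assert(n >= 8);
    for (size_t i = 0; i + 8 < n; i += 8)
        memcpy(dst + i, src + i, 8);      /* full blocks from the front */
    memcpy(dst + n - 8, src + n - 8, 8);  /* final block, may overlap */
}
```

With n = 13, for example, copy_overlap copies bytes 0..7 and then bytes 5..12, so bytes 5..7 are written twice. A compiler will typically turn each fixed-size 8-byte memcpy into a single load and store, though that depends on the compiler and target.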

On Sat, 9 Apr 2022, Henry Rich wrote:

I don't understand the last sentence. 'duff'? 'overlapping access'?

Henry Rich

On 4/9/2022 9:37 PM, Elijah Stone wrote:
On Sat, 9 Apr 2022, Henry Rich wrote:

JE generally does few data-dependent branches, and I expect it would not be a good idea to use two hyperthreads in one core; but you'll have to make that decision.

Branch miss is tens of cycles; cache miss is hundreds.

Re branches: from what I've seen, there is too much duff and not enough overlapping access.  Though I guess those are metadata-dependent branches, not data-dependent branches :)
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
