Re: First time using Parallel

max haughton via Digitalmars-d-learn Sun, 26 Dec 2021 03:17:05 -0800

On Sunday, 26 December 2021 at 06:10:03 UTC, Era Scarecrow wrote:

This is curious. I was up for trying to parallelize my code,specifically having a block of code calculate some polynomials(*Related to Reed Solomon stuff*). So I cracked openstd.parallel and looked over how I would manage this all.
To my surprise I found ParallelForEach, which gives theexample of:
```d
foreach(value; taskPool.parallel(range) ){code}
```
Since my code doesn't require any memory management, sharedresources or race conditions (*other than stdout*), I pluggedin an iota and gave it a go. To my amazement no compilingissues, and all my cores are in heavy use and it's outputtingresults!
Now said results are out of order (*and early results aregarbage from stdout*), but I'd included a bitwidth comment sosorting should be easy.
```d
        0x3,    /*7*/
        0x11,   /*9*/
        0x9,    /*10*/
        0x1D,   /*8*/
        0x5,    /*11*/
        0x3,    /*15*/
        0x53,   /*12*/
        0x1B,   /*13*/
        0x2B,   /*14*/
```
etc etc.
Previously years ago I remember having to make a struct andthen having to pass a function and a bunch of stuff from withinthe struct, often breaking and being hard to get to even workso I didn't hardly touch this stuff. This is making outputtingdata MUCH faster and so easily; Well at least on a beefycomputer and not just some chromebook I'm programming on so itcan all be on the go.
So I suppose, is there anything I need to know? About sharedresources or how to wait until all threads are done?

Parallel programming is one of the deepest rabbit holes you canactually get to use in practice. Your question at the momentdoesn't really have much context to it so it's difficult tosuggest where you should go directly.

I would start by removing the use of stdout in your loop kernel -I'm not familiar with what you are calculating, but if you canbasically have the (parallel) loop operate from (say) one arraydirectly into another then you can get extremely good parallelscaling with almost no effort.

Not using in the actual loop should make the code faster evenwithout threads because having a function call in the hot codewill mean compilers optimizer will give up on certaintransformations - i.e. do all the work as compactly as possiblethen output the data in one step at the end.

Re: First time using Parallel

Reply via email to