On Wednesday, 13 May 2015 at 09:01:05 UTC, Gerald Jansen wrote:
On Wednesday, 13 May 2015 at 03:19:17 UTC, thedeemon wrote:
In the case of Python's parallel.Pool(), separate processes do the
work without any synchronization issues. In the case of D's
std.parallelism it's just threads inside one process, and they
do fight for some locks, hence this result.
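For context, the thread-based model being described there looks roughly
like this minimal sketch (the job list and per-job work are invented
placeholders):

import std.parallelism : parallel;
import std.stdio : writeln;

void processJob(string job)
{
    writeln("processing ", job);   // stand-in for the real per-job work
}

void main()
{
    auto jobs = ["job1", "job2"];  // invented names
    // Worker threads inside one process: they share the GC and its locks.
    foreach (job; parallel(jobs))
        processJob(job);
}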
Okay, so to do something equivalent I would need to use
std.process. My next question is how to pass the common data to
the sub-processes. In the Python approach I guess this is
handled automatically by pickle serialization. Is there
something similar in D? Alternatively, would using std.mmfile
to temporarily store the common data be a reasonable approach?
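To make the std.mmfile idea concrete, here is a rough, untested sketch:
the parent writes the common data into a memory-mapped file once and
launches each worker as a separate process that maps the same file
read-only. The file name, the data layout, and the --worker flag are all
made up for illustration.

import std.mmfile : MmFile;
import std.process : spawnProcess, wait;
import std.stdio : writeln;

void main(string[] args)
{
    enum commonFile = "common.dat";        // hypothetical shared-data file

    if (args.length > 1 && args[1] == "--worker")
    {
        // Worker: map the common data read-only and use it.
        auto mm = new MmFile(commonFile);  // read mode by default
        auto data = cast(ubyte[]) mm[];
        writeln("worker sees ", data.length, " bytes");
        return;
    }

    // Parent: create the file and fill it with the common data.
    ubyte[] common = [1, 2, 3, 4];         // stand-in for the real data
    auto mm = new MmFile(commonFile, MmFile.Mode.readWriteNew,
                         common.length, null);
    (cast(ubyte[]) mm[])[] = common[];
    destroy(mm);                           // unmap/flush before workers start

    // Launch the workers as separate processes, one per job.
    auto pids = [spawnProcess([args[0], "--worker"]),
                 spawnProcess([args[0], "--worker"])];
    foreach (pid; pids) wait(pid);
}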
Assuming you're on a POSIX-compliant platform, you would just
take advantage of fork()'s memory model (each child inherits the
parent's data copy-on-write) and pipes - i.e., read the data, then
fork in a loop to process it, then use pipes to communicate the
results back. It ran about 3x faster for me doing this, and it
obviously scales with the number of workloads you have (the provided
data only seems to have 2). If you could provide a larger dataset and
the Python implementation, that would be great.
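Roughly what I have in mind, as an untested sketch - the job list, the
"common data" and the per-job work are placeholders, and real code should
check the return values of the POSIX calls:

import core.sys.posix.sys.types : pid_t;
import core.sys.posix.sys.wait : waitpid;
import core.sys.posix.unistd : fork, pipe, read, write, close, _exit;
import std.stdio : writeln;

void main()
{
    string[] jobs = ["job1", "job2"];       // stand-in for the real workloads
    double[] commonData = [1.0, 2.0, 3.0];  // read once in the parent

    auto pipes = new int[2][](jobs.length);
    pid_t[] kids;

    foreach (i, job; jobs)
    {
        pipe(pipes[i]);
        auto pid = fork();
        if (pid == 0)
        {
            // Child: inherits commonData copy-on-write, does its work,
            // sends one double back through the pipe, then exits without
            // running the parent's D runtime shutdown.
            close(pipes[i][0]);
            double result = commonData[0] * (i + 1);   // placeholder work
            write(pipes[i][1], &result, result.sizeof);
            close(pipes[i][1]);
            _exit(0);
        }
        close(pipes[i][1]);     // parent keeps only the read end
        kids ~= pid;
    }

    foreach (i, pid; kids)
    {
        double result;
        read(pipes[i][0], &result, result.sizeof);
        close(pipes[i][0]);
        int status;
        waitpid(pid, &status, 0);
        writeln(jobs[i], " -> ", result);
    }
}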
I'm actually surprised and disappointed that there isn't a
fork()-backend to std.process OR std.parallelism. You have to use
stdc