On Thursday, 14 May 2015 at 16:33:46 UTC, John Colvin wrote:
On Wednesday, 13 May 2015 at 20:34:24 UTC, weaselcat wrote:
On Wednesday, 13 May 2015 at 20:28:02 UTC, Laeeth Isharc wrote:
Is there value to having equivalents to the std.parallelism approach that works with processes rather than threads, and makes it easy to manage tasks over multiple machines?

I'm not sure if you're asking because of this thread, but see

http://forum.dlang.org/thread/[email protected]#post-tczkndtepnvppggzmews:40forum.dlang.org

It's about python outperforming D because it doesn't have to deal with synchronization headaches. I found D to be way faster when reimplemented with fork, but having to use the stdc API directly is ugly (IMO).
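For illustration, here is a minimal sketch of what a fork-based rewrite tends to look like in D. `parallelForkEach` is a made-up helper name, not anything from the standard library; underneath it is still the raw POSIX calls from `core.sys.posix`, which is exactly the ugliness being complained about:

```d
import core.sys.posix.sys.types : pid_t;
import core.sys.posix.unistd : fork;
import core.sys.posix.sys.wait : waitpid;
import core.stdc.stdlib : exit;

/// Hypothetical helper: run `work(i)` for each chunk index in its own
/// child process, then wait for all children to finish.
void parallelForkEach(alias work)(size_t nChunks)
{
    pid_t[] children;
    foreach (i; 0 .. nChunks)
    {
        immutable pid = fork();
        if (pid == 0)
        {
            work(i);  // child runs its share of the work...
            exit(0);  // ...and must not fall back into the parent's loop
        }
        children ~= pid;
    }
    int status;
    foreach (pid; children)
        waitpid(pid, &status, 0);  // reap every child
}
```

Note that each child has its own address space, so results have to come back via files, pipes, or shared memory; this sketch only shows the process management, which is the part a wrapper would hide.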

It was also easy to get D very fast by just being a little more eager with IO and reducing the enormous number of little allocations being made.
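The allocation point is easy to see with `File.byLine`, which reuses one internal buffer per iteration instead of allocating a fresh string for every line. A small sketch (`countLines` is just an illustrative helper, not from the rewrite in question):

```d
import std.stdio : File;

/// Scan a file line by line with essentially no per-line GC allocation:
/// byLine hands back a slice of one reused internal buffer each iteration.
/// Call .idup only on the rare line that must outlive the loop.
size_t countLines(string path)
{
    size_t n;
    foreach (line; File(path).byLine)  // buffer reused every iteration
        ++n;
    return n;
}
```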

Yes - thank you for your highly educational rewrite, which I very much appreciate you taking the trouble to do. Perhaps this should be turned (by you or someone else) into a mini case study on the wiki of how to write idiomatic and efficient D code. Or maybe just put up the slides from your forthcoming talk (which I look forward to watching when it's up).

It's good to know D can in fact deliver on the implicit promise in a real use case without too much work. (Yes, naively written code was a bit slow when dealing with millions of lines, but in which language of comparable flexibility would that not be true?) It's also interesting that your code was idiomatic. (I was reading up on Scala, which seems beautiful in many ways, but it is terribly disturbing that the idiomatic way often seems to be the least efficient, at least as things stood a couple of years ago.)

But, even so, I think having a wrapper for fork and an API for multiprocessing (which you could then hook up to, e.g., the Digital Ocean and AWS APIs) would be rather helpful.
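A local-only version of such a multiprocessing API can already be sketched with `std.process`, launching one worker process per task and waiting for all of them. Everything here is hypothetical: `runAll` is an invented name, and `worker` stands in for whatever external binary does the per-security number crunching:

```d
import std.process : spawnProcess, wait, Pid;

/// Hypothetical multiprocessing wrapper: spawn one worker process per
/// task argument (e.g. one per security symbol), then wait for all.
/// `worker` is an assumed external program; substitute your own binary.
void runAll(string worker, string[] tasks)
{
    Pid[] pids;
    foreach (task; tasks)
        pids ~= spawnProcess([worker, task]);  // launch without blocking
    foreach (pid; pids)
        wait(pid);  // block until every worker has exited
}
```

Hooking the same interface up to cloud instances instead of local processes is then "only" a matter of replacing `spawnProcess` with an API call that provisions a machine and runs the worker there.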

I spoke with a friend of mine at one of the most admired/hated Wall Street firms - one of the smartest quants I know, who has now moved to portfolio management. He was doing a study on tick data going back to 2000. I asked him how long it took to run on his firm's infrastructure. An hour! And the operations were pretty simple. I think it should only take a couple of minutes. And it would be nice to show an example of - from a spreadsheet - spinning up 100 Digital Ocean instances, running the numbers not just on one security but on every relevant security, and having a nice summary appear back in the sheet within a couple of minutes.

The reason speed matters is that long waits interfere with rapid iteration and the creative thought process. In a market environment you may well have forgotten what you wanted after an hour...


Laeeth.
