Re: dmd-concurrency

Shammah Chancellor Sun, 24 Nov 2013 00:16:36 -0800

On 2013-11-20 07:34:36 +0000, Chris Williams said:

On Wednesday, 20 November 2013 at 04:24:14 UTC, Daniel Murphy wrote:
This is the correct forum to post phobos proposals on.
Well then, here's what I had written:
A few applications I've considered implementing seem like they would beeasier if there was a channel-based messaging system instd.concurrency. I'm happy to do this implementation, but I thought Iwould try to get some sort of sign-off before doing so. Following, Iwill lay out my argument for the addition, and then the API that I amconsidering.
---
One fairly common task is thread-pooling. With the standardsend/receive model currently implemented, you have to choose a specificthread to target when you send a task. While it's true that you cansimply iterate through your list of threads over and over, to spreadthe load evenly over them, that presumes that all tasks take evenprocessing time. It makes more sense to be able to push data into ashared channel (secretly a work queue), and the first thread thatfinishes its previous task will be able to immediately pull the taskbefore everyone else. This also means that the necessity of passingaround references to your threads so that they can be looped over goesaway.
I haven't tested it, but it looks like this sort of thing might bequasi-possible using the register/unregister/locate methods. As eachthread starts, it can register itself with a named group (i.e.channel), and then anyone who wants to send an item to an arbitrarythread in that group can call locate() to retrieve one thread and callsend() against the Tid. The target thread would then need to unregisteritself while it is doing work, then re-register itself. My complaintagainst this is the need to unregister and re-register. If the threadissuing commands sends a large number of tasks all at once, they willall go to the same thread (if coded poorly) or the caller will need touse yield() or sleep() to allow the target thread to receive the taskand unregister, so that locate() can find a different thread. That'snot terribly efficient. I am also concerned that there's the chancethat all threads will be unregistered when we call locate(), whereas achanneling system would be able to expand the mailbox during the timesthat all threads are busy.
The actual implementation within concurrency.d also concerns me as (ifI read it correctly), the most recent item to register() will be theone which locate() finds, rather than the thread which has beenregistered the longest. While I suppose it's probably not too large ofan issue if the same two threads keep taking all the tasks - that meansthat your load can't exceed two threads worth of processing power - itstill seems like a LIFO system would be better. The registry is alsobased on an array rather than a set, which can make removal an O(n)operation, if the contents of the registry have to be shifted left, tofill an empty spot.
Overall, I think that adding a shared message box system would be astraightforward way to improve the handling of thread pooling via theactor model.
---
A less common use-case but I was also considering some world-simulators(e.g. for studying economics or building a game map) and here theability to broadcast messages to a large set of other actors, based onlocation, interest, etc. seems useful. In this case, messages wouldneed to be copied out to each subscriber in the channel rather thanhaving an existence as a point to point connection. For a networkedgame, most likely you would want to break each channel into two, wherelocally all senders on a channel push to a single listener that pipesthe messages over the network, and then remotely the messages would bebroadcast to many listeners again, but that's a reasonablystraightforward task for someone to implement on top of the channelfunctionality. I don't think that such functionality is needed inPhobos itself. Mostly, the presence of the broadcasting functionalityin the standard library allows them to use the easy and safe actormodel for more creative uses than a straight one-to-one pipe.
---
Overall, my hope would be to develop something that is conceptually nomore difficult to deal with than the current send()/receive() model,but also able to be used in a wide variety of ways. The API that Iwould propose to develop is:
interface Channel {
        void send(T...)(T vals);
        void prioritySend(T...)(T vals);
        void receive(T...)(out Tid sender, T ops);
        receiveOnlyRet!(T) receiveOnly(T...)();
        bool receiveTimeout(T...)(Duration d, T ops);

        void setMaxMailboxSize(Tid tid, size_t messages, OnCrowding doThis);
void setMaxMailboxSize(Tid tid, size_t messages, bool function(Tid)doThisFunc);
}
class SingleChannel : Channel {} // Send inserts a message into ashared message box. Receive removes message
class DuplicateChannel(bool echo = true) : Channel {} // Send insertsthe message into a message box per-recipient. Receive removes messagein the calling thread's channel message box. If echo is false, messageswill not be sent back to the sender, even if they are a registeredlistener
void registerSend(Channel c, Tid tid = thisTid); // used by functionsendAll(). Channel can be of either type
void unregisterSend(Channel c, Tid tid = thisTid);
void registerReceive(Channel c, Tid tid = thisTid); // used by functionreceiveAll(). Channel can be of either type
void unregisterReceive(Channel c, Tid tid = thisTid);
void sendAll(T...)(T ops); // Sends a copy of message to all channelsthis thread has registered for.void receiveAll(T...)(out Channel c, out Tid sender, T ops); //Receives a message of type T from any channel that we are registeredfor. Returns channel and sender
I believe that the look and feel stays fairly consistent with thecurrent set of functions in std.concurrency. I've added the ability forthe recipient to infer information about the sender since, in theduplication model, I believe there are quite a few cases where thiswould be important information. And of course, I've added the option toregister/unregister threads other than ourselves to allow a greaterrange of code layouts, though it's possible that the lack of this sortof thing in the original code is due to some sort of safety concern?
The most straightforward way to implement the DuplicateChannel would beto use the individual threads' message boxes, but this would mean thatdata put into a channel could be pulled out via the traditionalreceive() method. Currently, my intention would be to partition thesetwo systems (the direct send()/receive() model and the channel model),unless anyone has any reason to think they should be merged into asingle whole?
Those are my thoughts, anyways. Comments? Complaints?

How does one receive from multiple channels out-of-order? I wouldrather this sent it to the subscribed Tid via send, rather than havingan additional queue. It could possible send a ChannelMessage whichhas a reference to the sending channel and the message. I understandthis is a different model than what Go and whatnot use, but I thinkit's more pratical in some circumstances. Maybe both ways would begood? I personally use this method in my vibe-d server.

Re: dmd-concurrency

Reply via email to