On 25.03.2011 02:17, dsimcha wrote:
On 3/24/2011 9:05 PM, Sönke Ludwig wrote:
This may not be an issue in the std.parallelism design. A TaskPool task
can safely wait on other tasks. What prevents this from causing a
deadlock is that calling yieldForce, spinForce, or waitForce on a task
that has not started executing yet will execute the task immediately in
the thread that tried to force it, regardless of where it is in the
queue.

Indeed, this pattern solves the problem of waiting for the
completion of a specific task. It also avoids the huge deadlock
potential that a general yield(), one that does not take a task,
would have. However, it does not solve the general problem of one
task waiting for another, for example via a condition variable or
just a mutex that is acquired in the middle of the task execution.
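
To illustrate the pattern that does work, here is a minimal sketch (assuming the task/taskPool/yieldForce API as described above):

    import std.parallelism;

    int heavyComputation() { return 21; }

    int outerTask()
    {
        auto inner = task!heavyComputation();
        taskPool.put(inner);
        // Safe even when running inside a pool thread: if `inner` has
        // not started yet, yieldForce executes it inline in this
        // thread instead of blocking on the queue.
        return 2 * inner.yieldForce;
    }

    void main()
    {
        auto outer = task!outerTask();
        taskPool.put(outer);
        assert(outer.yieldForce == 42);
    }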


Can you elaborate and/or provide an example of the "general" problem?
I'm not quite sure what you're getting at.

I have one very specific scenario that I can only sketch here: Suppose you have some kind of complex computation going on in the TaskPool. This computation is done by a large set of tasks, where each task depends on the results of one or more other tasks. One task is responsible for coordinating the work: it spawns tasks and waits for their completion, so that it can spawn new tasks for which the inputs have now become available.

The first thing here is that you do not want the waitForce() kind of waiting in the coordinator task, because it might keep the coordinator busy executing an expensive task inline while it could already be spawning new tasks, since in the meantime some other tasks may have finished.

However, if you instead wait on a condition variable (which is signaled after each finished task), and if several computations of this kind can run in parallel, you can immediately run into the situation where the thread pool is crowded with coordinator tasks that are all waiting on their condition variables, which will never be signaled because no worker tasks can be executed.
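
In code, the deadlock looks roughly like this (a hypothetical sketch, the names are made up; only the structure matters):

    import core.sync.condition, core.sync.mutex;
    import std.parallelism;

    void worker(Mutex m, Condition done)
    {
        // ... compute one partial result ...
        synchronized (m) done.notify();
    }

    void coordinator(TaskPool pool, Mutex m, Condition done)
    {
        foreach (i; 0 .. 4)
            pool.put(task!worker(m, done));
        synchronized (m)
            done.wait(); // blocks this pool thread, executes nothing
        // ... spawn follow-up tasks that depend on the results ...
    }

    // If as many coordinator tasks as there are pool threads start
    // concurrently, every pool thread blocks in wait() and no worker
    // task can ever run to signal the condition variables.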

This is only one example; fundamentally, the problem can arise whenever one task depends on another task through some form of waiting that does not execute the dependency the way waitForce() does.

I also have a completely different example involving the main thread (doing GPU work) which is much more difficult, but I don't think I can make that clear with text alone.


Can you elaborate on this? The whole point of executeInNewThread() was
supposed to be that a TaskPool is not needed for simple cases.

Well, OK, if the only purpose is to provide a shortcut for
(new Thread(&fun)).start, then my suggestion may not make much
sense. However, I have some doubts that the advantage of having
this shortcut justifies the functional duplication of core.thread.
Is there some specific use case where you would use a Task object
but not a TaskPool?

executeInNewThread() is useful where you only have a few Tasks, rather
than needing to map a large number of Tasks onto a small number of
threads. Using core.thread here doesn't cut it, because core.thread
doesn't automate passing the function's arguments to the new thread.
Also, I figured out that some of the conditions for @safe tasks can be
relaxed a little if they run in a dedicated thread instead of a TaskPool.

I see.
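
Something like the following (a sketch assuming the documented Task API) would indeed be clumsier with a bare core.thread Thread, which takes no arguments:

    import std.parallelism;

    void fetch(string url, int retries)
    {
        // ... download url, retrying up to `retries` times ...
    }

    void main()
    {
        // The Task carries its arguments into the new thread; with
        // core.thread one would need a closure or shared variables.
        auto t = task!fetch("http://example.com/data", 3);
        t.executeInNewThread();
        // ... do other work in the main thread ...
        t.yieldForce;
    }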



But what I wanted to say is: even if it may be difficult to
implement such thread caching now, putting the means to execute a
Task in its own thread into the TaskPool now allows for such an
optimization later (it could even coexist with
Task.executeInNewThread()).

I can't really comment because I still don't understand this very well.

I hope I have made it a little clearer what I mean. The problem is just that the system I'm talking about is quite complex, and it is not easy to find good, simple examples in it. The problems, of course, arise only in the most complex paths of execution.

What I'm not sure about is whether executeInNewThread() is supposed to be useful because it is sometimes nice to have the fine-grained preemptive scheduling of the OS as opposed to task granularity, or whether the advantage is supposed to be the efficiency gained because no thread pool has to be created. In the latter case, caching some threads for reuse by an executeInOwnThread() method should lead to better performance in almost every case where thread creation overhead is relevant.
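
Roughly, the kind of caching I have in mind (a hypothetical sketch, not part of std.parallelism; executeInOwnThread is a made-up name):

    import core.sync.condition, core.sync.mutex;
    import core.thread;

    // A fixed set of long-lived threads, each picking up one job at a
    // time, so repeated executeInOwnThread() calls avoid the thread
    // creation overhead.
    class ThreadCache
    {
        private Mutex m;
        private Condition c;
        private void delegate()[] jobs;
        private bool closed;

        this(size_t nThreads)
        {
            m = new Mutex;
            c = new Condition(m);
            foreach (i; 0 .. nThreads)
                new Thread(&run).start();
        }

        void executeInOwnThread(void delegate() job)
        {
            synchronized (m) { jobs ~= job; c.notify(); }
        }

        void shutdown()
        {
            synchronized (m) { closed = true; c.notifyAll(); }
        }

        private void run()
        {
            for (;;)
            {
                void delegate() job;
                synchronized (m)
                {
                    while (!jobs.length && !closed)
                        c.wait();
                    if (!jobs.length)
                        return;
                    job = jobs[0];
                    jobs = jobs[1 .. $];
                }
                job();
            }
        }
    }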

.. OK, my writing skills are degrading rapidly as I fight against my closing eyes.. I will go to sleep now
