On Mon, 17 May 2010 17:20:28 +0100, Dave Whipp - d...@dave.whipp.name <+nntp+browseruk+a2ac8a2dcb.dpuu#dave.whipp.n...@spamgourmet.com> wrote:

nigelsande...@btconnect.com wrote:

There are very few algorithms that actually benefit from using even low hundreds of threads, let alone thousands. The ability of Erlang (and Go and Io and many others) to spawn 100,000 threads makes an impressive demo for the uninitiated, but finding practical uses of such abilities is very hard.

It may be true that there are only a small number of basic algorithms that benefit from massive parallelization.

There is a distinct difference between "massive parallelisation" and "thousands of threads".

The important thing is not the number of algorithms: it's the number of programs and workloads.

From that statement, you do not appear to understand the subject matter of this thread: the Perl 6 concurrency model.

Even if there were only one parallel algorithm, if that algorithm was needed for the majority of parallel workloads then it would be significant.

In fact, though utilizing thousands of threads may be hard, once you get to millions of threads then things become interesting again. Physical simulations, image processing, search, finance, etc., are all fields that exhibit workloads amenable to large scale parallelization.

Again, "large scale parallelisation" does not equate to "millions of threads".

For CPU-bound processes, there is no benefit in trying to utilise more than one thread per core--or per hardware thread, if your cores have hyper-threading. Context switches are expensive, and running hundreds (let alone thousands or millions) of threads on 2/4/8/12-core commodity hardware means that you'll spend more time context switching than doing actual work, with the net result of less, rather than more, throughput.
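To make that concrete, here's a minimal sketch in Perl 5 (core threads plus Thread::Queue) of the pattern I'm advocating: a worker pool sized to the hardware, fed an arbitrary number of work items through a queue. The 4-core count and the fake_work() routine are placeholders for the example, nothing more:

    use strict;
    use warnings;
    use threads;
    use Thread::Queue;

    my $CORES = 4;                                   # placeholder; detect at runtime in real code

    my $work = Thread::Queue->new( 1 .. 10_000 );    # many items...
    $work->enqueue( (undef) x $CORES );              # ...plus one end-marker per worker

    sub fake_work { my $n = shift; return $n % 7 }   # stand-in for a CPU-bound task

    my @pool = map {
        threads->create( sub {
            my $tally = 0;
            while ( defined( my $item = $work->dequeue ) ) {
                $tally += fake_work( $item );
            }
            return $tally;
        } );
    } 1 .. $CORES;                                   # one thread per core, not one per item

    my $total = 0;
    $total += $_->join for @pool;
    print "total: $total\n";

Ten thousand work items, but only four threads contending for four cores.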

Sure, there is exotic hardware with thousands of cores, but it hardly seems likely that, having spent $millions on such hardware to run your massively parallel algorithms and solve problems in realistic time frames, you're going to write your solutions in a dynamic (non-compiled) language.

And for IO-bound processes, asynchronous IO scales far better than throwing threads at the problem.
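By way of illustration, here's a single-threaded echo server multiplexing any number of clients with the core IO::Select module (the port number is arbitrary):

    use strict;
    use warnings;
    use IO::Socket::INET;
    use IO::Select;

    my $listener = IO::Socket::INET->new(
        LocalPort => 8080,                           # arbitrary port for the example
        Listen    => 128,
        ReuseAddr => 1,
    ) or die "listen: $!";

    my $watch = IO::Select->new( $listener );

    while ( my @ready = $watch->can_read ) {
        for my $fh ( @ready ) {
            if ( $fh == $listener ) {
                $watch->add( scalar $listener->accept );    # new client; no new thread
            }
            else {
                my $read = sysread( $fh, my $buf, 4096 );
                if    ( $read ) { syswrite( $fh, $buf ) }   # echo it straight back
                else            { $watch->remove( $fh ); close $fh }
            }
        }
    }

One thread, one select loop, and the OS does the waiting; a thread-per-connection design pays for a stack and a context switch for every idle client.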


Pure SIMD (vectorization) is insufficient for many of these workloads: programmers really do need to think in terms of threads (most likely mapped to OpenCL or CUDA under the hood).

If you care to review the thread, you'll find that I'm the pro-(kernel)threading candidate in this debate.

Sure, OpenCL goes far beyond just throwing SIMD workloads at the local GPU, extending out to heterogeneous clusters and other massively parallel hardware setups. But I think it is beyond the scope of Perl 6 to cater to such systems as a core requirement. That is far better left to external modules written and tested by those with the need for, and experience of, such systems--and with the hardware to run them on. There is simply no purpose in burdening the 99% of Perl 6 installations that will never need such things with the infrastructure to support them in the core.


To use millions of threads you don't focus on what the algorithm is doing: you focus on where the data is going. If you move data unnecessarily (or fail to move it when it was necessary) then you'll burn power and lose performance.

Sorry, but I've got to call you on this.

Parallelisation (threading) is all about improving performance. And the first three rules of performance are: algorithm; algorithm; algorithm. Choose the wrong algorithm and you are wasting cycles. Parallelise that wrong algorithm, and you're just multiplying the number of cycles you're wasting.
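A toy illustration with hypothetical numbers: parallelising a linear search across four cores still loses, by orders of magnitude, to picking the better algorithm on one core.

    use strict;
    use warnings;

    my $N = 1_000_000;    # hypothetical problem size

    # A linear search averages ~N/2 comparisons; 4 cores divide that by 4.
    printf "linear search across 4 cores: ~%d comparisons\n", $N / 2 / 4;           # ~125,000

    # A binary search needs ~log2(N) comparisons on a single core.
    printf "binary search on 1 core:      ~%d comparisons\n", log( $N ) / log( 2 ); # ~20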
