Hi all
I would like to point out that these days there is huge hype around multicore systems. As a result, one sees stupid parallel demonstrations, such as the Mandelbrot one. This is a pure graphics demo with no other utility, remote from any realistic parallel application. Moreover, this example is one of the few that can be done efficiently on present-day multicore systems. So, if the FPC team is really intent on bringing parallel constructs into the language, it has to look at things from a broader perspective. It is necessary to think about what realistic parallel applications would look like.

Unfortunately, the utility of multicore systems has been largely exaggerated by their manufacturers. The main problem is that multiple cores share the same memory bandwidth. As a result, it is highly unlikely that one can have COMPLEX programs running concurrently on a multicore system without clogging the memory bus and using up all the cache. Multiple cores are useful if there is little memory transfer (which does not happen often, except of course if you compute fractals), or if memory transfer is done in a predictable fashion. About the only examples of the latter are linear algebra subroutines (scientific computing) and certain multimedia applications (concurrent MPEG decoders, for example).

Now, it is true that a dot product or vector sum can be done very elegantly with a parallel loop. However, these are very low-level operations, ones that (at least in the scientific computing community) are typically optimized for each particular architecture and provided as a user API. Consequently, no one would go around writing matrix-vector multiplication in a high-level language. Linear algebra is the usual bottleneck, and if you do real applications it has already been written and optimized. Consequently, parallel loops look beautiful, but they are of little practical utility. In summary, the programming style that led to assembly-level loop unrolling for superscalar processors is likely to be the same programming style that will be used for multicore machines.
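For concreteness, such an elegant parallel dot product might look like the following sketch. The `parallel for` keyword and the reduction semantics are invented syntax for illustration; no such construct exists in FPC today:

```pascal
function DotProduct(const A, B: array of Double): Double;
var
  i: Integer;
  Sum: Double;
begin
  Sum := 0.0;
  { Hypothetical construct: iterations are distributed over the
    available cores, with Sum treated as a reduction variable. }
  parallel for i := 0 to High(A) do
    Sum := Sum + A[i] * B[i];
  Result := Sum;
end;
```

Pretty as it is, this is exactly the kind of kernel that an optimized BLAS routine already provides, which is the point above.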

So typical parallel code revolves around higher-level algorithms. For example, if you want to compute the heat distribution in an automobile engine, you would first partition your engine into many smaller components. Then you would perform complex, memory-intensive computations on each piece, and finally patch the results together. It is questionable whether multicore systems are useful in such a scenario, as it involves large memory transfers. However, if they are (or you have a real multi-processor shared-memory machine), then what you would need from the language is a nice encapsulation of threads. This implies (local) parallel procedures, arrays of (local) parallel procedures, parallel class methods, semaphores and critical sections.
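As a sketch of what such encapsulation could mean, here is one possible shape for a local parallel procedure. Again, the `parallel` modifier and the implicit join are invented syntax, purely to make the wish list concrete:

```pascal
procedure SolveEngine;
var
  n: Integer;

  { Hypothetical syntax: a local procedure marked "parallel", so each
    call spawns a thread that shares the enclosing scope. }
  parallel procedure SolvePart(Part: Integer);
  begin
    { memory-intensive computation on partition number Part }
  end;

begin
  { partition the engine, then solve each piece concurrently }
  for n := 1 to NumParts do
    SolvePart(n);      { each call runs in its own thread }
  { implicit join here: all SolvePart threads finish before we
    patch the pieces back together }
end;
```

The compiler, not the programmer, would then be responsible for generating the thread plumbing.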

The present obstacle with Object Pascal is that for each (class) method that implements a parallel algorithm, one has to separately implement a thread object. Moreover, parallel algorithms typically need global variables (the ones they operate on in parallel), so you need to move the local method variables to the thread object too. In the end, the implementation of your algorithm is split between the method and the thread object. Finally, synchronization is provided by classes (TEvent, TCriticalSection) which have to be constructed and destroyed explicitly, with the necessary resource-protection (try..finally) overhead. This is not convenient.
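To illustrate the boilerplate, here is a minimal sketch of what running one slice of an algorithm on a thread looks like today. The class and field names (TSolverThread, FLo, FHi, FData) are mine; only TThread, TCriticalSection and the try..finally pattern come from the RTL:

```pascal
uses
  Classes, SyncObjs;

type
  { The algorithm's local variables must be promoted to fields
    of a dedicated thread class. }
  TSolverThread = class(TThread)
  private
    FLo, FHi: Integer;   // slice of the data this thread works on
    FData: PDouble;      // the shared data all threads operate on
  protected
    procedure Execute; override;
  public
    constructor Create(AData: PDouble; ALo, AHi: Integer);
  end;

constructor TSolverThread.Create(AData: PDouble; ALo, AHi: Integer);
begin
  inherited Create(True);   // create suspended
  FData := AData;
  FLo := ALo;
  FHi := AHi;
end;

procedure TSolverThread.Execute;
begin
  { ...the actual algorithm, split off from the method that owns it... }
end;

{ And synchronization objects must be managed by hand: }
procedure RunSolver;
var
  Lock: TCriticalSection;
begin
  Lock := TCriticalSection.Create;
  try
    { spawn TSolverThread instances, Start, WaitFor... }
  finally
    Lock.Free;
  end;
end;
```

All of this is scaffolding around what is, conceptually, a single method.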

I hope this helps the discussion.

Peter Popov
_______________________________________________
fpc-devel maillist  -  [email protected]
http://lists.freepascal.org/mailman/listinfo/fpc-devel
