Re: [fonc] Everything You Know (about Parallel Programming) Is Wrong!: A Wild Screed about the Future

BGB Tue, 03 Apr 2012 19:08:11 -0700

On 4/3/2012 9:46 AM, Miles Fidelman wrote:

David Barbour wrote:
On Tue, Apr 3, 2012 at 8:25 AM, Eugen Leitl wrote:
    It's not just imperative programming. The superficial mode of human
    cognition is sequential. This is the problem with all of mathematics
    and computer science as well.
Perhaps human attention is basically sequential, as we're only able to focus our eyes on one point and use two hands. But I think humans understand parallel behavior well enough - maintaining multiple relationships, for example, and predicting the behaviors of multiple people.
And for that matter, driving a car, playing a sport, walking and chewing gum at the same time :-)

yes, but people have built-in machinery for this, in the form of the cerebellum. relatively little conscious activity is generally involved in these sorts of tasks.

if people had to depend on their higher reasoning powers for basic movement tasks, people would likely be largely unable to operate effectively in basic day-to-day tasks.

    If you look at MPI debuggers, it puts people into a whole other
    universe of pain than just multithreading.
I can think of a lot of single-threaded interfaces that put people in a universe of pain. It isn't clear to me that distribution is at fault there. ;)
Come to think of it, tracing flow-of-control through an object-oriented system REALLY is a universe of pain (consider the difference between a simulation - say a massively multiplayer game - where each entity is modeled as an object, with one or two threads winding their way through every object, 20 times a second; vs. modeling each entity as a process/actor).


FWIW, in general I don't think much about global control-flow.

however, there is a problem with the differences between:
global behavior (the program as a whole);
local behavior (a local collection of functions and statements).

a person may tend to use general fuzzy / "intuitive" behavior for reasoning about "the system as a whole", but will typically use fairly rigid sequential logic for thinking about the behavior of a given piece of code.

there is a problem if the individual pieces of code are no longer readily subject to analysis.

the problem I think with multithreading isn't so much that things are parallel or asynchronous, but rather that things are very often inconsistent.

if two threads try to operate on the same piece of data at the same time, often this will create states which are impossible had either been operating on the data individually (and, very often, changes made in one thread will not be immediately visible to others, say, because the compiler had not actually thought to write the change to memory, or the other thread to reload the variable).

hence, people need things like the "volatile" modifier, use of atomic operations, things like "mutexes" or "synchronized" regions, ...
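
as a rough illustration of the mutex case (a minimal sketch in Python rather than any of the languages discussed here; the `Counter` class and `run` helper are made up for the example): each thread does a read-modify-write on shared data, and the lock is what keeps two threads from both starting from the same old value. Python has no "volatile", so this only shows the mutual-exclusion part, not the visibility part.

```python
import threading


class Counter:
    """Shared counter whose read-modify-write is guarded by a mutex."""

    def __init__(self):
        self.value = 0
        self._lock = threading.Lock()

    def add(self, n):
        # Without the lock, two threads could both read the same old value,
        # and one of the two updates would then be lost.
        with self._lock:
            self.value += n


def run(num_threads=4, per_thread=10000):
    """Hypothetical driver: several threads increment one shared counter."""
    counter = Counter()

    def worker():
        for _ in range(per_thread):
            counter.add(1)

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter.value
```

with the lock in place, `run(4, 10000)` returns 40000 every time; the cost is that the threads take turns on `add`.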



this leaves several possible options:

systems go further in this direction, with little expectation of global synchronization unless some specific mechanism is used (two threads working on a piece of memory may temporarily each see their own local copy);
or, languages/compilers go the other direction, so that one thread changing a variable is mandated to be immediately visible to other threads.


one option (mandated global visibility) is more costly than the other.

as-is, the situation seems to be that compilers lean on one side (only locally consistent), whereas the HW tries to be globally consistent.

a question then, is assuming HW is not kept strictly consistent, how to best handle this (regarding both language design and performance).

however, personally I think abandoning local sequential logic and consistency would be a bad move.

I am personally more in-favor of message passing, and either the ability to access objects synchronously, or "pass messages to the object", which may be in-turn synchronous.


consider, for example:
class Foo
{
    sync function methodA() { ... }        //synchronous (only one such method executes at a time)
    sync function methodB() { ... }        //synchronous
    async function methodC() { ... }       //asynchronous / concurrent (calls will not block)
    sync async function methodD() { ... }  //synchronous, but calls will not block
}

...
var obj=new Foo();

//thread A
obj.methodA();
//thread B
obj.methodB();

the VM could enforce that the object only executes a single such method at a time (but does not globally lock the object, unlike "synchronized").


similarly:
//thread A
async obj.methodA();
//thread B
async obj.methodB();

which works similarly, except neither thread blocks (in this case, "obj" functions as a virtual process, and the method call serves more as a message pass). note that, if methods are not "sync", then they may execute concurrently.

note that "obj.methodC();" will behave as if the async keyword were given (it may be called concurrently). "obj.methodD();" will behave as if it were a synchronous method called asynchronously.

this allows deliberately asynchronous behavior (and more ability to optimize for a multi-processor system), but at the same time, does not hinder the use of synchronous logic.
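
the "object as a virtual process" idea can be approximated in an ordinary language with a mailbox and a worker thread. the following is only a sketch in Python under stated assumptions, not how the VM does it: `SyncObject`, `call_sync`, `call_async` and `close` are names invented here, and `Future` from `concurrent.futures` stands in for whatever the VM would hand back from an async call.

```python
import queue
import threading
from concurrent.futures import Future


class SyncObject:
    """Runs queued method calls one at a time on a private worker thread."""

    def __init__(self):
        self._mailbox = queue.Queue()
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def _run(self):
        # Messages are handled strictly in arrival order, so at most one
        # "sync" method of this object is executing at any moment.
        while True:
            item = self._mailbox.get()
            if item is None:  # shutdown marker
                break
            func, args, future = item
            try:
                future.set_result(func(*args))
            except Exception as exc:
                future.set_exception(exc)

    def call_async(self, func, *args):
        """Roughly 'async obj.method()': enqueue the call, do not block."""
        future = Future()
        self._mailbox.put((func, args, future))
        return future

    def call_sync(self, func, *args):
        """Roughly 'obj.method()' on a sync method: wait for the result."""
        return self.call_async(func, *args).result()

    def close(self):
        self._mailbox.put(None)
        self._worker.join()


class Foo(SyncObject):
    def __init__(self):
        super().__init__()
        self.log = []

    def method_a(self):
        self.log.append("A")
        return "A done"

    def method_b(self):
        self.log.append("B")
        return "B done"
```

usage would be along the lines of `obj.call_async(obj.method_a)` from one thread and `obj.call_sync(obj.method_b)` from another. one known limit of this sketch: calling `call_sync` from inside one of the object's own methods would deadlock, since the worker would be waiting on itself.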


similar behavior can be applied to blocks:
async { ... }    //block does not block
sync { ... }  //block will block

as-is, most of this stuff uses "green threads" (and some amount of internal locking), so it is less certain how to best map it to real HW.

I can somewhat imagine how to map it to either "CPUs with inter-processor interrupts", or to something like TCP or UDP, which could be "close enough" (like, if a hugely multiprocessor system were implemented in a form resembling a bunch of simpler shared-memory multiprocessor systems connected over a network or similar).

FWIW, as-is, I don't really make all that intensive use of multithreading. my code is, technically, multithreaded, but most of the code is itself still largely single-threaded, and most threads are based on "subdivision by task" rather than "parallel execution".


for example:
a thread to manage the GC;

another thread to manage the "TRNG". it tries to generate true random numbers based on the assumption that the noise patterns seen in "rdtsc" deltas represent "true" entropy (basically, it is a loop which does little more than sleep and call rdtsc, and add the values seen into a random number generator);

also the VM green-thread workers, and several other misc threads.
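
the TRNG thread described above might look roughly like the following. this is a sketch in Python with several assumptions: `time.perf_counter_ns()` stands in for rdtsc, SHA-256 stands in for whatever mixing the actual random number generator does, and the `JitterEntropyPool` name and its methods are invented for the example. whether the timing deltas really carry "true" entropy is the same assumption as in the description, and nothing here has been vetted for cryptographic use.

```python
import hashlib
import threading
import time


class JitterEntropyPool:
    """Background thread that mixes timer deltas into a hash state."""

    def __init__(self):
        self._hash = hashlib.sha256()
        self._lock = threading.Lock()
        self._stop = threading.Event()
        self._last = time.perf_counter_ns()
        self._thread = None
        self.samples = 0

    def _collect(self, interval):
        # Little more than: sleep, read the timer, add the delta to the pool.
        while not self._stop.is_set():
            time.sleep(interval)
            now = time.perf_counter_ns()
            delta = now - self._last
            self._last = now
            with self._lock:
                self._hash.update(delta.to_bytes(8, "little"))
                self.samples += 1

    def start(self, interval=0.001):
        self._thread = threading.Thread(
            target=self._collect, args=(interval,), daemon=True)
        self._thread.start()

    def stop(self):
        self._stop.set()
        self._thread.join()

    def read(self):
        """Return 32 bytes derived from everything mixed in so far."""
        with self._lock:
            return self._hash.copy().digest()
```

a consumer would call `start()` once at startup and `read()` whenever it wants to reseed its generator.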

a recent consideration was to move video recording/encoding to its own thread, due to the apparent lag-inducing behavior of the current video recording (which operates in the same thread as my 3D renderer, resulting in a notable drop in framerate whenever video recording is active). granted, this is partly because I am using MJPG, and my JPEG encoder has a hard time keeping up with encoding full-resolution (800x600 in this case) screen-shots in a timely manner:
individually, both the video recording and 3D renderer pull off around 40fps, but when used together, the overall framerate is more around 10-15fps.
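
one way the split could be structured is a bounded queue between the renderer and an encoder thread, so the renderer never waits on the encoder and simply drops a frame when the encoder falls behind. this is a sketch in Python, not the actual recorder: `Recorder`, `submit` and `encode_frame` are hypothetical names, and `encode_frame` is a placeholder rather than a real JPEG encoder.

```python
import queue
import threading


def encode_frame(frame):
    # Placeholder for the real JPEG encoder (hypothetical).
    return b"jpeg:" + bytes(frame)


class Recorder:
    """Encodes frames on its own thread, fed through a bounded queue."""

    def __init__(self, max_pending=4):
        self._frames = queue.Queue(maxsize=max_pending)
        self.encoded = []
        self.dropped = 0
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def submit(self, frame):
        """Called from the render thread; never blocks."""
        try:
            self._frames.put_nowait(frame)
            return True
        except queue.Full:
            # Encoder is behind: drop this frame instead of stalling rendering.
            self.dropped += 1
            return False

    def _run(self):
        while True:
            frame = self._frames.get()
            if frame is None:  # shutdown marker
                break
            self.encoded.append(encode_frame(frame))

    def close(self):
        self._frames.put(None)  # may wait until the queue has room
        self._thread.join()
```

the design choice here is to trade recorded frames for render framerate. note that in CPython a pure-Python encoder would still contend with the renderer for the interpreter lock, so the sketch shows the structure only, not the speedup.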

yes, it is much faster to dump truecolor video to disk, but this rapidly eats up HDD space (around 1GB/min with 16bpp output). blarg...
currently, video-recording isn't a major use-case though.
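
as a sanity check on the disk-space figure (a small sketch in Python; the frame rate the raw dump actually ran at is not stated, so the 15fps used in the usage note is an assumption, and `raw_video_bytes_per_minute` is a name made up here):

```python
def raw_video_bytes_per_minute(width, height, bits_per_pixel, fps):
    """Size of one minute of uncompressed video, ignoring container overhead."""
    bytes_per_frame = width * height * bits_per_pixel // 8
    return bytes_per_frame * fps * 60
```

an 800x600 frame at 16bpp is 960,000 bytes, so `raw_video_bytes_per_minute(800, 600, 16, 15)` gives 864,000,000 bytes, which is in line with "around 1GB/min"; at 40fps the same formula gives about 2.3GB/min.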



or such...

_______________________________________________
fonc mailing list
[email protected]
http://vpri.org/mailman/listinfo/fonc
