David Barbour wrote:

On Wed, Apr 4, 2012 at 9:00 AM, Miles Fidelman <mfidel...@meetinghouse.net> wrote:

    The whole point of architecture is to generate the overall outline
    of a system, to address a particular problem space within the
    constraints at hand. The KISS principle applies (along with "seek
    simplicity and distrust it"). If there isn't a degree of
    simplicity and elegance in an architecture, the architect hasn't
    done a particularly good job.


I agree.


    In the past, limitations of hardware, languages, and run-time
    environments have dictated against taking parallel (or more
    accurately, concurrent) approaches to problems, even when massive
    concurrency is the best mapping onto the problem domain -
    resulting in very ugly code.


I'd say the bigger problem is that there haven't been very good concurrency models that ever reached mainstream. The choices have been threads and locks, processes, maybe a transactional database.

Sort of depends on the scale you're looking at. At a systems level, pretty much everything is concurrent these days - lots of clients hitting lots of servers, with very loose coupling, and huge numbers of transactions happening all at once. Some pieces tend to get serialized as you get closer to databases, but that's about it.

As you get into the architecture of a specific service, again things look pretty concurrent - say a web server spawning a process to handle a request.


Outside of mainstream, there are a lot more options. Lightweight time warp. Synchronous reactive. Temporal logic. Event calculus. Concurrent constraint. Temporal concurrent constraint. Functional reactive programming.

Few of which, however, are conceptually simple (maybe elegant).


    Yes, there are additional problems introduced by modeling a
    problem as massively concurrent

Well, not inherently. I'd note that your example of 5 loops with 2000 tanks at 20Hz is essentially an implementation of a step-clocked concurrency model. It just happens to be represented in a sequential programming language, so you get a bunch of semantic noise (i.e. as an outside observer you know that all 2000 computations of line-of-sight are independent of one another, but that isn't obvious in the language).
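To make the observation concrete, here is a minimal sketch (hypothetical names, not the simulator under discussion) of that same step-clocked structure: each tick, every entity's line-of-sight test reads only the previous tick's snapshot, so all of the per-entity computations are independent of one another by construction.

```python
import math

def line_of_sight(a, b, max_range=10.0):
    # Toy visibility test: within range, no terrain occlusion modeled.
    return math.dist(a, b) <= max_range

def step(positions):
    # Each entity's result depends only on the read-only snapshot of the
    # previous tick, so iteration order is irrelevant and the 2000
    # computations could run in parallel without changing the answer.
    return {
        i: [j for j in positions if j != i and line_of_sight(positions[i], positions[j])]
        for i in positions
    }

positions = {0: (0.0, 0.0), 1: (3.0, 4.0), 2: (100.0, 100.0)}
visible = step(positions)
```

In a sequential language this independence is an invariant the reader must infer; in a synchronous concurrency model it would be explicit in the semantics.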

I wouldn't consider anything that's inherently synchronous as an example of concurrency. Parallel yes.

Re. LOS calculations: right off the bat, from a global viewpoint I see an n-squared problem cut in half - if I can see you, you can see me, so there's no need to calculate that twice (n(n-1)/2 unordered pairs instead of n(n-1) ordered ones). But that's an optimization that's independent of concurrency.
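That symmetry optimization can be sketched directly (illustrative names, assuming a symmetric visibility test): compute each unordered pair once, for n(n-1)/2 checks rather than n(n-1).

```python
from itertools import combinations
import math

def visible_pairs(positions, max_range=10.0):
    # One check per unordered pair: (i, j) implies (j, i), so the
    # reverse direction is never computed.
    return {
        (i, j)
        for (i, pa), (j, pb) in combinations(positions.items(), 2)
        if math.dist(pa, pb) <= max_range
    }

positions = {0: (0.0, 0.0), 1: (3.0, 4.0), 2: (100.0, 100.0)}
pairs = visible_pairs(positions)
```

Note that the trick relies on the visibility predicate being symmetric; asymmetric sensors (different ranges per tank) would break it.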

But your particular choice for massive concurrency - asynchronous processes or actors - does introduce many additional problems.

And eliminates many more - at least if one takes a shared-nothing, message passing approach.
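A shared-nothing, message-passing process can be sketched in a few lines (illustrative, not a full actor runtime): the process owns its state outright, and the only way to affect it is to send a message.

```python
import threading
from queue import Queue

def counter_process(inbox: Queue, outbox: Queue):
    count = 0  # private state; no other thread can touch it
    while True:
        msg = inbox.get()
        if msg == "stop":
            outbox.put(count)  # report final state, then terminate
            return
        count += 1

inbox, outbox = Queue(), Queue()
t = threading.Thread(target=counter_process, args=(inbox, outbox))
t.start()
for _ in range(3):
    inbox.put("tick")
inbox.put("stop")
t.join()
result = outbox.get()
```

Because no state is shared, there is nothing to lock; the queue is the entire interface, which is the property that eliminates the usual race conditions.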



    - control latency, support replay, testing, maintenance,
    verification: these are nothing new at the systems level (think
    about either all the different things that run on a common server,
    or about all the things that go on in a distributed system such as
    the federated collection of SMTP servers that we're relying on
    right now)


Yes, these are old problems at the systems level. Mostly unsolved. I've seen e-mails take 5-6 days to get through. We have almost no ability to reason about system invariants. We need constant human administration to keep our systems up and running. I'm quite familiar with it.


    - consistency: is not your message "Avoid the Concurrency Trap by
    Embracing Non-Determinism?" -- is not a key question: what does it
    mean to "embrace non-determinism" and how to design systems in an
    inherently indeterminate environment? (more below)


Uh, no. But I see below you confused me with another David. Perhaps see my Mar 28 comment in this thread. I reject Ungar's position.

Yup. Sorry about that. I don't reject his position.



        The old sequential model, or even the pipeline technique I
        suggest, do not contradict the known, working structure for
        consistency.


    But is consistency the issue at hand?


Yes. Of course, it wasn't an issue until you discarded it in pursuit of your `simple` concurrency model.

We're talking about "fundamentals of new computing" - and as far as I can tell, Ungar and Hewitt have it about right in pointing out that consistency goes out the window as systems scale up in size and complexity. The question to me is: what are the design patterns that are useful for building systems in a world where inconsistency is a given? Probabilistic and biological models seem to apply.


    This line of conversation goes back to a comment that the limits
    to exploiting parallelism come down to people thinking
    sequentially, and inherent complexity of designing parallel
    algorithms. I argue that quite a few problems are more easily
    viewed through the lens of concurrency - using network protocols
    and military simulation as examples that I'm personally familiar with.


I have not argued that people think sequentially or that parallel algorithms are inherently complex. I agree that many problems are well viewed through the lens of concurrency.

Ahh... but that is the point I was responding to that led to this branch of discussion.


But your proposed approach to concurrency is not easier, not once you account for important problems solved by the original simulator that you chose to ignore.


    You seem to be making the case for sequential techniques that
    maintain consistency.

Pipelines are a fine sequential technique, of course, and I think we should use them often. But more generally I'd say what we need is effective support for synchronous concurrent behavior - i.e. to model two or more things happening at the same time.
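As a small illustration of the pipeline style being endorsed here (generic example, not from the simulator): each stage consumes the previous stage's output, so the composition is sequential and deterministic even though the stages are conceptually concurrent.

```python
def parse(lines):
    # Stage 1: split raw lines into fields.
    for line in lines:
        yield line.strip().split(",")

def filter_valid(records):
    # Stage 2: drop malformed records.
    for rec in records:
        if len(rec) == 2:
            yield rec

def total(records):
    # Stage 3: reduce to a single result.
    return sum(int(v) for _, v in records)

result = total(filter_valid(parse(["a,1", "bad", "b,2"])))
```

Generators give the stages independent control flow while the dataflow between them stays fully ordered, which is one way to get "things happening at the same time" without giving up determinism.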

And I find that synchronous behavior, at least at the micro level, leads to huge numbers of problems and very brittle systems. Where synchronous behavior is needed, I find it easier to apply at higher levels of a protocol stack, where application behavior can drive specific synchronization models (TCP over IP, for example; return receipts for mail, 2-phase commits for databases, forward error correction for media streams, ...).



I also disagree with Hewitt (most recently at http://lambda-the-ultimate.org/node/4453). Ungar and Hewitt both argue "we need indeterminism, so let's embrace it". But they forget that every medicine is a poison if overdosed or misapplied, and it doesn't take much indeterminism to become poisonous.

To accept and tolerate indeterminism where necessary does not mean to embrace it. It should be controlled, applied carefully and explicitly.

Then we're in violent disagreement. I've yet to see a complex system where indeterminism isn't the norm, and where attempts to impose determinism cause huge, often insurmountable problems (think societies, or mono-culture agriculture).


        a) you selectively conceptualize only part of the system - an
        idealized happy path. It is much more difficult to
        conceptualize your whole system - i.e. all those sad paths you
        created but ignored. Many simulators have collision detection,
        soft real-time latency constraints, and consistency
        requirements. It is not easy to conceptualize how your system
        achieves these.


    In this one, I write primarily from personal experience and
    observation. There is a huge class of systems that are inherently
    concurrent, and inherently not serializable - pretty much any
    distributed system: email and transaction processing come to
    mind. I happen to think that simulators fall into this class -
    and in this regard there's an existence proof:


In general, the asynchronous semantics you get with processes and actors are also a poor map to simulation problems. In my experience, the best approaches to simulation involve synchronous programming of some sort - event calculus, step clock, synchronous reactive, temporal logic, functional reactive, reactive demand programming, etc.

Well, clearly, we have very different experiences (or are working on very different kinds of simulations).


The basic reason for this is that you: (a) want to model lots of things happening at once (not `asynchronously` but truly `at the same time`), (b) don't want any participant to gain a special advantage from its ordering within a turn, (c) want consistency, freedom from glitches and anomalies, and the ability to debug and regression-test your model, and (d) want precise real-time reactions - i.e. as opposed to delaying messages indefinitely.

Not when you're building things like wargames - things are very probabilistic at the macro level. If you want to understand a range of outcomes, you have to re-run the exercise, possibly many times, using Monte Carlo techniques.
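The Monte Carlo point can be illustrated with a toy engagement model (purely illustrative numbers and names): the same scenario is re-run under many seeds, and it is the distribution of outcomes, not any single run, that answers the question.

```python
import random

def engagement(seed, shots=10, p_hit=0.3):
    # One stochastic run of a toy engagement: count hits out of `shots`.
    rng = random.Random(seed)  # per-run generator keeps runs independent
    return sum(rng.random() < p_hit for _ in range(shots))

# Re-run the identical scenario many times to estimate the outcome range.
outcomes = [engagement(seed) for seed in range(1000)]
mean_hits = sum(outcomes) / len(outcomes)
spread = (min(outcomes), max(outcomes))
```

Seeding each run also makes individual runs replayable, which recovers some of the debuggability that raw indeterminism gives up.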

    In the case of email, I can't even begin to think about applying
    synchronous parallelism to messages flowing through a federation
    of mail servers.


That's a rather facetious case, of course. E-mail is defined by an asynchronous protocol. But I can easily model asynchronous communication in a synchronous system by use of intermediate shared state (a database, for example).
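A minimal sketch of that claim (hypothetical names, shared state standing in for the database): senders append to a shared spool during one synchronous tick, and receivers see those messages on the next tick, which reproduces asynchronous, delayed delivery inside a step-clocked model.

```python
def tick(spool, outgoing):
    # Deliver everything queued during the previous tick...
    delivered = dict(spool)
    spool.clear()
    # ...then enqueue this tick's sends for delivery next tick.
    for addr, msg in outgoing:
        spool.setdefault(addr, []).append(msg)
    return delivered

spool = {}
first = tick(spool, [("alice", "hi")])  # tick 1: nothing delivered yet
second = tick(spool, [])                # tick 2: last tick's mail arrives
```

The one-tick delay is the essential move: it decouples send from receive, giving asynchronous semantics while every tick itself remains deterministic.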

Not at all. The world is asynchronous. Email provides a very good model for the environment that we're building systems in today.



--
In theory, there is no difference between theory and practice.
In practice, there is.   .... Yogi Berra


_______________________________________________
fonc mailing list
fonc@vpri.org
http://vpri.org/mailman/listinfo/fonc
