On Oct 25, 2006, at 11:51 AM, Bryan Sant wrote:
> I always just go with threads. But then again, I do a lot of desktop
> software, where interaction between components is frequent and shared
> memory is more efficient, reliable, and convenient than message
> passing via pipes or some other IPC mechanism. I'm not saying that
> Levi's points aren't valid; on the contrary, they are. Memory space
> protection provided by a process is valuable... Valuable if you're
> using C or some other language that can stomp on or leak memory. If
> you're using a language with memory management (Perl, C#, Java, Lisp),
> then the protection provided by processes has little value and some
> downsides.
You're conflating two different problems here. First, there is the
problem of memory safety: C and C++ make it fairly easy to write into
memory you shouldn't, while memory-safe languages don't let you do it
at all. Memory safety requires that all
primitive memory allocation and especially deallocation be managed by
the language runtime, typically via a garbage collector.
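To make "stomp on memory" concrete, here's a tiny C sketch (entirely
illustrative, names mine) of the kind of out-of-bounds write a
memory-safe language would refuse to perform:

    #include <stdio.h>
    #include <string.h>

    int main(void)
    {
        int  flag = 0;     /* an innocent neighbor on the stack */
        char buf[8];

        /* 15 characters plus the NUL terminator: 16 bytes written
           into an 8-byte buffer. C compiles this without complaint;
           the overflow may clobber flag, the return address, or
           anything else nearby. The behavior is undefined. */
        strcpy(buf, "fifteen chars!!");

        printf("flag = %d\n", flag);  /* may print garbage, crash, or "work" */
        return 0;
    }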
The second problem is the nondeterministic interleaving of execution
that exists in the shared-memory concurrency model. Every heap
variable is, by default, shared by all threads. Since the scheduler
can switch between threads at arbitrary times, a program that uses
heap variables naively will almost certainly behave unpredictably and
not do what you want. Enter locks. They let you re-serialize
execution in certain regions of your program, so that only one thread
at a time can run there. This solves one problem, but creates a few
more.
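Here's what that naive use of shared state looks like in practice: a
minimal pthreads sketch (all names mine; a shared global stands in
for any shared heap data) where two threads bump the same counter
with no coordination:

    #include <pthread.h>
    #include <stdio.h>

    static long counter = 0;   /* shared by every thread, by default */

    static void *bump(void *arg)
    {
        int i;
        (void)arg;
        for (i = 0; i < 1000000; i++)
            counter++;         /* the scheduler can interleave these updates */
        return NULL;
    }

    int main(void)
    {
        pthread_t a, b;
        pthread_create(&a, NULL, bump, NULL);
        pthread_create(&b, NULL, bump, NULL);
        pthread_join(a, NULL);
        pthread_join(b, NULL);
        /* "Should" print 2000000; on a real machine it usually prints
           something less, and something different on each run. */
        printf("counter = %ld\n", counter);
        return 0;
    }

Compile with -pthread and run it a few times; the nondeterminism
shows up immediately.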
First of all, you must remember to put locks in all the right
places. Some higher-level languages help out quite a bit with this,
but if you're doing raw pthreads in C, it's pretty easy to screw up
and create a race condition, where nondeterminism creeps into your
program again. And in any language higher-level than assembly, it's
entirely possible that an operation that looks atomic on the surface
(i.e., can't be broken down any further in that language) actually
consists of many machine operations, so the scheduler could switch to
a different thread /in the middle/ of that operation. Doing shared-
memory concurrency safely in a high-level language requires a lot of
information about the implementation of that language, which kind of
defeats the purpose.
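To make that concrete: counter++ in the sketch above looks atomic in
C, but it compiles down to a load, an add, and a store, and the
scheduler can preempt between any two of them. The standard fix is a
mutex around the whole read-modify-write (again, names mine):

    #include <pthread.h>
    #include <stdio.h>

    static long counter = 0;
    static pthread_mutex_t counter_lock = PTHREAD_MUTEX_INITIALIZER;

    static void *bump(void *arg)
    {
        int i;
        (void)arg;
        for (i = 0; i < 1000000; i++) {
            pthread_mutex_lock(&counter_lock);
            counter++;   /* the load/add/store is now indivisible,
                            as far as other threads can tell */
            pthread_mutex_unlock(&counter_lock);
        }
        return NULL;
    }

    int main(void)
    {
        pthread_t a, b;
        pthread_create(&a, NULL, bump, NULL);
        pthread_create(&b, NULL, bump, NULL);
        pthread_join(a, NULL);
        pthread_join(b, NULL);
        printf("counter = %ld\n", counter);  /* 2000000, every run */
        return 0;
    }

Now the count comes out right every time, provided you remembered the
lock at every other place counter is touched; that is exactly the
"all the right places" problem.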
Second, you are hampered in your ability to create new abstractions.
When multiple shared resources are involved, you must be careful to
obtain and release the locks in a consistent order. This is a pain:
it creates concerns that cut across abstraction barriers, and it is
generally an impediment to good software design.
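A sketch of the problem, using a made-up pair of accounts, each
guarded by its own mutex (all names and structure hypothetical):

    #include <pthread.h>

    struct account {
        pthread_mutex_t lock;
        long            balance;
    };

    /* Deadlock-prone: transfer(a, b) and transfer(b, a) running at
       the same time can each take its first lock, then wait forever
       for the other's. */
    void transfer_naive(struct account *from, struct account *to, long amount)
    {
        pthread_mutex_lock(&from->lock);
        pthread_mutex_lock(&to->lock);
        from->balance -= amount;
        to->balance   += amount;
        pthread_mutex_unlock(&to->lock);
        pthread_mutex_unlock(&from->lock);
    }

    /* One conventional fix: impose a single global order on lock
       acquisition, here by address. */
    void transfer(struct account *from, struct account *to, long amount)
    {
        struct account *first  = (from < to) ? from : to;
        struct account *second = (from < to) ? to   : from;
        pthread_mutex_lock(&first->lock);
        pthread_mutex_lock(&second->lock);
        from->balance -= amount;
        to->balance   += amount;
        pthread_mutex_unlock(&second->lock);
        pthread_mutex_unlock(&first->lock);
    }

The ordered version works, but notice that the locking discipline is
now part of every caller's contract; the concern has leaked across
the abstraction boundary.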
Finally, locks can create performance issues. The purpose of a lock
is to serialize part of your program, and if there are too many of
them, your parallelism drops through the floor and you end up with an
effectively serial program. In the worst case, you can deadlock and
bring the program to a halt. Getting good performance out of locks
while eliminating 100% of race conditions and deadlocks is a very
hard thing to do. As the amount of concurrency goes up, the
performance penalty of locks and the chance of hitting a lurking race
condition go up, too.
So, I hope that made the distinction between the problems caused by
lack of memory safety and the problems caused by shared-state
concurrency clear. Regardless of the problems, both are still
sometimes the right solution. They just shouldn't be the DEFAULT
solution for a programmer who wants to write correct code, in
general. Some particular high-level languages and programming
environments make every other concurrency paradigm at least as
difficult as shared-state threading; programmers in such environments
are simply screwed, and should demand better tools.
> You can achieve a much more natural programming model by
> using threads and semaphores than processes and marshaled messages.
What feels natural to do is largely defined by the language you are
using, so that is only true for a subset of languages. I would
argue that languages that make shared-state concurrency the most
natural way to approach a problem ought to be redesigned so that
shared-state concurrency is well-supported when necessary, but
alternatives feel just as natural, or preferably more so.
You have also left out one important option from your list, though:
threads that by default share nothing, but can explicitly ask for
regions of memory to be shared. Combine that with software
transactional memory (a.k.a. optimistic or lock-free concurrency),
message passing, and deterministic concurrency, each used where it is
appropriate, and you can pick the tool that suits your problem and
eliminate the possibility of large classes of programming errors,
just as memory protection eliminates another large class of
programming errors.
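To give a flavor of the optimistic style: instead of locking, you
read the current value, compute the update, and try to publish it
with a compare-and-swap, retrying if another thread got there first.
A minimal C sketch using GCC's __sync builtins (my choice of
primitive; full STM generalizes this retry pattern from one word to
whole transactions):

    static long counter = 0;

    void increment(void)
    {
        long seen, next;
        do {
            seen = counter;     /* optimistic read */
            next = seen + 1;    /* compute the update */
            /* Publish only if nobody changed counter in the
               meantime; otherwise loop and retry. */
        } while (!__sync_bool_compare_and_swap(&counter, seen, next));
    }

There are no locks here, so there is nothing to deadlock on;
contention costs a retry instead of a wait.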
--Levi
/*
PLUG: http://plug.org, #utah on irc.freenode.net
Unsubscribe: http://plug.org/mailman/options/plug
Don't fear the penguin.
*/