Hi Dan,
On Oct 1, 2007, at 12:26 PM, Dan Creswell wrote:
Brian Goetz has brought up that we may have frog-boiled ourselves
into a bad situation by adopting the model of shared state with locks
in Java. In general the shared state/locks model makes concurrent
programs difficult to reason about, but in particular this approach to
concurrency isn't composable. You can't safely combine different
modules without understanding the details of what they do with locks
and how they will interact.
That's just about the consequences of shared state with concurrent
access, be it using locks, transactional memory, etc.
And I think in general concurrency is difficult to reason about
even in message-based systems with shared nothing, because you still
have issues of failure to deal with, including how that might impact
message delivery.
Concurrency may be difficult to reason about in general, but some
models are more difficult than others. Some models make it impossible
to trip in certain ways, just as it is impossible in Java bytecodes
to free a pointer to memory twice. As I understand it, Erlang
prevents deadlock by only allowing threads to interact via messages.
Transactional memory can still produce starved threads that keep
retrying and keep getting rolled back, but the app as a whole will
make progress because some threads will be getting stuff done. Java's
basic model of synchronization is a bit more like a mine field,
because you have to understand the whole application, including
everything libraries are doing with locks and callbacks and such, to
be sure there are no potential deadlocks. And that's really hard to
do by analysis, and it is hard to detect problems via testing because
they can happen quite rarely.
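For what it's worth, the composability hazard here is the classic lock-ordering one: two modules that each acquire two locks, in opposite orders, can deadlock when combined. A minimal sketch of the standard defense, a single global acquisition order (all names here are illustrative, not from any particular library):

```java
import java.util.concurrent.locks.ReentrantLock;

public class LockOrdering {
    // Two resources guarded by separate locks, as in two independent modules.
    static final ReentrantLock lockA = new ReentrantLock();
    static final ReentrantLock lockB = new ReentrantLock();

    // Deadlock-prone combination: one module takes A then B, another takes
    // B then A. The defense sketched here is a single global order: every
    // code path acquires A before B.
    static void useBoth() {
        lockA.lock();
        try {
            lockB.lock();
            try {
                // critical section touching both resources
            } finally {
                lockB.unlock();
            }
        } finally {
            lockA.unlock();
        }
    }

    // Two threads calling useBoth() concurrently cannot deadlock, because
    // neither can ever hold B while waiting for A.
    static boolean runBoth() throws InterruptedException {
        Thread t1 = new Thread(LockOrdering::useBoth);
        Thread t2 = new Thread(LockOrdering::useBoth);
        t1.start(); t2.start();
        t1.join(); t2.join();
        return true;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("completed without deadlock: " + runBoth());
    }
}
```

The trouble, of course, is that nothing enforces this ordering across independently written libraries; any callback that takes B before calling into code that takes A silently reintroduces the hazard, which is why the whole application has to be understood.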
If you have a JavaSpaces client that looks at how many CPUs or cores
it has to work with when it starts up, and fires up one master thread
and enough worker threads to keep all those cores busy, assuming each
thread is an independent guy that only communicates with other
threads over the network via a JavaSpace, then those threads can't
deadlock. (Though I suppose you could design a JavaSpace protocol
that could hang them up.)
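A minimal sketch of that master/worker shape, using a LinkedBlockingQueue as a purely local stand-in for the space (put plays the role of write, take the role of take; a real client would use net.jini.space.JavaSpace, and the method names below are otherwise made up for illustration):

```java
import java.util.concurrent.*;

public class CoreSizedWorkers {

    // Distribute "square this number" tasks to a pool of workers sized
    // to the cores available at startup, communicating only through a
    // queue that stands in for a JavaSpace.
    static long sumOfSquares(int tasks) throws InterruptedException {
        BlockingQueue<Integer> space = new LinkedBlockingQueue<>();
        BlockingQueue<Long> results = new LinkedBlockingQueue<>();

        // Look at how many cores we have to work with when we start up.
        int cores = Runtime.getRuntime().availableProcessors();
        ExecutorService workers = Executors.newFixedThreadPool(cores);

        // Each worker is independent and communicates only through the
        // "space", so the workers cannot deadlock on each other's locks.
        for (int i = 0; i < cores; i++) {
            workers.submit(() -> {
                try {
                    while (true) {
                        int task = space.take();          // blocks until a task exists
                        if (task < 0) break;              // poison pill: shut down
                        results.put((long) task * task);  // "write" the result back
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }

        // Master thread: write the task entries, then one pill per worker.
        for (int i = 0; i < tasks; i++) space.put(i);
        for (int i = 0; i < cores; i++) space.put(-1);

        long sum = 0;
        for (int i = 0; i < tasks; i++) sum += results.take();
        workers.shutdown();
        return sum;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("sum of squares = " + sumOfSquares(100));
    }
}
```

The point of the queue stand-in is only the shape of the design; with a real space, the same workers could just as easily be on other boxes.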
The Pragmatic Programmers recently published a book on Erlang,
which got a lot of people talking about Erlang. Erlang uses a shared
nothing
model,
with message passing between "processes" managed by "actors".
Processes
can be implemented as threads I assume, or can be distributed. One
interesting thing about Erlang is that it tries to unify the
remote and
local models, as far as I can tell. Not that they haven't read "A
Note on Distributed Computing." I think that instead of trying to
make remote nodes look like local ones, they may treat local ones as
unreliable as remote ones.
I've yet to see exactly how Erlang does failure detection of
processes.
I guess there might be some timeout value somewhere in respect of
messages reaching a destination, etc., but I've not seen a
description of this aspect of Erlang.
Further, whilst Erlang might do failure detection (of a form),
solving the issues of failure is the difficult bit, and I'm less
convinced Erlang offers much here. For example, one solution to
failure is replication, and it appears you are (unsurprisingly) left
to do that for yourself right now. Putting my high-performance hat
on, I'd also point out that replication has recognized limits,
especially when it's done with transactions, which leads to even more
esoteric solutions that are largely about appropriate
architecture/interactions and less about shared-nothing or message
passing.
I'm not trying to promote Erlang's approach, only to point out that
it is getting a lot of buzz, because people are thinking about multi-
core.
I've been involved with a language called Scala lately, which has an
Erlang-like actors library. On the mailing list they keep talking
about
issues with implementing remote actors. I don't yet understand these
details either, but I keep getting this weird feeling that wheel
reinvention is going on. They seem to be talking about how to solve
problems that Jini addressed almost 10 years ago.
So here's my question. I get the feeling that the trend to multi-core
architectures represents a disruptive technology shift that will
shake
up the software industry to some extent. Does River have something to
offer here? If you expect the chips your software will run on will
have
multiple cores, and maybe you don't know how many until your program
starts running, you'll want to organize your software so it
distributes
processing across those cores dynamically. Isn't JavaSpaces a good
way
to do that?
I think what it might mean is that you treat another core on the same
box running a worker thread the same as a worker thread across the
network. That way you have a uniform programming model, and when
you run
out of cores, you just add more boxes and you get more worker
nodes. So
it would be the opposite of the concept targeted by the Note. Yes,
you
would use objects through a uniform interface, and whether or not
that
object is implemented locally or remotely would be an implementation
detail of the object. But what you'd assume is not that the thing is
local (a thread on another core of the same box) but remote.
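One way to read that "assume remote" stance: give workers a uniform contract that admits failure even when the implementation happens to be local. A sketch with hypothetical names (this is not a River or Jini API, just the shape of the idea):

```java
import java.rmi.RemoteException;

public class UniformModel {
    // Hypothetical uniform worker contract: callers must handle failure
    // whether the implementation is a thread on another core of the same
    // box or a node across the network.
    interface Worker {
        int square(int n) throws RemoteException;
    }

    // Local implementation: runs in-process, but still declares the
    // remote-style contract, so swapping in a remote proxy changes
    // nothing for the caller.
    static class LocalWorker implements Worker {
        public int square(int n) { return n * n; }
    }

    // The caller is forced to decide what failure means, even though
    // this particular worker can never actually fail remotely.
    static int runJob(Worker w, int n) {
        try {
            return w.square(n);
        } catch (RemoteException e) {
            return -1;
        }
    }

    public static void main(String[] args) {
        System.out.println(runJob(new LocalWorker(), 7));
    }
}
```

When you run out of cores, replacing LocalWorker with a network proxy leaves every call site untouched, which is the uniform-model payoff.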
Hmmmm, so the uniform model concept is nice and clears up one
difficulty, but there are some others lying around in this which I
reckon are in need of consideration:
(1) A number of multi-core systems are threatening to head towards
NUMA-type architectures where the cost of comms is in part related to
the number of memory spaces you have to hop.
(2) There's at least some (significant?) difference between comms
performance across processors in the same box versus across a network
and therefore the protocols you design and what you pass around in
messages might be somewhat different.
I'm not sure how NUMA would affect things, but local versus remote
interfaces usually get into considering chatty versus chunky designs.
So my feeling was that if you really are only ever going to want to
exploit multiple cores on one box, JavaSpaces would be overkill,
because you can reasonably rule out partial failure. But in the case
where someone wants to exploit multiple cores and also either
distribute processing across the network as well, or at least leave
the door open to make it easy to distribute across the network in the
future, JavaSpaces has a compelling solution.
I can imagine J2EE people all over the place in a few years
scratching their heads about how they will take advantage of multiple
cores for tasks they need done. Will they run a separate J2EE app
server on each core? Seems like they could run one app server with
multiple threads on each box. But then how do you distribute tasks to
those threads? JMS doesn't have a take semantic. I suppose they could
install a load balancer in front of a cluster, and have a master
server firing jobs into the load balancer.
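The take semantic in question can be sketched with a plain BlockingQueue, which, like a JavaSpace take (minus the template matching), removes each entry exactly once, so two competing workers never process the same job. The queue and names below are illustrative stand-ins, not any messaging API:

```java
import java.util.Set;
import java.util.concurrent.*;

public class TakeSemantics {

    // Distribute n jobs to two competing consumers. Because poll()
    // removes an entry from the queue, each job is consumed exactly
    // once -- the semantic a master/worker design needs.
    static int processDistinct(int n) throws InterruptedException {
        BlockingQueue<Integer> jobs = new LinkedBlockingQueue<>();
        for (int i = 0; i < n; i++) jobs.put(i);

        Set<Integer> seen = ConcurrentHashMap.newKeySet();
        Runnable consumer = () -> {
            Integer job;
            while ((job = jobs.poll()) != null) {
                // add() returning false would mean two consumers got one job
                if (!seen.add(job)) {
                    throw new IllegalStateException("duplicate " + job);
                }
            }
        };
        Thread c1 = new Thread(consumer);
        Thread c2 = new Thread(consumer);
        c1.start(); c2.start();
        c1.join(); c2.join();
        return seen.size();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("distinct jobs processed: " + processDistinct(10));
    }
}
```

A JavaSpace adds what the queue lacks: template matching on entry fields, transactions, and network reach, which is what makes it a task distributor rather than just a pipe.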
JavaSpaces solves this problem very elegantly, and has for a long
time. The change in the status quo is that the rise of multi-core
means more people will be trying to figure out how to do this kind of
parallel processing than before. To exploit multi-core, you have to
figure out how to partition your app so that you can do parallel
processing. You have to find the parallelism. If you actually can do
that, you next have to figure out how to implement it. The
opportunity I see for River is a marketing one, to simply try and
promote the idea that JavaSpaces can be used to solve this kind of
problem. So when people face the problem someday, they'll think of
JavaSpaces.
Is it still called JavaSpaces? Jini isn't called Jini anymore. What
about JavaSpaces?
Thanks.
Bill