Re: [netlogo-devel] Actor model of concurrency (Akka) in multi agent simulation software (NetLogo)

Jason Bertsche Wed, 04 Feb 2015 23:39:22 -0800

I've actually thought about concurrency in NetLogo a fair amount. As for the pitfalls... they're complicated. I'll ramble and explain a bit.

First of all, this would be a pretty fundamental change to the language, so one of the pitfalls is that, in order to even have a chance of getting a good speed-up out of this change, I think we would need to spend a lot of time on overhauling the existing NetLogo code to operate in this way. On top of that, this added level of complication (that is, model state changes passing through an actor system) I would expect to greatly complicate the core NetLogo code, further increasing the already-substantial maintenance burden for future core NetLogo development work.

The complications do not stop there, though; there are much more practical problems at hand. I'm not going to provide any proof of these claims here, but here are some assumptions of mine:


1. The cases where one most desires to parallelize NetLogo are those
   involving large agentsets
2. The vast majority of `ask` operations in NetLogo change world
   (global) state (e.g. agent positions/variables)
3. In a substantial percentage of the cases where `ask` is used, the
   outcome depends on the global state (e.g. positions/variables of
   other agents)

On top of those three premises, agents in `ask` blocks can change the global state in a way that affects all of the other agents in the agentset that is being `ask`ed (e.g. modifying a global variable, moving towards/away from another agent). This makes parallelization difficult. The typical actor-based solution to this problem might be to hide the world state within an actor (or, in the style of C or plain Java, you might use resource locks on world state), but, since pretty much every agent is going to have to spend time waiting on the world state to be available for reading or writing, I would expect that you wouldn't get a very substantial speed up (possibly even a /slowdown/ in many cases, because of the concurrency overhead!) from this approach. The results depend on the contents of the `ask` block, but I have severe doubts that typical models would see much speedup from this sort of concurrency, and the functionality would come at the cost of hundreds of hours of lost development time over the years, I would expect, because of the way in which it would have to tangle itself up in the guts of the core NetLogo code. So that particular approach doesn't seem very good to me, especially because it hasn't really solved the problem of all the agents trying to change the global state simultaneously.

Let's not give up hope, though; this isn't the only way to approach concurrency in NetLogo. Software transactional memory (STM) is a style of concurrency that takes a cue from the database world, treating concurrent actions as transactions upon the global state that can be rolled back if some conflict arises in the global state. One could imagine this being applied to `ask` operations. A student at Utrecht University created a NetLogo-like program (HLogo) that performed concurrency through STM and wrote his Master's thesis <http://dspace.library.uu.nl/handle/1874/284708> on this very topic. I actually wouldn't be surprised if the author of the paper reads this board and might have something to say on the matter. Regardless, on this topic, he concluded this:

Execution results show that HLogo is faster than NetLogo for most cases, particularly where the number of agents stay low. When the agent population grows as to produce significant number of STM conflicts, the performance of HLogo considerably drops.

If you look at the results in Section 4.4 of his thesis, you'll see that the performance really does drop off quite noticeably as the number of agents increases, to the point of being a substantial speed /regression/ in comparison to standard NetLogo. Unfortunately, though, the case where we have lots of agents is *the very case* we want concurrency to help us out with! My view on the matter is that STM is inherently incapable of solving this problem; using STM for thousands of concurrent operations that are each almost certain to conflict and cause a rollback is merely going to lead to what is essentially just a sequential processing of the agentset (but with a ton of concurrency overhead).
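To make the "sequential processing plus overhead" point concrete, here is a toy Python model of optimistic transactions. It is only a sketch under stated assumptions: it is not HLogo's implementation, it does not use a real STM library, and the function name `attempts_needed` and the "one winner per round" rule are made up for illustration.

```python
def attempts_needed(num_agents, conflict_free=False):
    """Count transaction attempts until every agent has committed.

    Toy model (an assumption, not a real STM): agents run in rounds. In each
    round every pending agent runs its block and tries to commit. If all
    agents touch the same state (conflict_free=False), only one commit per
    round succeeds; the others roll back and retry in the next round.
    """
    pending = num_agents
    attempts = 0
    while pending > 0:
        attempts += pending                      # every pending agent tries once
        committed = pending if conflict_free else 1
        pending -= committed
    return attempts


# With no conflicts, each agent runs its block exactly once.
print(attempts_needed(1000, conflict_free=True))  # 1000
# With guaranteed conflicts, the work grows quadratically: n * (n + 1) / 2.
print(attempts_needed(1000))                      # 500500
```

In this model the commits still happen one at a time, exactly as in a serial `ask`, while the number of executed blocks grows quadratically with the agentset size.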

On top of that, the STM approach also jettisons another thing that NetLogo holds dear: reproducible results. That is, in NetLogo, you can use `random-seed` to set the model's random seed, and running the same simulation with the same seed will consistently yield the same results. However, with STM, the completion order of the transactions is non-deterministic, so STM also fails to meet NetLogo's reproducibility goals. With that, it looks like we'll need yet another approach.

After thinking about it a bit, we might conclude that the global state is the biggest problem for us (as is pretty much always the case with concurrency problems). Well, if you can't beat 'em... just change the problem! How about we simply declare that changes to global state won't take effect until after an `ask` command ends? That is, if I did the following:


```
to-report silliness
  clear-all
  crt 10
  ask turtles [ set color green ]

  ask turtles [ set label (count turtles with [color = blue]) set color blue ]

  report sort [label] of turtles
end
```

and then ran `silliness` in my proposed variant of NetLogo, I would get back `[0 0 0 0 0 0 0 0 0 0]` (rather than `[0 1 2 3 4 5 6 7 8 9]`, which is what NetLogo currently returns), because each turtle, when running the code block for that last `ask`, would use the global state as of when the `ask` primitive was called, and they wouldn't be able to see the changes that happened on the global state until after the `ask` had entirely finished executing (meaning that the agent's copy of the global state would also need to be threaded through the call stack to any procedures called within the `ask` block; did anyone else just get a monadic shiver going down their spine? <https://wiki.haskell.org/State_Monad>).
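The two behaviors can be modeled outside NetLogo. The following is a minimal Python sketch of the idea, not NetLogo code and not a description of NetLogo's internals: turtles are plain dicts, the helper names (`ask_serially`, `ask_with_snapshot`, `block`) are hypothetical, and the random ordering of a real `ask` is left out because it doesn't affect the sorted result.

```python
import copy


def make_turtles(n):
    # Stand-in for `clear-all crt n ask turtles [ set color green ]`.
    return [{"color": "green", "label": None} for _ in range(n)]


def ask_serially(turtles, block):
    # Current behavior: each agent sees the live world, including
    # changes made by agents that ran before it.
    for me in turtles:
        block(me, turtles)


def ask_with_snapshot(turtles, block):
    # Proposed behavior: every agent reads from the world as it was
    # when the ask began; writes go to the real agent.
    snapshot = copy.deepcopy(turtles)
    for me in turtles:
        block(me, snapshot)


def block(me, world):
    # set label (count turtles with [color = blue]) set color blue
    me["label"] = sum(1 for t in world if t["color"] == "blue")
    me["color"] = "blue"


def silliness(ask):
    turtles = make_turtles(10)
    ask(turtles, block)
    return sorted(t["label"] for t in turtles)


print(silliness(ask_serially))       # [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
print(silliness(ask_with_snapshot))  # [0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
```

Because no agent in the snapshot version reads another agent's writes, the loop body could run in any order, or concurrently, without changing the result.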

This is definitely a bit weird, though. On the other hand, in some ways, it's actually kind of consistent with some of NetLogo's current behavior. For example, if I say `ask turtles [ hatch 1 ]`, and we understand that NetLogo's `ask` can mutate the global state, shouldn't we expect that code to result in an infinite loop (provided that there are turtles in the world when it is called)? That is, one might expect that the newly-hatched turtles would also be `ask`ed, but they're not; NetLogo simply uses what the value was for `turtles` when the `ask` began (just like I'm proposing using the whole global state as of when the `ask` began). Of course, /within/ the `ask` *block*, the world state /does/ change in the current version of NetLogo, as demonstrated by the code `ca crt 2 ask turtles [ hatch 1 show count turtles ]`, which prints out "3" and then "4". Clearly, `turtles` changes within the block. I think I might even find it a little awkward if my above "state snapshot" suggestion were implemented and running this code printed out "2" and then "2". But maybe not....
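The `hatch` behavior described above amounts to iterating over a copy of the agentset while the live set keeps growing. A small Python sketch of that, with turtles as empty dicts and a hypothetical `ask_hatch` helper standing in for `ask turtles [ hatch 1 show count turtles ]`:

```python
def ask_hatch(turtles):
    """Model of `ask turtles [ hatch 1 show count turtles ]`."""
    shown = []
    asked = list(turtles)           # the agentset is fixed when the ask begins
    for _ in asked:
        turtles.append({})          # hatch 1: the live world changes
        shown.append(len(turtles))  # show count turtles: reads the live world
    return shown


world = [{}, {}]         # ca crt 2
print(ask_hatch(world))  # [3, 4]
print(len(world))        # 4: the hatchlings were never asked, so the loop ends
```

Under the "state snapshot" proposal, the `len(turtles)` read would go to the frozen copy instead, which is where the "2" and then "2" would come from.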

Either way, it doesn't seem like we could get away with removing the serial `ask` altogether. For the sake of covering both angles, I suppose there could be an `ask-in-parallel` and an `ask-serially` (which I'm tempted to call "fold-ask", since `ask`ing in serial is very much like a fold of world state, but it's kind of a deceptive name, since folding together monoids can totally be done in parallel, so let's just not go with that name...), even though having two primitives for `ask`ing is less than ideal.

One /cool/ thing about `ask-in-parallel` is that it would not only offer a good speedup simply from running multiple `ask` blocks concurrently, but it could also offer a speedup in an unexpected way: fewer random-number generator draws. RNG draws can actually take up a pretty significant amount of CPU time in a model. When we're doing `ask` serially, we need to do a bunch of RNG draws so we can iterate through the agentset randomly. However, with `ask-in-parallel`, we're baking in the assumption that order doesn't matter. If order /did/ matter, we would have to worry once again about conflicts to global state and reproducibility of results. But, with those things out of the way, `ask` order /doesn't/ matter, so we don't need to worry about doing any RNG draws for determining `ask` order. This is an especially big win when `ask`ing big agentsets.
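As a rough illustration of where those draws come from, here is a Python sketch that counts the RNG draws needed to randomize iteration order with a textbook Fisher-Yates shuffle. That NetLogo shuffles in exactly this way is an assumption; the function name `shuffled_order` is hypothetical.

```python
import random


def shuffled_order(agents, rng):
    """Return a random iteration order plus the number of RNG draws used."""
    order = list(agents)
    draws = 0
    # Fisher-Yates: one draw per position, walking from the end to the front.
    for i in range(len(order) - 1, 0, -1):
        j = rng.randrange(i + 1)
        draws += 1
        order[i], order[j] = order[j], order[i]
    return order, draws


agents = range(10_000)
order, draws = shuffled_order(agents, random.Random(42))
print(draws)  # 9999 draws just to pick the order for one serial ask
# An order-independent ask can iterate over `agents` directly: 0 draws.
```

The cost is linear in the agentset size and is paid on every serial `ask`, so it adds up for large agentsets that are asked every tick.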

But it's not all sunshine and lollipops here; we face some serious problems if agents' actions within the `ask` block aren't restricted. That is, if `ask` blocks can contain arbitrary NetLogo code, what's to stop two different agents from concurrently making conflicting changes (for example, setting global/procedure-level variables concurrently, or calling another `ask` within the current `ask`)? The problem with setting variables within `ask` is that, if multiple agents change the same variable, it's not clear whose end value to use. Similarly, with nested `ask`s, what happens with `ask turtles [ set xcor 0 ask other turtles [ set xcor 1 ] ]`? Is every turtle's `xcor` 0? Or are they all 1? Or are all 1, except the "last one" asked, which would have 0? How is "last one" determined? Keep in mind that it's important that we get reproducible results! None of the options seems particularly promising to me. There are probably many other behaviors (e.g. file operations) that would also be problematic in concurrent `ask` blocks, so maybe things being forbidden should be the rule, rather than the exception. Furthermore, anything that hits the random number generator within the `ask` block would also be a huge problem for getting reproducible results, but maybe you could get around that by giving each agent its own RNG within the `ask`...?
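The per-agent RNG idea can be sketched in Python. This is one possible scheme under stated assumptions, not an existing NetLogo feature: each agent's generator is seeded from the model seed, a counter identifying which `ask` is running, and the agent's ID. The names `agent_rng` and `run_ask` are hypothetical.

```python
import random


def agent_rng(model_seed, ask_index, agent_id):
    # Seeding from a string is deterministic across runs in Python 3.
    return random.Random(f"{model_seed}:{ask_index}:{agent_id}")


def run_ask(model_seed, ask_index, agent_ids):
    """Run a block that draws one random number per agent.

    `agent_ids` may arrive in any order, as it would if the blocks were
    scheduled concurrently; each agent's draw is unaffected by that order.
    """
    results = {}
    for agent_id in agent_ids:
        rng = agent_rng(model_seed, ask_index, agent_id)
        results[agent_id] = rng.random()
    return results


ids = [0, 1, 2, 3]
forward = run_ask(42, 0, ids)
backward = run_ask(42, 0, list(reversed(ids)))
print(forward == backward)  # True: same seed, same results, any order
```

With a single shared generator, the value each agent draws depends on how many agents drew before it; here it depends only on the seed, the `ask`, and the agent.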

I guess the solution would be to forbid an `ask` block from containing these sorts of conflicting behaviors, but that's not a very satisfying solution; it strikes me as very crippling to be forbidding these kinds of not-terribly-uncommon behaviors in `ask` blocks. I certainly know that /I/ like having my agents be able to all write to global variables when I'm testing things out, and I'm quite certain that many models have nested `ask`s, as well. I guess it would make sense if only `ask-in-parallel` had these restrictions, and `ask`s nested within an `ask-serially` could even be `ask-in-parallel`s! It's a bit gross that there would be two `ask`s in NetLogo with very different rules, though, and it's definitely not an elegant solution, but maybe it's the best we could realistically do...? Maybe there could be just one primitive (`ask`) and NetLogo could first try to interpret all `ask`s as what I've been proposing as `ask-in-parallel`, but, if restricted calls were found within the block, NetLogo would then run the block as an `ask-serially`? Maybe? I don't know; this is the point, I think, at which I've been fully reduced to directionless babbling. Enough of that.
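The "try parallel, fall back to serial" idea could look roughly like the following Python sketch. Everything here is hypothetical: the `RESTRICTED` set is a made-up placeholder for whatever would actually be forbidden, and blocks and procedures are modeled as plain lists of the names they call rather than as real NetLogo syntax trees.

```python
# Placeholder list of calls that would be unsafe inside a parallel ask.
RESTRICTED = {"ask", "set-global", "file-write"}


def uses_restricted(calls, procedures, seen=None):
    """Check a block's calls, following user procedures transitively."""
    seen = set() if seen is None else seen
    for name in calls:
        if name in RESTRICTED:
            return True
        if name in procedures and name not in seen:
            seen.add(name)  # guard against recursive procedures
            if uses_restricted(procedures[name], procedures, seen):
                return True
    return False


def choose_ask_mode(calls, procedures):
    if uses_restricted(calls, procedures):
        return "ask-serially"
    return "ask-in-parallel"


procedures = {
    "wiggle": ["rt", "fd"],
    "tally": ["set-global"],
    "spin": ["rt", "spin"],  # recursive, but harmless
}
print(choose_ask_mode(["wiggle", "spin"], procedures))  # ask-in-parallel
print(choose_ask_mode(["fd", "tally"], procedures))     # ask-serially
```

The check has to follow procedure calls, since a restricted call buried in a procedure is just as much of a problem as one written directly in the block.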

So, ultimately, I don't have a great answer for you in terms of how best to handle concurrency in agent-based modeling. It seems to be a pretty tough problem. The world could probably produce a dozen Ph.D. theses on the matter and still not come up with a good solution to the problem. If anyone thinks they know of a better solution, I'd be glad to hear about it, if for no other reason than to sate my curiosity.


On 1/24/2015 10:51 AM, [email protected] wrote:

Have you considered using Akka for concurrent functioning of agents in NetLogo?

If yes, what are the pitfalls of this approach to concurrency?

