On Jan 11, 2006, at 6:21 AM, GHC wrote:

#650: Improve interaction between mutable arrays and GC
------------------------------- +--------------------------------------------
    Reporter:  simonmar        |        Owner:
        Type:  task            |       Status:  new
    Priority:  normal          |    Milestone:  6.6
   Component:  Runtime System  |      Version:  6.4.1
    Severity:  normal          |     Keywords:
          Os:  Unknown         |   Difficulty:  Moderate (1 day)
Architecture:  Unknown         |
------------------------------- +-------------------------------------------- This is the result of a discussion between myself and Simon PJ a few weeks
 ago, inspired by recent discoveries of poor performance with mutable
 arrays (eg. see Data.Hash discussion on glasgow-haskell-users around
 October 2005).

Note that all this applies to mutable arrays of pointers, i.e. `IOArray` and `STArray`, ''not'' to unboxed arrays, i.e. `IOUArray` and `STUArray`.
...
 === The Problem ===

 The problem is that mutable arrays are always completely traversed on
every GC. To get around this, we can keep an array in a frozen state and thaw it just before writing, then freeze it again afterward. This is a bit inconvenient, not to mention unsafe with multiple threads unless extra
 locking is used.

Yes. I'd love it if "freeze" and "thaw" just affected types plus (perhaps) the ability for mutations to occur in the first place (so that I don't freeze in one thread and mutate in another).

Given that virtually every object access in GHC involves a hidden read barrier (entering a thunk is effectively a read barrier which checks computedness), I'm not too worried about the costs of a write barrier for mutation in the IO monad. I'd welcome some up-front cost to avoid the GC problems.

 === Solutions ===

You should think seriously about keeping a write list for arrays as an alternative to cards or a full scan, as Bulat suggested. There are card/write list hybrids.

If you put a card mark in the block table, there's no particular need to modify the array representation beyond adding a "dirty" bit. The write barrier would look something like this:
  * Set array's dirty bit
  * Set card mark on appropriate block
  * Store pointer into array
Then during GC we scan the dirty blocks of dirty arrays.

Small arrays will fit in a block and will be scanned completely. Larger arrays will span more than one block and benefit from the card marks. But there's no need to actually distinguish these cases if you're using cards.

Note also that we can keep multiple card flags in the block descriptor, so there's actually opportunity for tuning here. It's trading off up-front bit-twiddling for lower GC overhead, though.

Do bear in mind that it's big arrays which really seem to be causing the trouble here. So I think some solution which does work proportional to the amount of mutation is in order.

For IORefs the game is utterly different; I'm not sure what to do about them. You'd like to avoid traversing a list of all the IORefs at each collection. Very tricky. Here it might be worth scanning just the dirty blocks.

Perhaps there ought to be an IORef list per block?

-Jan-Willem Maessen

[Who's very interested in seeing this solved well...]

_______________________________________________
Glasgow-haskell-bugs mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/glasgow-haskell-bugs

Reply via email to