Two suggestions for safe refcounting

Zach the Mystic via Digitalmars-d Thu, 05 Mar 2015 23:51:48 -0800

As per deadalnix's request, a summary of my thoughts regardingthe thread "RCArray is unsafe":

It's rather easy to guarantee memory safety from the safeconfines of a garbage collected system. Let's take this as agiven.

It's much harder when you step outside that system and try tofigure out when it is or isn't safe to delete memory. Itshouldn't be too surprising, therefore, that there are lots ofpitfalls. Reference counting is a lonely outpost in thewilderness which is otherwise occupied by manual memorymanagement. It's the only alternative to chaos.

But the walls protecting this outpost are easily breached by anydangling reference which is not accounted for.

We have seen two instances of how this can occur. The first, whenboiled down to its essence, is that there is no correspondingbump in the reference count for a parameter which can alias anexisting reference:


void fun(ref RCStruct a, ref RCStruct b);
RCStruct c;
fun(c,c); // c aliases itself

void gun(ref RCStruct a);
static RCStruct d;
gun(d); // d aliases global d

Because the workarounds are easy:
{
  RCStruct c;
  auto tmp = c;
  fun(c,tmp);

  auto tmp2 = d;
  gun(tmp2);
}
...it seems okay to mark these rare violations @system.

The second, harder problem, is when you take a reference to asubcomponent of an RC'd type, e.g. an individual E of an RCArrayof E:


struct RCArray(E) {
  E[] array;
  int* count;
  ...
}
auto x =  RCArray([E()]);
E* t = &x[0];

Here's the problem. If x is assigned to a different RCArray, theone t points to will be deleted. On the other hand, if somespecial logic allows the definition of t to increment thereference count, then you have a memory leak, because t is notdesigned to keep track of x's original counter.

I don't know if we can get out of this mess. My suggestionrepresents a best-effort attempt. The only way I can see out ofthis problem is to redesign RCArray.

The problem with RCArray is that it "owns" the data itreferences. If a type different from RCArray, i.e. an individualE* into the array of E[], tries to reference the data, it'sstuck, because it's not an RCArray!E. Therefore, you need toseparate out the core data from the different types that canpoint to it. The natural place would be right next to itsreference counter, in a separate struct:


struct RCData {
  int count = 0;
  void[] chunk;

  this(size_t size) {
    chunk = new void[size];
  }
  void addRef() {
    ++count;
  }
  void decRef() {
    if (--count == 0)
      delete chunk;
  }
}

Now RCArray can be redesigned to point to an RCData type. All newRC types will also contain a pointer to an RCData instance:


struct RCArray(E) {
  E[] array;
  private RCData* data;

  this(E[] a) {
    data = new RCData(a * sizeof(a));
    data.chunk = cast(void[]) a;
    array = a;
  }

  this(this) {
    data.addRef();
  }

  ~this() {
    data.decRef();
  }

  ref RCElement!E opIndex(size_t i) return {
    return RCElement!E(array[i], data);
  }
  ...
}

Note how the last member, opIndex, doesn't return a raw E*, butonly an E* which is paired with a pointer to the same RCDatainstance as the RCArray is:


struct RCElement(E) {
  E* element;
  private RCData* data;

  this(this) {
    data.addRef();
  }
  ~this() {
    data.decRef();
  }
}

This is the best I could do.

Two suggestions for safe refcounting

Reply via email to