I ported a classic DeltaBlue benchmark to D, and for allocations I use a mix of malloc/free (for structs with some trailing inline arrays), and new for some other structs and arrays.

After running it, it felt kind of slow (I did not compare it to any other language yet, so no idea if it is actually slower, just feels slow), so I did some profiling (perf / hotspot mostly), and noticed that substantial amount of time is spent in gc (both allocation and collection). This is on Linux, and benchmark indeed creates a lot of garbage, as I rerun and rerun it in a loop.

In the past I used custom per-type allocator using D1 style allocator methods, mostly together with hand coded free lists (was working fine and was short).

Now I tried to use new `std.experimental.allocator`, and have trouble understanding how to use them.

```d
import std.experimental.allocator : allocatorObject, ThreadLocal;
import std.experimental.allocator.building_blocks.free_list : FreeList;
import std.experimental.allocator.gc_allocator : GCAllocator;
import std.experimental.allocator.mallocator : Mallocator;
import std.experimental.allocator.common : unbounded;
// static auto variableAllocator = allocatorObject(FreeList!(GCAllocator, Variable.sizeof)()); // static auto variableAllocator = allocatorObject(FreeList!(GCAllocator, Variable.sizeof, unbounded)()); // static auto variableAllocator = allocatorObject(FreeList!(Mallocator, Variable.sizeof, unbounded)()); alias VariableAllocator = ThreadLocal!(FreeList!(Mallocator, Variable.sizeof, unbounded));

// VariableAllocator variableAllocator;
// this() {
//   variableAllocator = VariableAllocator.instance;
//}

struct Variable {
  long value;
  List!Constraint* constraints;
  Constraint* determinedBy;
  long mark;
  Strength walkStrength;
  bool stay;
  // char[10] name;
  string name;

  static Variable* Create(string name, long initialValue) {
    // auto new_ = new Variable();
    auto new_ = VariableAllocator.instance.make!Variable();
    // auto new_ = variableAllocator.make!Variable();
    new_.value = initialValue;  // CRASH
    ...
    return new_;
  }
  static void Destroy(Variable* v) {
assert(v.constraints !is null, "bad Variable struct; already freed?");
    List!Constraint.Destroy(v.constraints);
    v.constraints = null;
    // free(v);
    // destroy(v);
    VariableAllocator.instance.dispose(v);
  }

  ...
  ...
}
```


It crashes at `new_.value`, as `new_` is `null`

```
Reading symbols from ./t1_gdc_debug...
(gdb) r
Starting program: /home/user/benchy/t1_gdc_debug
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".

Program received signal SIGSEGV, Segmentation fault.
0x000055555557c86a in deltablue.Variable.Create(immutable(char)[], long) (name=..., initialValue=0) at /home/user/benchy/deltablue.d:248
248         new_.value = initialValue;
(gdb) bt
#0 0x000055555557c86a in deltablue.Variable.Create(immutable(char)[], long) (name=..., initialValue=0) at /home/user/benchy/deltablue.d:248 #1 0x000055555557f14e in deltablue.ChainTest(int) (n=100) at /home/user/benchy/deltablue.d:901 #2 0x000055555557f88c in deltablue.deltablue(out ulong, out ulong) (iters=@0x7fffffffd9b8: 0, flops=@0x7fffffffd9c0: 0)
    at /home/user/benchy/deltablue.d:999
#3 0x00005555555e2b3e in b (__capture=0x7fffffffdba0, name=..., expectedResult=0) at /home/user/benchy/main.d:60 #4 0x00005555555e2417 in D main (args=...) at /home/user/benchy/main.d:91
(gdb) q

```


I want to use FreeList with parent allocator being Mallocator. Never return allocations back to Mallocator, and for each instance to be thread-local, AND for accessing thread-local instance to not require ANY locks.

Documentation for `std.experimental.allocator` is kind of vague, complicated and not very prescriptive.

Help would be appreciated.

Reply via email to