Looks like it's time to spend some more time with perf:http://i.imgur.com/k50dFbU.png
Nice!
I had to hack the ddmd code to get it compile (more "1337 h4x" were required to compile with LDC than with DMD), so I haven't uploaded the code for the benchmark to Github yet.
Were you compiling it with 2.066.1 or master? I'd be interested to see the changes you needed.
