And what about single threaded version?
Just ran the single thread examples after I moved time start before array allocation, thanks for that, good catch. Still better results in Java:
- java: 21 secs, 612 ms - with std.math: dmd: 23 secs, 994 ms ldc: 31 secs, 668 ms gdc: 52 secs, 576 ms - with core.stdc.math: dmd: 30 secs, 724 ms ldc: 30 secs, 988 ms gdc: time: 25 secs, 970 ms