On Saturday, 20 May 2017 at 03:24:41 UTC, 9il wrote:
The reason is that `matrixX[i, j]` is opIndex call, opIndex is a function. It can be inlined. But optimizers can not split its body and move half of opIndex computations out of the inner loop, which it required for vectorization.
Hmm, look like new LLVM solves this issue. Need to do more benchmarks...
