If the function is outside the class, the code runs much faster. I'm obviously doing something wrong and would appreciate any help with this.
Interesting test case, thanks :-) Adding "final" to the class method nullifies the speed difference. Somehow, LDC does not devirtualize the call in your test case. Without the for-loops, the call is nicely devirtualized, so there is no performance difference.
-Johan
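
For readers following along, here is a minimal sketch of the kind of change described above. It is not the original test case: the class `Accumulator` and the functions `addVirtual`, `addFinal`, and `addFree` are hypothetical names made up for illustration. It only shows where `final` goes and why it matters: class methods in D are virtual by default, so a call through a class reference is dispatched via the vtable unless the compiler can prove the dynamic type, while a `final` method cannot be overridden and can be bound statically.

```d
import std.stdio;

// Hypothetical class, for illustration only.
class Accumulator
{
    // Virtual by default: may be dispatched through the vtable
    // if the optimizer fails to devirtualize the call.
    int addVirtual(int a, int b) { return a + b; }

    // `final` forbids overriding, so the call can be resolved
    // at compile time and is a candidate for inlining.
    final int addFinal(int a, int b) { return a + b; }
}

// Free function for comparison: never virtual.
int addFree(int a, int b) { return a + b; }

void main()
{
    auto acc = new Accumulator;
    long viaVirtual = 0, viaFinal = 0, viaFree = 0;

    // Nested loops, mirroring the shape of the situation described above.
    foreach (i; 0 .. 1000)
        foreach (j; 0 .. 1000)
        {
            viaVirtual += acc.addVirtual(i, j);
            viaFinal   += acc.addFinal(i, j);
            viaFree    += addFree(i, j);
        }

    // All three variants compute the same result; only the dispatch differs.
    assert(viaVirtual == viaFinal && viaFinal == viaFree);
    writeln(viaVirtual);
}
```

Whether the virtual variant is actually slower depends on the compiler version and optimization settings, so it is worth timing each loop separately before drawing conclusions.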