It is amazing that the proto expression is faster than the naive one.
The compiler must really love the way proto evaluates an expression.
I still don't really know why. The usual speed-up in our use cases here
ranges from 10 to 50%.
That's weird.

Well, for me it's weird in a good way, so I don't complain. An old version of nt2 had cases where
we were thrice as fast as the same vector+iterator based code ...
