One issue might be that you change the type of L, which I believe boxes it (but someone closer to the compiler will have to verify).
Maybe try using a different variable for the result of the decomposition? Cheers, Kevin On Tuesday, June 3, 2014, Chris Foster <chris...@gmail.com <javascript:_e(%7B%7D,'cvml','chris...@gmail.com');>> wrote: > On Wed, Jun 4, 2014 at 2:12 AM, Chris Foster <chris...@gmail.com> wrote: > > fiddling with Base.BLAS.dot only got me as far as a segfault so far. > > Actually I think I've fixed that now in the gist and using BLAS.dot > directly is faster, though still not very impressive. According to > @time, I've still got some mystery allocations somewhere, but I can't > see where. Ideas anyone? >