It is always easier to discuss if there is a piece of code to look at. Could you perhaps post a few code examples that do not run as fast as you want?
Also, make sure to look at https://github.com/IntelLabs/ParallelAccelerator.jl. They have a quite sophisticated compiler that does loop fusion, parallelization, and other cool stuff.

On Thursday, May 12, 2016 at 7:22:24 AM UTC+2, Anonymous wrote:
>
> This remains one of the main drawbacks of Julia, and the devectorize
> package is basically useless as it doesn't support some really crucial
> vectorized operations. I'd really prefer not to rewrite all my vectorized
> code into nested loops if at all possible, but I really need more speed,
> can anyone tell me the timeline and future plans for making vectorized code
> run at C speed?
>
