This is great stuff. Initial observations (under Linux/GCC) are that native threads are about 20% faster than OpenMP, so I surmise you are feeding LLVM some very tasty code. (I tested long loops with straightforward memory access.)
On the other hand, some of the earlier posts make me think that you were leveraging the strong vector optimization of the Intel C compiler and its tight coupling to MKL libraries. If so, is there any prospect of getting LLVM to take advantage of MKL? On Wednesday, October 26, 2016 at 8:13:38 PM UTC-4, Todd Anderson wrote: > > Okay, METADATA with ParallelAccelerator verison 0.2 has been merged so if > you do a standard Pkg.add() or update() you should get the latest version. > > For native threads, please note that we've identified some issues with > reductions and stencils that have been fixed and we will shortly be > released in version 0.2.1. I will post here again when that release takes > place. > > Again, please give it a try and report back with experiences or file bugs. > > thanks! > > Todd >