Anybody have any idea as a workaround? My program has very high
data throughput, and never actually handles the array elements
individually, just manipulates them simd chunks at once.
Validating the content is important, but performance is taking
a real toll...
One possible workaround is to use simd operation eplicitly. I've
played around a bit with this and came up with this:
https://gist.github.com/jerro/4988229
This currently only works with GDC and LDC (but it probably could
be made to work with DMD too). It uses std.simd (not in phobos
yet) from here:
https://github.com/TurkeyMan/phobos