Anybody have any idea as a workaround? My program has very high data throughput, and never actually handles the array elements individually, just manipulates them simd chunks at once. Validating the content is important, but performance is taking a real toll...

One possible workaround is to use simd operation eplicitly. I've played around a bit with this and came up with this:

https://gist.github.com/jerro/4988229

This currently only works with GDC and LDC (but it probably could be made to work with DMD too). It uses std.simd (not in phobos yet) from here:

https://github.com/TurkeyMan/phobos

Reply via email to