On Sun, Aug 15, 2010 at 10:00 AM, Ben Kloosterman <[email protected]> wrote: > Yes this is the biggest issue for SIMD , that said even if you remove > 70-90% of intrinsic lines it's a big win , will keep a solution simpler and > many programs will run without them and possibly on other platforms.
That does depend what your goal is. If you're trying to stuff SIMD into the compiler then this might be feasible (although I've never benchmarked the kinds of things you're proposing so I don't know how much of a win it is in practice.) If you want to write numerical/multimedia algorithms then the best place to put effort is in to making using intrinsics work as smoothly with the rest of the language. (You can almost point to which numerical/multimedia algorithm a given SSE instruction comes from; ARM is a bit more of an attempt at an orthogonal instruction set, but you can still see the motivating cases.) FWIW, it's very, very rare for me to see something like memcpy show up on a time profile, although routines like correlation, FFT, etc, show up all the time. _______________________________________________ bitc-dev mailing list [email protected] http://www.coyotos.org/mailman/listinfo/bitc-dev
