== Quote from Don ([email protected])'s article > Of course, in the occasions when SSE lets you do 4 operations at once, > you get nearly a 4X speedup...
Is SSE(2) inherently faster then (at least in real-world implementations) than x87, even when you don't vectorize? Would I be able to expect any speedup from going from x87 to SSE(2) for code that has a decent amount of implicit instruction level parallelism but wasn't explicitly vectorized either by me or the compiler?
