== Quote from Don's article
> Of course, in the occasions when SSE lets you do 4 operations at once,
> you get nearly a 4X speedup...

Is SSE(2) inherently faster than x87 (at least in real-world implementations),
even when you don't vectorize?  Could I expect any speedup from going from x87
to SSE(2) for code that has a decent amount of implicit instruction-level
parallelism but wasn't explicitly vectorized, either by me or by the compiler?
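
To make the question concrete, here is a rough sketch of the kind of code I
mean, written in C purely for illustration. The function name dot4 and the
four-accumulator layout are my own invention, not taken from any real code
base. Each accumulator forms its own dependency chain, so the CPU can overlap
the additions even though every operation is scalar.

```c
#include <stddef.h>

/* Hypothetical example: a dot product with four independent accumulators.
 * Nothing here is explicitly vectorized; the parallelism is only implicit,
 * because s0..s3 do not depend on one another inside the loop. */
double dot4(const double *a, const double *b, size_t n)
{
    double s0 = 0.0, s1 = 0.0, s2 = 0.0, s3 = 0.0;
    size_t i = 0;

    /* Main loop: four independent multiply-add chains per iteration. */
    for (; i + 4 <= n; i += 4) {
        s0 += a[i]     * b[i];
        s1 += a[i + 1] * b[i + 1];
        s2 += a[i + 2] * b[i + 2];
        s3 += a[i + 3] * b[i + 3];
    }

    /* Remainder: handle the last n % 4 elements. */
    for (; i < n; i++)
        s0 += a[i] * b[i];

    return (s0 + s1) + (s2 + s3);
}
```

Assuming GCC on 32-bit x86, one way to compare the two code paths would be to
build this once with -mfpmath=387 and once with -msse2 -mfpmath=sse, adding
-fno-tree-vectorize so the compiler keeps the SSE version scalar, and then
time both. I have not measured this myself, so I don't know which way it goes.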
