On Wed, 6 Oct 2004, Dieter Nützel wrote:

On Wednesday, 6 October 2004 03:52, Ian Romanick wrote:
Here's a simple patch that speeds up glReadPixels in 24-bit by about 50%
(on my box).  I measured using the benchmark built into
progs/demos/readpix.  The interesting thing is that the core MMX & SSE2
routines can be used for other cards as well.  For example, it looks like
MGA, Unichrome, and others can use the same code for 24-bit.

Stupid question: aren't newer versions of gcc capable of producing SSE/MMX code? Would it be enough just to turn on the appropriate flags?


                        best

                          Vladimir Dergachev
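
To make the question above concrete, here is a minimal sketch of the kind of plain C loop the compiler would have to vectorize. The function name read_rgba_span_c and the assumed ARGB8888 source layout are illustrative assumptions, not taken from the patch. Building it with something like `gcc -O3 -msse2` lets gcc try to emit SSE2 code, but whether a given gcc version actually vectorizes this loop is not guaranteed.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not from the patch): convert n packed ARGB8888
 * pixels into R,G,B,A byte order.  'restrict' tells the compiler the
 * buffers do not overlap, which helps any auto-vectorizer. */
static void read_rgba_span_c(const uint32_t *restrict src,
                             uint8_t *restrict dst, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        uint32_t p = src[i];
        dst[4 * i + 0] = (uint8_t)(p >> 16); /* red   */
        dst[4 * i + 1] = (uint8_t)(p >> 8);  /* green */
        dst[4 * i + 2] = (uint8_t)p;         /* blue  */
        dst[4 * i + 3] = (uint8_t)(p >> 24); /* alpha */
    }
}
```

Comparing the generated assembly (`gcc -S`) against the hand-written routines would show how close the compiler gets.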


Before pursuing this too far, I'd like to look at ways to make the *compiled* code from spantmp.h more device-independent. That would make it easier to generate a bunch of these generic routines and just plug them in.

You have forgotten "read_rgba_span_SSE2.S".

What about MMX2, 3DNow, 3DNow2 (pro), SSE (1)?

It would be nice if we had something like MPlayer's runtime CPU detection:

CPU: Advanced Micro Devices Athlon 4 /Athlon MP/XP Palomino 1763 MHz (Family: 6, Stepping: 2)
Detected cache-line size is 64 bytes
CPUflags:  MMX: 1 MMX2: 1 3DNow: 1 3DNow2: 1 SSE: 1 SSE2: 0
Compiled with runtime CPU detection - WARNING - this is not optimal!

What do you think?

-Dieter
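
As a rough sketch of the MPlayer-style runtime detection suggested above: pick a span routine once at startup through a function pointer. All function names here are hypothetical, and the MMX/SSE2 entries are placeholders that simply call the generic C loop where a real driver would call its assembly routines. The check uses `__builtin_cpu_supports`, a builtin from later GCC releases on x86, so this assumes a compiler that provides it; on other targets the sketch falls back to the generic routine.

```c
#include <stddef.h>
#include <stdint.h>

typedef void (*read_span_fn)(const uint32_t *src, uint8_t *dst, size_t n);

/* Generic fallback: ARGB8888 pixels to R,G,B,A bytes (assumed layout). */
static void read_span_generic(const uint32_t *src, uint8_t *dst, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        uint32_t p = src[i];
        dst[4 * i + 0] = (uint8_t)(p >> 16);
        dst[4 * i + 1] = (uint8_t)(p >> 8);
        dst[4 * i + 2] = (uint8_t)p;
        dst[4 * i + 3] = (uint8_t)(p >> 24);
    }
}

#if defined(__GNUC__) && (defined(__i386__) || defined(__x86_64__))
/* Placeholders standing in for hand-written MMX and SSE2 routines. */
static void read_span_mmx(const uint32_t *src, uint8_t *dst, size_t n)
{
    read_span_generic(src, dst, n);
}

static void read_span_sse2(const uint32_t *src, uint8_t *dst, size_t n)
{
    read_span_generic(src, dst, n);
}
#endif

/* Choose the best available routine for the CPU we are running on. */
static read_span_fn choose_read_span(void)
{
#if defined(__GNUC__) && (defined(__i386__) || defined(__x86_64__))
    __builtin_cpu_init();
    if (__builtin_cpu_supports("sse2"))
        return read_span_sse2;
    if (__builtin_cpu_supports("mmx"))
        return read_span_mmx;
#endif
    return read_span_generic;
}
```

A driver would call choose_read_span() once during context creation and store the result, so the per-span cost is a single indirect call.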


