Re: [Xpert]Using MMX assembly (for video card drivers

2002-01-11 Thread Ewald Snel
At 11:26 AM 4/01/02 +0100, Ewald Snel wrote: (sorry for the duplicate message, it was delayed for one week (see date)) [...] It would be interesting to see if the same could be achieved with 3DNow! instructions, as this would provide a welcome boost for anyone with an AMD K6-2 or K6-3 or

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-11 Thread Ewald Snel
[...] BTW - does anyone know why the mga driver internally converts to 422 format ? It seems to me that mga 400 and 450 chips do support 420 planar format... (I saw some sample code using it, I can probably find it back if needed). I think XFree would benefit from using this feature instead

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-10 Thread Stuart Young
At 11:26 AM 4/01/02 +0100, Ewald Snel wrote: Hi, Could I use MMX assembly for improving the mga video driver? I wrote a vertical chrominance filter (*) for the XVideo module using inline MMX assembly. This allows me to improve output quality without any speed penalty. It would be interesting to

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread Billy Biggs
Ewald Snel ([EMAIL PROTECTED]): Of course, I'm using #ifdef USE_MMX_ASM and the original C code as an alternative for other CPU architectures. Runtime detection of MMX support is not included yet, but will be added if MMX is allowed. I've also been playing with some mmx-ification of the

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread greg wright
I've also been playing with some mmx-ification of the XVideo routines, for example I also did an SSE-4:2:0-to-4:2:2 function. I just did this too, MMX only though. How many cycles/pixel did you end up with? What percentage of pairing did you achieve? There was some discussion on #xfree86

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread Erik Walthinsen
On Fri, 4 Jan 2002, greg wright wrote: I just did this too, MMX only though. How many cycles/pixel did you end up with? What percentage of pairing did you achieve? Note that only P5-core chips care about pairing, per-se. There are much nastier issues involved in modern P6 cores. I haven't

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread Billy Biggs
I've also been playing with some mmx-ification of the XVideo routines, for example I also did an SSE-4:2:0-to-4:2:2 function. I just did this too, MMX only though. How many cycles/pixel did you end up with? What percentage of pairing did you achieve? I'll get some numbers in a sec.

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread Ewald Snel
Hi, I wrote a vertical chrominance filter (*) for the XVideo module using inline MMX assembly. This allows me to improve output quality without any speed penalty. Do you mean for upsampling to 4:2:2 ? How do you filter? Do you average to create the new chroma line? Something like

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread Erik Walthinsen
On Fri, 4 Jan 2002, Billy Biggs wrote: Please, please correct me if I'm wrong here. In MPEG sampling, the chrominance sample is halfway between the two luminance samples on the same vertical scanline (by is138182): o o where o == luma sample x x == chroma

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread Ewald Snel
Hi, [...] Something like that, the filter uses 0.75x nearest chrominance sample and 0.25x second nearest chrominance sample. This is more accurate as it doesn't shift the chrominance signal by 1 pixel. Please, please correct me if I'm wrong here. In MPEG sampling, the chrominance

Re: [Xpert]Using MMX assembly (for video card drivers)

2002-01-04 Thread Billy Biggs
To reply to my own mail :) Billy Biggs ([EMAIL PROTECTED]): It's actually 0.5 pixel (my mistake :)) using the following filter : o o (c=c1) c1 o o (c=.5*c1 + .5*c2) o o (c=c2) c2 o o (c=.5*c2 + .5*c3) I don't think this is right for MPEG2. I sent