Re: [FFmpeg-devel] [PATCH] swscale_unscaled: fix and speed up DITHER_COPY macro for x86 with SSE2

2017-09-22 Thread James Almer
On 9/22/2017 2:06 PM, Mateusz wrote: > W dniu 2017-09-22 o 17:47, James Almer pisze: >> On 9/22/2017 12:23 PM, Mateusz wrote: >>> New version of the patch -- now it uses the same logic independent of the >>> target bitdepth. >>> >>> For x86_64 it is much faster than current code (with perfect qual

Re: [FFmpeg-devel] [PATCH] swscale_unscaled: fix and speed up DITHER_COPY macro for x86 with SSE2

2017-09-22 Thread Mateusz
W dniu 2017-09-22 o 17:47, James Almer pisze: > On 9/22/2017 12:23 PM, Mateusz wrote: >> New version of the patch -- now it uses the same logic independent of the >> target bitdepth. >> >> For x86_64 it is much faster than current code (with perfect quality), for >> x86_32 it is fast >> if you ad

Re: [FFmpeg-devel] [PATCH] swscale_unscaled: fix and speed up DITHER_COPY macro for x86 with SSE2

2017-09-22 Thread James Almer
On 9/22/2017 12:23 PM, Mateusz wrote: > New version of the patch -- now it uses the same logic independent of the > target bitdepth. > > For x86_64 it is much faster than current code (with perfect quality), for > x86_32 it is fast > if you add to configure: --extra-cflags="-msse2" > (for x86_32

[FFmpeg-devel] [PATCH] swscale_unscaled: fix and speed up DITHER_COPY macro for x86 with SSE2

2017-09-22 Thread Mateusz
New version of the patch -- now it uses the same logic independent of the target bitdepth. For x86_64 it is much faster than current code (with perfect quality), for x86_32 it is fast if you add to configure: --extra-cflags="-msse2" (for x86_32 with default configure options it is slower than cu