2017-07-20 18:07 GMT+02:00 maxime taisant <[email protected]>: > From: Maxime Taisant <[email protected]>
> I am currently working on SSE optimisations for the dwt functions used to > decode JPEG2000. > For the moment, I have only managed to produce a SSE-optimized version of the > sr_1d97_float function > (with relatively good results). Please add some numbers. > + if (ARCH_X86) > + ff_sr_1d97_float_sse(line, mh, mh + lh); > + else > + sr_1d97_float(line, mh, mh + lh); This is not correct, look at ff_jpeg2000dsp_init_x86() and Jpeg2000DSPContext to see what has to be done. Others will comment on the asm code. Thank you, Carl Eugen _______________________________________________ ffmpeg-devel mailing list [email protected] http://ffmpeg.org/mailman/listinfo/ffmpeg-devel
