It is the same speed as or slower than the SSE2 version. I guess this block size is too small for AVX2. I have sent a demo to improve the SSE2 code.
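For context, here is a rough scalar sketch of what add_ps[8x8] computes at 10bpp. The function name, argument order and helper are illustrative assumptions, not the actual x265 source. At 10bpp each pixel is stored in 16 bits, so one 8-pixel row is 16 bytes, which already fills a 128-bit XMM register exactly. A 256-bit YMM register would have to hold two rows at once, which may be why AVX2 shows no gain at this size.

```c
#include <stddef.h>
#include <stdint.h>

#define BIT_DEPTH 10                      /* assumed 10bpp build */
#define PIXEL_MAX ((1 << BIT_DEPTH) - 1)  /* 1023 */

/* Hypothetical helper: clamp a sum to the valid pixel range. */
static uint16_t clip_pixel(int v)
{
    if (v < 0)
        return 0;
    if (v > PIXEL_MAX)
        return (uint16_t)PIXEL_MAX;
    return (uint16_t)v;
}

/* Illustrative reference: dst = clip(pred + resi) over an 8x8 block.
 * Strides are in elements, not bytes. */
static void add_ps_8x8_ref(uint16_t *dst, ptrdiff_t dst_stride,
                           const uint16_t *pred, const int16_t *resi,
                           ptrdiff_t pred_stride, ptrdiff_t resi_stride)
{
    for (int y = 0; y < 8; y++) {
        /* One row: 8 pixels * 2 bytes = 16 bytes = one XMM register. */
        for (int x = 0; x < 8; x++)
            dst[x] = clip_pixel(pred[x] + resi[x]);
        dst += dst_stride;
        pred += pred_stride;
        resi += resi_stride;
    }
}
```

The whole block is only 128 bytes of pixel data, so there is very little work over which to amortize any extra lane shuffling that wider registers need.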
At 2015-03-02 16:47:23, [email protected] wrote:
># HG changeset patch
># User Sumalatha Polureddy<[email protected]>
># Date 1425286035 -19800
># Node ID 1be088c8bc675752ebfebc4fda3bad41659269a4
># Parent a9ad4d8202796dfb78e9d180f5fdb7cc0996ea66
>asm: avx2 code for add_ps[8x8] for 10bpp -- 24.9x
>
>add_ps[ 8x8] 24.97x 275.68 6882.88
>
_______________________________________________
x265-devel mailing list
[email protected]
https://mailman.videolan.org/listinfo/x265-devel
