>+ mova m3, [r3 - 6 * 16] ; [10] >+ mova m6, [r3 - 12 * 16] ; [04] >+ pmaddubsw m4, m0, m3 >+ pmulhrsw m4, m7 >+ pmaddubsw m1, m5, m6 >+ pmulhrsw m1, m7 >+ packuswb m4, m1 >+ mova m3, [r3 + 14 * 16] ; [30]
the constant in m3 use only one time, don't need load into register pmaddubsw m4, m0, [r3 - 6 * 16] ; [10]
_______________________________________________ x265-devel mailing list [email protected] https://mailman.videolan.org/listinfo/x265-devel
