pxor didn't make uops, and m7 is temporary in your macro
At 2015-03-13 06:40:10,dave <[email protected]> wrote: >On 03/12/2015 03:16 PM, chen wrote: >> I use 'pxor m7,m7' to replace your [pb_0], but it is same cycles in >> IACA, the bottleneck on Port0 >> Not sure how about performance on old CPU >I would have used something like that but there are no available >registers by that point. They are used up on holding other >constants(pw_planar..) in the case of x86_64 and there just aren't >enough in x86_32. Performance on my old CPU seems unaffected by using >constants in registers or from memory. >_______________________________________________ >x265-devel mailing list >[email protected] >https://mailman.videolan.org/listinfo/x265-devel
_______________________________________________ x265-devel mailing list [email protected] https://mailman.videolan.org/listinfo/x265-devel
