This patch is pending, right Nabajit? I haven't pushed the luma_vss/chroma_vss
assembly patches or the testbench edits to luma_vss.




On Wed, Jan 29, 2014 at 1:56 PM, chen <chenm...@163.com> wrote:

> >@@ -5105,8 +5108,9 @@
> >     pmaddwd    m5, [r6 + 3 * 16]
>
> >     paddd      m1, m5                          ;m1=[1+2+3+4+5+6+7+8]  Row2 end
> >     psrad      m1, 6
> >-
> >-    packssdw   m0, m1
> >+    pand       m1, m7
> >+
> >+    packusdw   m0, m1
> >
> >     movlps     [r2], m0
> >     movhps     [r2 + r3], m0
> PAND + PACKUSDW may avoid the overflow problem, but it is the wrong approach
> here. As you said, you got a result value of 0x8D84, which overflows 16 bits,
> so we need to find the real reason.
> I checked the HM code; it uses Short, so I suggest you capture the input data
> and feed it into HM to check HM's output.
>
>
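To make the difference between the two packs concrete, here is a rough scalar sketch in C of what happens to a single 32-bit lane holding the 0x8D84 value mentioned above. The function names are made up for illustration and are not x265 or HM code, and I am assuming the value sits in the lane as a positive 32-bit integer. The PAND step is not modeled.

```c
#include <stdint.h>

/* Scalar model of one PACKSSDW lane: signed 32-bit -> signed 16-bit,
 * saturating at INT16_MIN / INT16_MAX. Hypothetical helper name. */
static int16_t pack_signed_sat(int32_t v)
{
    if (v > INT16_MAX) return INT16_MAX;
    if (v < INT16_MIN) return INT16_MIN;
    return (int16_t)v;
}

/* Scalar model of one PACKUSDW lane: signed 32-bit -> unsigned 16-bit,
 * saturating at 0 / UINT16_MAX. Hypothetical helper name. */
static uint16_t pack_unsigned_sat(int32_t v)
{
    if (v > UINT16_MAX) return UINT16_MAX;
    if (v < 0) return 0;
    return (uint16_t)v;
}

/* Plain truncation to 16 bits, as when a wider value is stored into a
 * signed 16-bit type. Assumes the usual two's-complement wrap, which the
 * C standard leaves implementation-defined. */
static int16_t wrap_to_short(int32_t v)
{
    return (int16_t)(uint16_t)(v & 0xFFFF);
}
```

With 0x8D84 as input, the signed pack clamps to 0x7FFF, the unsigned pack passes 0x8D84 through unchanged, and the plain truncation reads back as -29308. The three disagree, so comparing against HM's output on the same input should show which behaviour is actually expected.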
> _______________________________________________
> x265-devel mailing list
> x265-devel@videolan.org
> https://mailman.videolan.org/listinfo/x265-devel
>
>
