Hi,

On Thu, Jul 7, 2016 at 5:25 AM, Alexandra Hájková <
[email protected]> wrote:

>          else if (lc->cu.pred_mode == MODE_INTRA && c_idx == 0 &&
>                   log2_trafo_size == 2)
> -            s->hevcdsp.transform_4x4_luma_add(dst, coeffs, stride);
> +            s->hevcdsp.idct_4x4_luma(coeffs);
>

This is not an idct.


> +    s->hevcdsp.add_residual[log2_trafo_size - 2](dst, coeffs, stride);


Won't this be slower since there's a memory store intermediate?

(I know it's faster now because you don't have inverse transform simd, but
you should fix that by writing inverse transform simd, not by splitting the
transform and the add.)

Ronald
_______________________________________________
libav-devel mailing list
[email protected]
https://lists.libav.org/mailman/listinfo/libav-devel

Reply via email to