We have av_pixelutils_sad_fn which does SAD and has SIMD, there's no point in reinventing the wheel.I also don't see why this needs to be implemented with CUDA. You're not even doing the SAD in CUDA. I bet it'll be just as fast if not faster in C (unless you cheat somehow).
The point is to do it on CUDA frames without copying them to system ram first.
smime.p7s
Description: S/MIME Cryptographic Signature
_______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org http://ffmpeg.org/mailman/listinfo/ffmpeg-devel