Sep 4, 2022, 15:54 by r...@remlab.net:

> The following changes since commit b6e8fc1c201d58672639134a737137e1ba7b55fe:
>
>   avcodec/speexdec: improve support for speex in non-ogg (2022-09-04 11:31:57 +0200)
>
> are waiting thorough bashing at your express convenience up to:
>
>   riscv: float vector dot product with RVV (2022-09-04 16:45:38 +0300)
>
> Changes since v1:
>
> - Removed stray define.
> - Fixed mismatch between byte and element size in mul-scalar.
> - Added fmul, fmac, dmul, dmac, fmul-add, fmul-reverse, fmul-window.
> - Added float butterfly and dot product.
>
> All operations are unrolled to the maximum group size (8), with the
> exception of overlap/add. The latter seems to require a minimum of 6
> vectors (maybe 5 by extremely careful ordering), so the group size is
> only 4.
>
> The pointer arithmetic could be slightly optimised with SH2ADD and
> SH3ADD instructions from the Zba extension. This would require either
> more conditional code or mandatory Zba support, for probably negligible
> performance gains though.
>

Did you test on real hardware or a VM? If the former, what does checkasm --bench report?
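
For reference on the SH2ADD/SH3ADD remark quoted above: those instructions fold a left shift by 2 or 3 and an add into a single operation, which matches indexing into float (4-byte) and double (8-byte) arrays. A minimal C sketch of that arithmetic follows. The helper names are hypothetical, the element sizes are assumed to be 4 and 8 bytes, and none of this is taken from the patch set itself.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical models of the address arithmetic only; not code from the patches. */

/* SH2ADD rd, rs1, rs2 computes rd = rs2 + (rs1 << 2). */
static inline uintptr_t sh2add(uintptr_t rs1, uintptr_t rs2)
{
    return rs2 + (rs1 << 2);
}

/* SH3ADD rd, rs1, rs2 computes rd = rs2 + (rs1 << 3). */
static inline uintptr_t sh3add(uintptr_t rs1, uintptr_t rs2)
{
    return rs2 + (rs1 << 3);
}

/* Address of base[i] for 4-byte floats: one SH2ADD instead of a shift plus an add. */
static inline const float *float_at(const float *base, size_t i)
{
    return (const float *)sh2add((uintptr_t)i, (uintptr_t)base);
}

/* Address of base[i] for 8-byte doubles: one SH3ADD instead of a shift plus an add. */
static inline const double *double_at(const double *base, size_t i)
{
    return (const double *)sh3add((uintptr_t)i, (uintptr_t)base);
}
```

Without these instructions, each such address costs a separate shift and add, which is the small saving the quoted paragraph weighs against the extra conditional code.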