Re: [FFmpeg-devel] gcc: Remove auto-vectorization limitation.

Michael Niedermayer Sat, 31 May 2025 06:39:41 -0700

Hi Remi

On Fri, May 30, 2025 at 09:58:48AM +0300, Rémi Denis-Courmont wrote:
> 
> 
> Le 30 mai 2025 03:46:05 GMT+03:00, Michael Niedermayer 
> <mich...@niedermayer.cc> a écrit :
> >On Mon, May 26, 2025 at 11:43:15AM +0300, Rémi Denis-Courmont wrote:
> >> 
> >> 
> >> Le 26 mai 2025 00:37:08 GMT+03:00, Michael Niedermayer 
> >> <mich...@niedermayer.cc> a écrit :
> >> >Hi Rémi
> >> >
> >> >On Sat, May 24, 2025 at 07:10:57PM +0300, Rémi Denis-Courmont wrote:
> >> >> Le torstaina 22. toukokuuta 2025, 9.32.18 Itä-Euroopan kesäaika Jiawei 
> >> >> a écrit 
> >> >> :
> >> >> > > The RISC-V autovectorised output looks like it has a warning "Odd
> >> >> > > rotation angle" which is not present in the non-autovectorised 
> >> >> > > output.
> >> >> > 
> >> >> > I found this occured when using '-ffast-math' in RISC-V, also occur in
> >> >> > -O3 -ffast-math -fno-tree-vectorize case(much slower due to the
> >> >> > -ffast-math)，supplementary more comparison results here:
> >> >> 
> >> >
> >> >> Unfortunately, the FFmpeg code is written with x87 semantics in mind.
> >> >
> >> >I dont remember ever writing code intentionally with x87 semantics. And i
> >> >have doubts other people did.
> >> 
> >> It doesn't have to be intentional. FFmpeg was started and mostly developed 
> >> with x86-32 then x86-64 in mind. It's entirely possible that this happened 
> >> innocently.
> >> 
> >> Specifically, FFmpeg uses open-code for minimum, maximum, absolute value 
> >> and so on (see FFMIN, FFMAX, FFABS). They work nicely for integer maths. 
> >> They also work nicely on x87 with the current set of FPU optimisations, 
> >> but they differ from IEEE semantics because of NaNs, negative zeros and 
> >> such.
> >> 
> >> Because of that the compiler will *not* use the native FPU instructions on 
> >> platforms with native IEEE floats.
> >
> >replace all FFMIN with fminf() / fmin() where the arguments are float or
> >double, assuming that has no ill performance effects
>


> That will harm performance on x87, whence fminf() and co are function calls 
> rather than single instructions. What we actually should do is define 
> separate macros for integer vs float vs double.

ok


> 
> But there are hundreds of use sites to patch. To be bluntly honest, I don't 
> have the motivation to carry that tedious repetitive work out in my free time.

ok, can you review my patch ?


thx

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Why not whip the teacher when the pupil misbehaves? -- Diogenes of Sinope

signature.asc
Description: PGP signature

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Re: [FFmpeg-devel] gcc: Remove auto-vectorization limitation.

Reply via email to