Le 22 février 2024 05:04:58 GMT+02:00, flow gg a écrit :
>.macro vabsaddu dst src tmp
>- vneg.v \tmp, \src
>- vmax.vv \tmp, \src, \tmp
>+ vfabs.v \tmp, \src
>vwaddu.wv \dst, \dst, \tmp
>.endm
>
>After making this change, the test
.macro vabsaddu dst src tmp
- vneg.v \tmp, \src
- vmax.vv \tmp, \src, \tmp
+ vfabs.v \tmp, \src
vwaddu.wv \dst, \dst, \tmp
.endm
After making this change, the tests did not pass. I'm not quite clear on
how to understand the differe
Le tiistaina 6. helmikuuta 2024, 17.56.32 EET flow gg a écrit :
>
Did you try to compute integral absolute values with the ad-hoc (floating
point) instruction instead of vneg/vmax? It should work since the sign is in
the same place, though I don't know if it will be faster.
--
レミ・デニ-クールモン
htt
From 67f2a662be1533e52a28971152bff670f78544fd Mon Sep 17 00:00:00 2001
From: sunyuechi
Date: Tue, 6 Feb 2024 23:18:51 +0800
Subject: [PATCH 5/7] lavc/me_cmp: R-V V vsse vsad
C908:
vsad_0_c: 936.0
vsad_0_rvv_i32: 236.2
vsad_1_c: 424.0
vsad_1_rvv_i32: 190.2
vsse_0_c: 877.0
vsse_0_rvv_i32: 204.2
vs