subject:"\[FFmpeg\-devel\] \[PATCH 5\/7\] lavc\/me_cmp\: R\-V V vsse vsad"

Re: [FFmpeg-devel] [PATCH 5/7] lavc/me_cmp: R-V V vsse vsad

2024-02-22 Thread Rémi Denis-Courmont

Le 22 février 2024 05:04:58 GMT+02:00, flow gg a écrit : >.macro vabsaddu dst src tmp >- vneg.v \tmp, \src >- vmax.vv \tmp, \src, \tmp >+ vfabs.v \tmp, \src >vwaddu.wv \dst, \dst, \tmp >.endm > >After making this change, the test

Re: [FFmpeg-devel] [PATCH 5/7] lavc/me_cmp: R-V V vsse vsad

2024-02-21 Thread flow gg

.macro vabsaddu dst src tmp - vneg.v \tmp, \src - vmax.vv \tmp, \src, \tmp + vfabs.v \tmp, \src vwaddu.wv \dst, \dst, \tmp .endm After making this change, the tests did not pass. I'm not quite clear on how to understand the differe

Re: [FFmpeg-devel] [PATCH 5/7] lavc/me_cmp: R-V V vsse vsad

2024-02-21 Thread Rémi Denis-Courmont

Le tiistaina 6. helmikuuta 2024, 17.56.32 EET flow gg a écrit : > Did you try to compute integral absolute values with the ad-hoc (floating point) instruction instead of vneg/vmax? It should work since the sign is in the same place, though I don't know if it will be faster. -- レミ・デニ-クールモン htt

[FFmpeg-devel] [PATCH 5/7] lavc/me_cmp: R-V V vsse vsad

2024-02-06 Thread flow gg

From 67f2a662be1533e52a28971152bff670f78544fd Mon Sep 17 00:00:00 2001 From: sunyuechi Date: Tue, 6 Feb 2024 23:18:51 +0800 Subject: [PATCH 5/7] lavc/me_cmp: R-V V vsse vsad C908: vsad_0_c: 936.0 vsad_0_rvv_i32: 236.2 vsad_1_c: 424.0 vsad_1_rvv_i32: 190.2 vsse_0_c: 877.0 vsse_0_rvv_i32: 204.2 vs