Re: [PATCH v5 0/9] aarch64: Add fp8 sme 2.1 features per ACLE 2024Q4

Alice Carlotti Thu, 18 Dec 2025 09:53:17 -0800

On Thu, Dec 18, 2025 at 05:14:50PM +0000, Claudio Bantaloukas wrote:
> 
> This patch series completes support for SME2 and SME2p1 intrinsics relative to
> modal 8bit floating point types.
> 
> - The first patch in the series introduces tests for using luti intrinsics 
> with
>   mf8 that was already working since their introduction, now that their use is
>   documented in ACLE.
> - The second patch extends the definitions of existing non-interpreting 
> sve2/sme
>   intrinsics to support mfloat8 types.
> - The third and fourth patches add widening and narrowing sme2 fp8 conversions
>   respectively (svcvt).
> - The fifth patch adds multi-vector floating-point adjust exponent intrinsics
>   (svscale).
> - The sixth patch adds support for the sme-f8f16 and sme-f8f32 arch features
>   and related defines.
> - Patch 7 adds Multi-vector 8-bit floating-point multiply-add long intrinsics.
> - Patch 8 adds 8-bit floating-point sum of outer products and accumulate
>   intrinsics.
> - Patch 9 adds 8-bit floating point dot product intrinsics.


This is going to be awkward to implement, but I think we also need to make the
existing FEAT_SME_F16F16 add/sub intrinsics available when +sme-f8f16 is
enabled (without +sme-f16f16).  That is, the feature requirements need updating
for:

DEF_SME_ZA_FUNCTION_GS (svadd, unary_za_slice, za_d_float, vg1x24, none)
DEF_SME_ZA_FUNCTION_GS (svsub, unary_za_slice, za_d_float, vg1x24, none)

Alice

Re: [PATCH v5 0/9] aarch64: Add fp8 sme 2.1 features per ACLE 2024Q4

Reply via email to