On Thu, Jul 31, 2025 at 01:37:34PM +0100, Simon Horman wrote:
> While I appreciate the desire for improved performance and nicer code
> generation. I think the idea of writing 64 bits of data to the
> address of a 32 bit member of a structure goes against the direction
> of hardening work by Kees and others.

Agreed: it's better to avoid obscuring these details from the compiler
so it can have an "actual" view of the object sizes involved.

> Indeed, it seems to me this is the kind of thing that struct_group()
> aims to avoid.
> 
> In this case struct group() doesn't seem like the best option,
> because it would provide a 64-bit buffer that we can memcpy into.
> But it seems altogether better to simply assign u64 value to a u64 member.

Agreed: with struct_group you get a sized pointer, and while you can
provide a struct tag to make it an assignable object, it doesn't make
too much sense here.

> So I'm wondering if an approach along the following lines is appropriate
> (Very lightly compile tested only!).
> 
> And yes, there is room for improvement of the wording of the comment
> I included below.
> 
> diff --git a/include/net/libeth/xdp.h b/include/net/libeth/xdp.h
> index f4880b50e804..a7d3d8e44aa6 100644
> --- a/include/net/libeth/xdp.h
> +++ b/include/net/libeth/xdp.h
> @@ -1283,11 +1283,7 @@ static inline void libeth_xdp_prepare_buff(struct 
> libeth_xdp_buff *xdp,
>       const struct page *page = __netmem_to_page(fqe->netmem);
>  
>  #ifdef __LIBETH_WORD_ACCESS
> -     static_assert(offsetofend(typeof(xdp->base), flags) -
> -                   offsetof(typeof(xdp->base), frame_sz) ==
> -                   sizeof(u64));
> -
> -     *(u64 *)&xdp->base.frame_sz = fqe->truesize;
> +     xdp->base.frame_sz_le_qword = fqe->truesize;
>  #else
>       xdp_init_buff(&xdp->base, fqe->truesize, xdp->base.rxq);
>  #endif
> diff --git a/include/net/xdp.h b/include/net/xdp.h
> index b40f1f96cb11..b5eedeb82c9b 100644
> --- a/include/net/xdp.h
> +++ b/include/net/xdp.h
> @@ -85,8 +85,19 @@ struct xdp_buff {
>       void *data_hard_start;
>       struct xdp_rxq_info *rxq;
>       struct xdp_txq_info *txq;
> -     u32 frame_sz; /* frame size to deduce data_hard_end/reserved tailroom*/
> -     u32 flags; /* supported values defined in xdp_buff_flags */
> +     union {
> +             /* Allow setting frame_sz and flags as a single u64 on
> +              * little endian systems. This may may give optimal
> +              * performance. */
> +             u64 frame_sz_le_qword;
> +             struct {
> +                     /* Frame size to deduce data_hard_end/reserved
> +                      * tailroom. */
> +                     u32 frame_sz;
> +                     /* Supported values defined in xdp_buff_flags. */
> +                     u32 flags;
> +             };
> +     };
>  };

Yeah, this looks like a nice way to express this, and is way more
descriptive than "(u64 *)&xdp->base.frame_sz" :)

-- 
Kees Cook

Reply via email to