On 6/6/26 22:19, Matthew Wilcox wrote:
> On Sat, Jun 06, 2026 at 04:58:55PM +0100, Mohammed EL Kadiri wrote:
>> +The SLUB allocator merges slab caches with compatible size, alignment, and
> 
> More of a question for Vlastimil ... do we want to continue to
> distinguish between slab (the API) and SLUB (the implementation)?
> I don't think we ever want to go back to a situation where we have
> multiple competing implementations of the slab API in the kernel.
> So shouldn't we deprecate uses of SLUB, particularly in the
> documentation?

Indeed, any new stuff should talk about the slab allocator, with old stuff
can be converted if touched for other reasons.

>> +flags to reduce memory fragmentation. While this improves memory efficiency,
>> +it allows objects of different types to share the same slab pages. This
> 
> s/ pages//
> 
>> +enables cross-cache heap exploitation, where a use-after-free in one object
>> +type can be leveraged to corrupt an unrelated type.
>> +
>> +The `SLAB_NO_MERGE` flag prevents a cache from being merged, ensuring it
>> +receives dedicated slab pages.
> 
> s/slab pages/a dedicated slab/
> 
>> +2. *Actually mergeable*: The cache must not already be unmergeable.
>> +   A cache is already unmergeable if any of the following is true:
>> +
>> +   - It has a constructor (`ctor` argument is non-NULL).
>> +   - It has a non-zero `usersize` (with `CONFIG_HARDENED_USERCOPY`).
>> +   - It already has `SLAB_NO_MERGE` or another `SLAB_NEVER_MERGE` flag.
> 
> I don't know if this is good advice for users of the API.  It's true
> that the slab will already be unmergable for these other reasons, but
> it's harmless to specify SLAB_NO_MERGE in that case.  And it
> communicates intent.  And in case somebody removes the ctor in the
> future, or we decide to change which flags are in SLAB_NEVER_MERGE,
> the slab will still be unmergable.

Agreed.

>> +3. *Bounded allocation volume*: The cache has a predictable number of
>> +   active objects, so the memory cost of dedicated slab pages is
>> +   acceptable.
> 
> I don't understand why this is a criteria.

+1

>> +How merging works
>> +=================
>> +
>> +When `kmem_cache_create()` is called:
>> +
>> +1. If `usersize` is non-zero, the merge path is skipped entirely.
>> +
>> +2. Otherwise, `find_mergeable()` in `mm/slab_common.c` searches for a
>> +   compatible existing cache. A merge is prevented if:
>> +
>> +   - The `slab_nomerge` boot parameter is set
>> +   - The new cache has a constructor
>> +   - The new cache's flags include `SLAB_NO_MERGE`
>> +   - No existing cache has compatible size and flags
>> +
>> +3. If a compatible cache is found, the new cache becomes an alias. Both
>> +   share the same slab pages.
> 
> This feels like documenting internals rather than documenting how to use
> the flag.  I'd drop it entirely.

+1

>> +The cross-cache attack class
>> +=============================
>> +
>> +Cross-cache attacks exploit slab merging to achieve type confusion:
>> +
>> +1. Attacker triggers a use-after-free in object type A.
>> +2. Type A's cache is merged with type B (they share slab pages).
>> +3. The freed type A slot is reallocated as type B.
>> +4. Attacker uses the dangling pointer to corrupt type B.
>> +5. Privilege escalation.
>> +
>> +CVE-2022-29582 demonstrates this technique: an io_uring use-after-free is
>> +exploited via cross-cache page-level reallocation to achieve root.
>> +
>> +`SLAB_NO_MERGE` prevents step 2: dedicated pages mean a freed slot of
>> +one type cannot be reallocated as a different type.
> 
> Not sure this section adds anything to what was already described.
> 
>> +Tradeoffs
>> +=========
>> +
>> +*Memory*: Isolated caches may have partially-filled slab pages that
>> +cannot be used by other types. For caches with bounded allocation counts,
>> +this is typically a few extra pages.
>> +
>> +*Performance*: Zero impact on `kmem_cache_alloc()` and
>> +`kmem_cache_free()`. The only effect is at boot when the cache is
>> +created.
>> +
>> +Relationship to other mitigations
>> +==================================
>> +
>> +`CONFIG_RANDOM_KMALLOC_CACHES`
>> +    Creates 16 copies of each `kmalloc` size class and randomly assigns
>> +    allocations among them. Only affects `kmalloc()` users. Does not
>> +    affect named caches created with `kmem_cache_create()`.
>> +
>> +`SLAB_TYPESAFE_BY_RCU`
>> +    Delays freeing the slab page by an RCU grace period. Does not delay
>> +    object slot reuse. Does not prevent cross-cache merging. Solves a
>> +    different problem: safe lockless access to freed-and-reallocated
>> +    objects of the same type.
>> +
>> +`slab_nomerge` boot parameter
>> +    Disables merging for all caches globally. `SLAB_NO_MERGE` provides
>> +    the same protection selectively for individual caches without the
>> +    global memory cost.
> 
> These two sections also feel unnecessary.

Many "product of LLM" hallmarks, sigh.

Reply via email to