Public bug reported:

Several different systems, both VMs and bare metal, running Ubuntu 20.04
LTS show random panics. The panics occur with several kernels: the
generic Ubuntu kernel 5.4.0-91.81, a mainline 5.15 build from
kernel.ubuntu.com, and 5.11-hwe.

The crash dumps all show panics within the SLUB memory management code.
Analyzing the kdump shows invalid freelist pointers:

CACHE             OBJSIZE  ALLOCATED     TOTAL  SLABS  SSIZE  NAME
kmem: kmalloc-256: slab: ffffefc73e319900 invalid freepointer: d28996eab0548a12
ffff8a863dc06f40      256      15454     76096   1189    16k  kmalloc-256

The kernel log shows warnings about wrong slab cache:

[148148.037307] cache_from_obj: Wrong slab cache. jbd2_journal_handle but object is from kmalloc-256
[148148.037348] WARNING: CPU: 20 PID: 4141624 at mm/slab.h:521 kmem_cache_free+0x260/0x2b0
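
To see whether the same pair of caches is involved every time, the
warnings can be counted across a full kernel log. The following is only
a minimal sketch, not part of the original analysis: the function name
wrong_cache_pairs and the sample string are made up for illustration,
and the regex assumes the message wording shown above.

```python
import re
from collections import Counter

# Matches the cache_from_obj warning text quoted in this report.
# \s+ between words tolerates log lines hard-wrapped by a mail client.
WRONG_CACHE = re.compile(
    r"cache_from_obj:\s+Wrong\s+slab\s+cache\.\s+(\S+)"
    r"\s+but\s+object\s+is\s+from\s+(\S+)"
)

def wrong_cache_pairs(log_text):
    """Count (cache passed to the free call, cache owning the object) pairs.

    Hypothetical helper for triage; it only parses text and knows
    nothing about the kernel itself.
    """
    return Counter(WRONG_CACHE.findall(log_text))

# Sample taken from the warning in this report, wrapped as it was mailed.
sample = (
    "[148148.037307] cache_from_obj: Wrong slab cache. jbd2_journal_handle but \n"
    "object is from kmalloc-256\n"
)

for (freed_via, owner), count in wrong_cache_pairs(sample).items():
    print(f"{count}x freed via {freed_via}, object owned by {owner}")
```

If the pairs differ between panics, that would hint at general memory
corruption rather than a single mismatched free in one code path.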

This shows up when there are several hundred MB per second of backup
traffic via ceph (I think using rbd). Sometimes it takes only minutes
for the system to panic.

Hardware:
    HP DL380 Gen10
    AMD EPYC 7502 32-Core Processor
    256GB RAM

Firmware and microcode are up to date.

I attached one backtrace, the warning message, and the panic.

** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: New

** Attachment added: "Panic 1"
   https://bugs.launchpad.net/bugs/1952425/+attachment/5543591/+files/out

https://bugs.launchpad.net/bugs/1952425

Title:
  SLUB freelist corruption with ceph


