Thank you all for your ideas!

Sure, we do have some modules not from the kernel source tree. These are
Mellanox (our NICs) and OpenvSwitch, as we've had some problems that
were fixed in the newer driver versions.

We don't have apport enabled, and actually, the hypervisor nodes don't even 
have direct access to the internet (only some VMs on them).
I checked on a test VM what kind of info it collects, and it seems that these 
are the arch, kernel version, and the stack trace. That kind of info is 
attached manually, we have netconsole enabled that collected it.

When the issue started, it was even reproducible on the then-latest
kernel (5.4.0-66), so I'm not sure that simply upgrading can help.

Currently I'm working on integrating kdump into our infrastructure,
trying to reproduce again, and I'll also try to schedule migration +
upgrade for our hypervisor node (that's not fast though).

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1921355

Title:
  cgroups related kernel panics

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1921355/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to