> The short answer: 'man 7 sched'

Thanks, I read this and I think I might still be confused. I am using cgroups
and have cpu.cfs_quota_us set to 2300000 and cpu.cfs_period_us set to 100000
for each of 3 different cgroups. I assume that equates to 23 CPUs per cgroup,
or 69 CPUs in total, alongside a couple of other cgroups that also have
cpu.cfs_quota_us configured, on an 80 CPU machine, which is why I made my
original guess.
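
To make the arithmetic behind that guess explicit, here is a minimal sketch in Python. It assumes the usual reading of the CFS bandwidth settings, where quota divided by period gives the number of CPUs' worth of runtime a cgroup may use per period. The function and variable names are mine, purely for illustration; the numbers are the ones quoted above.

```python
# Sketch: convert CFS bandwidth settings into a CPU-equivalent count.
# Assumption: quota_us / period_us is the number of CPUs' worth of
# runtime the cgroup may consume in each period.

def cpu_equivalent(quota_us: int, period_us: int) -> float:
    """Return how many CPUs' worth of runtime the quota allows."""
    if quota_us < 0:
        # In cgroup v1, a quota of -1 means "no bandwidth limit".
        return float("inf")
    return quota_us / period_us

QUOTA_US = 2_300_000   # cpu.cfs_quota_us from the message
PERIOD_US = 100_000    # cpu.cfs_period_us from the message
NUM_CGROUPS = 3        # cgroups sharing this configuration
MACHINE_CPUS = 80      # CPUs on the machine

per_cgroup = cpu_equivalent(QUOTA_US, PERIOD_US)
total = per_cgroup * NUM_CGROUPS
headroom = MACHINE_CPUS - total

print(f"per cgroup: {per_cgroup:g} CPUs")
print(f"all {NUM_CGROUPS} cgroups: {total:g} CPUs")
print(f"left over on {MACHINE_CPUS} CPUs: {headroom:g}")
```

The leftover figure is only an upper bound on what remains for everything else, since the other cgroups with a quota configured are not counted here.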

> The first question is, of course: "Did you see any actual evidence of kernel
> threads being starved?"

I have a couple of very similar machines with similar workloads, and I have
observed messages like the following in dmesg on several of them:

rcu_sched detected stalls on CPUs
Sending NMI from CPU 43 to CPUs 14
watchdog: BUG: soft lockup - CPU#26 stuck for 22s [migration/54:335]
ixgbe 0000:19:00.1 eno2: initiating reset due to tx timeout

That is why I have this hypothesis.

I am still unclear whether the cgroup CPU controller guarantees that tasks in
a cgroup cannot be preempted, even when a kernel thread requires CPU time.

Thanks for your time!

Abejide Ayodele
It always seems impossible until it's done. --Nelson Mandela
_______________________________________________
Kernelnewbies mailing list
[email protected]
https://lists.kernelnewbies.org/mailman/listinfo/kernelnewbies