On Tue, Dec 07, 2021 at 11:38:51AM +0000, Mikolaj Kucharski wrote: > More than a week of uptime on both machines: > > pce-0041# uptime > 11:30AM up 8 days, 18:48, 1 user, load averages: 0.02, 0.05, 0.01 > > pce-0035# uptime > 11:31AM up 8 days, 18:39, 1 user, load averages: 0.11, 0.14, 0.08 > > >From dmesg logs on my debugging kernel I see that, pce-0041 had zero > codepaths triggered so far of newly introduced code, but on pce-0035 I > see that new code path was triggered about 4 times. > > I'm planning to keep that kernel version running on those for a month or > maybe even more, but so far result looks very good. I think at this > stage pce-0035 would already panic(), based on my historical stats, how > often machine paniced before. Machine pce-0041 panics once per a quarter, > so I would need to wait a bit more to have good level of confidence, > against my stats.
I am confident that we have found the root cause of this issue. I have committed the fix. Thank you for your patience and all the help with tracking this down!
