Thanks for everyone's input on this. The patch pointed out by Timothy Day seems to have fixed our problem.
Much appreciated! Eric -- Eric J. Walter Executive Director, Research Computing Information Technology William & Mary Office: 757-221-1886 ________________________________ From: Day, Timothy <[email protected]> Sent: Thursday, February 13, 2025 10:13 AM To: ?k?slompolo Simppa <[email protected]>; Andreas Dilger <[email protected]>; Oleg Drokin <[email protected]> Cc: [email protected] <[email protected]>; Walter, Eric <[email protected]> Subject: Re: [lustre-discuss] Kernel oops with lustre 2.15.6 on rocky 9.5 kernel 5.14.0-503.22.1.el9_5.x86_64 [You don't often get email from [email protected]. Learn why this is important at https://aka.ms/LearnAboutSenderIdentification ] > Hi! > > We have been suffering this with RHEL9.5 a couple of weeks now. I finally got > kernel crash dumps saved, and also see similar "RIP: > 0010:ll_prune_negative_children" > > I tried applying the patch: > git cherry-pick 983999bda71115595df48d614ca1aaf9b746c75f to commit > f7948c626181cda1f72d148adc73ad499eb60307 (HEAD -> b2_15, tag: v2_15_6, tag: > 2.15.6, origin/b2_15) > > I get a conflict in lustre/llite/statahead.c Someone already did a backport to 2.15, so you could cherry-pick that instead https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Freview.whamcloud.com%2Fc%2Ffs%2Flustre-release%2F%2B%2F57007&data=05%7C02%7Cejwalt%40wm.edu%7C86d662da7c61422521a308dd4c40fcbe%7Cb93cbc3e661d40588693a897b924b8d7%7C0%7C0%7C638750564183618220%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=7wKl8RLWOsi%2FmllC51u%2BZfLi5E1vmmt49NMhTpytFNA%3D&reserved=0<https://review.whamcloud.com/c/fs/lustre-release/+/57007>. Tim Day
_______________________________________________ lustre-discuss mailing list [email protected] http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
