Thanks for everyone's input on this.  The patch pointed out by Timothy Day 
seems to have fixed our problem.

Much appreciated!

Eric



--
Eric J. Walter
Executive Director, Research Computing
Information Technology
William & Mary
Office: 757-221-1886
________________________________
From: Day, Timothy <[email protected]>
Sent: Thursday, February 13, 2025 10:13 AM
To: ?k?slompolo Simppa <[email protected]>; Andreas Dilger 
<[email protected]>; Oleg Drokin <[email protected]>
Cc: [email protected] <[email protected]>; Walter, 
Eric <[email protected]>
Subject: Re: [lustre-discuss] Kernel oops with lustre 2.15.6 on rocky 9.5 
kernel 5.14.0-503.22.1.el9_5.x86_64

[You don't often get email from [email protected]. Learn why this is important 
at https://aka.ms/LearnAboutSenderIdentification ]

> Hi!
>
> We have been suffering this with RHEL9.5 a couple of weeks now. I finally got 
> kernel crash dumps saved, and also see similar "RIP: 
> 0010:ll_prune_negative_children"
>
> I tried applying the patch:
> git cherry-pick 983999bda71115595df48d614ca1aaf9b746c75f to commit 
> f7948c626181cda1f72d148adc73ad499eb60307 (HEAD -> b2_15, tag: v2_15_6, tag: 
> 2.15.6, origin/b2_15)
>
> I get a conflict in lustre/llite/statahead.c

Someone already did a backport to 2.15, so you could cherry-pick that instead 
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Freview.whamcloud.com%2Fc%2Ffs%2Flustre-release%2F%2B%2F57007&data=05%7C02%7Cejwalt%40wm.edu%7C86d662da7c61422521a308dd4c40fcbe%7Cb93cbc3e661d40588693a897b924b8d7%7C0%7C0%7C638750564183618220%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=7wKl8RLWOsi%2FmllC51u%2BZfLi5E1vmmt49NMhTpytFNA%3D&reserved=0<https://review.whamcloud.com/c/fs/lustre-release/+/57007>.

Tim Day

_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to