Craig Prescott wrote, On 2009. 11. 19. 20:42: > Papp Tamás wrote: >> The logs are full with this: >> >> Nov 19 20:03:32 node1 kernel: BUG: soft lockup - CPU#3 stuck for 10s! >> [ll_ost_80:4894] >> Nov 19 20:03:32 node1 kernel: CPU 3: > <snip> >> Nov 19 20:03:34 node1 kernel: Lustre: Skipped 40339060 previous >> similar messages 0; still busy with 3 active RPCs > > We had the same problem with 1.8.x.x. > > We set lnet.printk=0 on our OSS nodes and it has helped us > dramatically - we have not seen the 'soft lockup' problem since. > > sysctl -w lnet.printk=0 > > This will turn off all but 'emerg' messages from lnet. > > It would be interesting to know if this avoided the lockups for you, too.
I set it up. We'll see. Thank you very much! tamas _______________________________________________ Lustre-discuss mailing list [email protected] http://lists.lustre.org/mailman/listinfo/lustre-discuss
