Hi,

I have a pair of Dell PE1950 servers in an H.A. setup running OpenFiler 2.2 with kernel 2.6.19.7-0.3.smp.gcc3.4.x86.i686, and the system freezes when nfs is stopped. Typically this happens after changing the configuration in the Web UI: OpenFiler applies the changes by stopping and then starting nfs, and the system freezes after running "nfs stop".

The machine still responds to ICMP pings, but ssh sessions are frozen and services do not respond. In the syslog I get several kernel messages like "soft lockup detected on CPU#2" (see the attached file for details).

I searched the forum and the mailing list, and other people seem to have the same problem (soft lockup kernel messages):
https://www.openfiler.com/community/forums/viewtopic.php?id=1001

Is there a newer kernel available with precompiled drbd modules? I installed 2.6.21, but the drbd modules are not present (I need them for data replication between my two machines). Do you plan to integrate or distribute precompiled drbd modules with this updated 2.6.21 kernel?

Regards,
Jérôme Augé
Oct 3 20:07:04 nash kernel: BUG: soft lockup detected on CPU#2!
Oct 3 20:07:04 nash kernel: [<c014c42e>] softlockup_tick+0xa5/0xb4
Oct 3 20:07:04 nash kernel: [<c012c704>] update_process_times+0x39/0x5c
Oct 3 20:07:04 nash kernel: [<c0116cfc>] smp_apic_timer_interrupt+0x8f/0xa7
Oct 3 20:07:04 nash kernel: [<c010482b>] apic_timer_interrupt+0x1f/0x24
Oct 3 20:07:04 nash kernel: [<c031407f>] lock_kernel+0x1a/0x32
Oct 3 20:07:04 nash kernel: [<c019d3be>] proc_lookup+0x17/0xad
Oct 3 20:07:04 nash kernel: [<c019a870>] proc_root_lookup+0xe/0x26
Oct 3 20:07:04 nash kernel: [<c01723de>] real_lookup+0x53/0xc7
Oct 3 20:07:04 nash kernel: [<c0172642>] do_lookup+0x57/0xa2
Oct 3 20:07:04 nash kernel: [<c0172db5>] __link_path_walk+0x728/0xb34
Oct 3 20:07:04 nash kernel: [<c01540aa>] __alloc_pages+0x5e/0x2ad
Oct 3 20:07:04 nash kernel: [<c0173204>] link_path_walk+0x43/0xae
Oct 3 20:07:04 nash kernel: [<c015ca65>] __handle_mm_fault+0x202/0x255
Oct 3 20:07:04 nash kernel: [<c017356e>] do_path_lookup+0x184/0x1e8
Oct 3 20:07:04 nash kernel: [<c0173801>] __user_walk_fd+0x2d/0x3e
Oct 3 20:07:04 nash kernel: [<c016ead6>] vfs_stat_fd+0x19/0x40
Oct 3 20:07:04 nash kernel: [<c015ca65>] __handle_mm_fault+0x202/0x255
Oct 3 20:07:04 nash kernel: [<c016f103>] sys_stat64+0xf/0x23
Oct 3 20:07:04 nash kernel: [<c0315795>] do_page_fault+0x2a5/0x4ff
Oct 3 20:07:04 nash kernel: [<c01e260d>] copy_to_user+0x46/0x4e
Oct 3 20:07:04 nash kernel: [<c012ee0b>] sys_rt_sigprocmask+0xb2/0xc5
Oct 3 20:07:04 nash kernel: [<c0103da9>] sysenter_past_esp+0x56/0x79
Oct 3 20:07:04 nash kernel: [<c031007b>] packet_mmap+0x98/0xec
Oct 3 20:07:04 nash kernel: =======================
Oct 3 20:07:04 nash kernel: BUG: soft lockup detected on CPU#1!
Oct 3 20:07:04 nash kernel: [<c014c42e>] softlockup_tick+0xa5/0xb4
Oct 3 20:07:04 nash kernel: [<c012c704>] update_process_times+0x39/0x5c
Oct 3 20:07:04 nash kernel: [<c0116cfc>] smp_apic_timer_interrupt+0x8f/0xa7
Oct 3 20:07:04 nash kernel: [<c010482b>] apic_timer_interrupt+0x1f/0x24
Oct 3 20:07:04 nash kernel: [<f8b85a58>] svc_close_socket+0xb/0x86 [sunrpc]
Oct 3 20:07:04 nash kernel: [<f8b82d8f>] svc_destroy+0x6f/0xb5 [sunrpc]
Oct 3 20:07:04 nash kernel: [<faf8870d>] nfsd+0x26e/0x27f [nfsd]
Oct 3 20:07:04 nash kernel: [<faf8849f>] nfsd+0x0/0x27f [nfsd]
Oct 3 20:07:04 nash kernel: [<c010497b>] kernel_thread_helper+0x7/0x10
Oct 3 20:07:04 nash kernel: =======================
Oct 3 20:07:06 nash kernel: BUG: soft lockup detected on CPU#0!
Oct 3 20:07:06 nash kernel: [<c014c42e>] softlockup_tick+0xa5/0xb4
Oct 3 20:07:06 nash kernel: [<c012c704>] update_process_times+0x39/0x5c
Oct 3 20:07:06 nash kernel: [<c0116cfc>] smp_apic_timer_interrupt+0x8f/0xa7
Oct 3 20:07:06 nash kernel: [<c010482b>] apic_timer_interrupt+0x1f/0x24
Oct 3 20:07:06 nash kernel: [<c013007b>] kernel_restart_prepare+0x1c/0x20
Oct 3 20:07:06 nash kernel: [<c031407f>] lock_kernel+0x1a/0x32
Oct 3 20:07:06 nash kernel: [<c01765e7>] do_ioctl+0x3f/0x64
Oct 3 20:07:06 nash kernel: [<c0176890>] vfs_ioctl+0x187/0x195
Oct 3 20:07:06 nash kernel: [<c0137e49>] hrtimer_wakeup+0x0/0x18
Oct 3 20:07:06 nash kernel: [<c01768e9>] sys_ioctl+0x4b/0x66
Oct 3 20:07:06 nash kernel: [<c0103da9>] sysenter_past_esp+0x56/0x79
Oct 3 20:07:06 nash kernel: =======================
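
In case it helps anyone comparing their own logs against this one, here is a minimal Python sketch that groups the stack frames of each soft lockup report by CPU. It assumes the syslog format shown in the trace above; the names `TOKEN_RE`, `group_lockups` and `SAMPLE` are made up for this illustration, and the regular expression is a guess that fits this log rather than a general kernel-log parser.

```python
import re

# One pattern for both kinds of entry seen in the log above:
#   "BUG: soft lockup detected on CPU#2!"
#   "[<c031407f>] lock_kernel+0x1a/0x32", optionally followed by "[module]"
# This is an assumption based on the trace in this post only.
TOKEN_RE = re.compile(
    r"BUG: soft lockup detected on CPU#(?P<cpu>\d+)!"
    r"|\[<[0-9a-f]+>\]\s+(?P<func>\w+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<mod>\w+)\])?"
)


def group_lockups(text):
    """Return a list of {"cpu": int, "frames": [str, ...]} in log order."""
    reports = []
    current = None
    for match in TOKEN_RE.finditer(text):
        if match.group("cpu") is not None:
            # A new lockup report starts here.
            current = {"cpu": int(match.group("cpu")), "frames": []}
            reports.append(current)
        elif current is not None:
            # A stack frame belonging to the most recent report.
            frame = match.group("func")
            if match.group("mod"):
                frame += " [" + match.group("mod") + "]"
            current["frames"].append(frame)
    return reports


# A few entries copied from the trace above, used as sample input.
SAMPLE = """\
Oct 3 20:07:04 nash kernel: BUG: soft lockup detected on CPU#1!
Oct 3 20:07:04 nash kernel: [<c014c42e>] softlockup_tick+0xa5/0xb4
Oct 3 20:07:04 nash kernel: [<f8b85a58>] svc_close_socket+0xb/0x86 [sunrpc]
Oct 3 20:07:06 nash kernel: BUG: soft lockup detected on CPU#0!
Oct 3 20:07:06 nash kernel: [<c031407f>] lock_kernel+0x1a/0x32
"""

if __name__ == "__main__":
    for report in group_lockups(SAMPLE):
        print("CPU#%d: %s" % (report["cpu"], " <- ".join(report["frames"])))
```

Because the pattern scans the whole text instead of going line by line, it should also cope with a log where several entries ended up on one line.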
_______________________________________________
Openfiler-users mailing list
https://lists.openfiler.com/mailman/listinfo/openfiler-users
