Hi LMB,
> > - e.g. the problem with SLES 11 SP2 kernels crash - the same as
> > described by Martin:
> >> SP2 kernels crash seriously (when a node rejoins the cluster) when
> >> using STCP as recommended in the SLES HA documentation and offered via the
> >> wizards.
> Is this not fixed by the latest maintenance upgrades?
To my knowledge the latest maintenance kernel is 3.0.34-0.7.9.
I verified that all of the following SuSE kernels show the crash:
vmlinux-3.0.26-0.7-default.gz
vmlinux-3.0.34-0.7-default.gz
vmlinux-3.0.31-0.9-default.gz
vmlinux-3.0.36-5-default.gz
vmlinux-3.0.36-10-default.gz
KERNEL: vmlinux-3.0.34-0.7-default.gz
DEBUGINFO: ./vmlinux-3.0.34-0.7-default.debug
DUMPFILE: vmcore
CPUS: 24
DATE: Mon Jul 2 09:36:33 2012
UPTIME: 2 days, 17:12:14
LOAD AVERAGE: 1.57, 1.42, 1.45
TASKS: 551
NODENAME: rt-lxcl9b
RELEASE: 3.0.34-0.7-default
VERSION: #1 SMP Tue Jun 19 09:56:30 UTC 2012 (fbfc70c)
MACHINE: x86_64 (2932 Mhz)
MEMORY: 48 GB
PANIC: "[234603.020857] Oops: 0000 [#1] SMP " (check log for details)
PID: 19580
COMMAND: "sh"
TASK: ffff880b6bc26140 [THREAD_INFO: ffff880bceb40000]
CPU: 7
STATE: TASK_RUNNING (PANIC)
crash> bt
PID: 19580 TASK: ffff880b6bc26140 CPU: 7 COMMAND: "sh"
#0 [ffff880bceb41b30] machine_kexec at ffffffff810265fe
#1 [ffff880bceb41b80] crash_kexec at ffffffff810a31fa
#2 [ffff880bceb41c50] oops_end at ffffffff81442b88
#3 [ffff880bceb41c70] __bad_area_nosemaphore at ffffffff810324e5
#4 [ffff880bceb41d30] do_page_fault at ffffffff814451cb
#5 [ffff880bceb41e30] page_fault at ffffffff81441d65
[exception RIP: sock_ioctl+40]
RIP: ffffffff81370258 RSP: ffff880bceb41ee8 RFLAGS: 00010296
RAX: 0000000000000000 RBX: 0000000000005401 RCX: 00007fff87485790
RDX: 00007fff87485790 RSI: 0000000000005401 RDI: ffff880b96a98a80
RBP: 00007fff87485790 R8: 0000000000000000 R9: 00007f9bd6e1e640
R10: 00007fff87485730 R11: ffffffff811e0a90 R12: 00007fff87485790
R13: 0000000000000000 R14: 0000000000005401 R15: 0000000000000000
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#6 [ffff880bceb41f10] do_vfs_ioctl at ffffffff81160f5b
#7 [ffff880bceb41f40] sys_ioctl at ffffffff81161321
#8 [ffff880bceb41f80] system_call_fastpath at ffffffff81449392
RIP: 00007f9bd6725677 RSP: 00007fff874857c8 RFLAGS: 00010202
RAX: 0000000000000010 RBX: ffffffff81449392 RCX: ffffffffffffffa8
RDX: 00007fff87485790 RSI: 0000000000005401 RDI: 0000000000000000
RBP: 0000000000000006 R8: 00007fff874858f0 R9: 00007f9bd6e1e640
R10: 00007fff87485730 R11: 0000000000000202 R12: 00007f9bd6fef700
R13: ffffffffffffffa8 R14: 0000000000000000 R15: 0000000000000000
ORIG_RAX: 0000000000000010 CS: 0033 SS: 002b
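For anyone decoding the trace: in the user-space frame ORIG_RAX is 0x10 (sys_ioctl on x86_64), RDI is 0 (file descriptor 0) and RSI is 0x5401, which is TCGETS on Linux. So the trace is consistent with "sh" asking whether its stdin is a terminal while stdin is a socket. The following is only a minimal sketch of that user-space call, not a reproducer for the crash: the function name is hypothetical, and using a socketpair in place of the real cluster socket is an assumption. On a healthy kernel the call should simply fail with an error code.

```python
import errno
import fcntl
import socket

TCGETS = 0x5401  # value seen in RSI in the trace; the Linux TCGETS request


def tcgets_on_socket():
    """Issue ioctl(fd, TCGETS, buf) on a socket and return the errno reported."""
    a, b = socket.socketpair()  # stand-in for the real socket (assumption)
    try:
        # 64 zero bytes is more than enough room for the kernel's termios struct.
        fcntl.ioctl(a.fileno(), TCGETS, b"\0" * 64)
    except OSError as exc:
        return exc.errno  # expected path: a socket is not a terminal
    finally:
        a.close()
        b.close()
    return None  # would mean the kernel accepted TCGETS on a socket


if __name__ == "__main__":
    code = tcgets_on_socket()
    print(code, errno.errorcode.get(code))
```

On the affected SP2 kernels the equivalent call from "sh" never returns an error code; it faults inside sock_ioctl instead.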
> I don't see an open bug for something like this right now.
Are you serious?
It was you who resolved this bug as INVALID in bugzilla
https://bugzilla.novell.com/show_bug.cgi?id=769292.
Best regards
Martin Konold