Hi Roland,

Apr  7 18:17:17 lab105 kernel: Unable to handle kernel paging request at virtual address 6b6b6b6b6b6b6b6b

I think I fixed the bug causing this oops (I was able to reproduce it,
and I don't see it any more).  I checked the following patch in and
queued it for kernel 2.6.17:


My ia64 system still crashes with the patch applied. Please see the log below.


Apr 13 13:10:21 lab105 kernel: Abort for req_index 1
Apr 13 13:10:26 lab105 kernel: ib_srp: SRP reset_host called
Apr 13 13:10:28 lab105 kernel: ib_srp: connection closed
Apr 13 13:10:28 lab105 kernel: Unable to handle kernel paging request at virtual address 6b6b6b6b6b6b6b6b
Apr 13 13:10:28 lab105 kernel: scsi_eh_2[13324]: Oops 11012296146944 [1]
Apr 13 13:10:28 lab105 kernel: Modules linked in: ib_srp ib_cm ib_sa evdev joydev sg st sr_mod ide_cd cdrom usbserial parport_pc lp parport ipv6 thermal processor fan button binfmt_misc usbhid ib_mthca ib_mad ib_core ehci_hcd uhci_hcd usbcore i2c_i801 i2c_core e1000 nls_iso8859_1 nls_cp437 dm_mod reiserfs mptspi scsi_transport_spi mptscsih mptbase sd_mod scsi_mod
Apr 13 13:10:28 lab105 kernel:
Apr 13 13:10:28 lab105 kernel: Pid: 13324, CPU 1, comm:            scsi_eh_2
Apr 13 13:10:28 lab105 kernel: psr : 0000121008026018 ifs : 800000000000050d ip : [<a00000020235a0f1>] Not tainted
Apr 13 13:10:28 lab105 kernel: ip is at srp_reconnect_target+0x2b1/0x5c0 [ib_srp]
Apr 13 13:10:28 lab105 kernel: unat: 0000000000000000 pfs : 000000000000050d rsc : 0000000000000003
Apr 13 13:10:28 lab105 kernel: rnat: 0000000000000000 bsps: 0000000000000000 pr : 0000000000009541
Apr 13 13:10:28 lab105 kernel: ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70433f
Apr 13 13:10:28 lab105 kernel: csd : 0000000000000000 ssd : 0000000000000000
Apr 13 13:10:28 lab105 kernel: b0 : a00000020235a060 b6 : a000000100003320 b7 : a0000002023ddd80
Apr 13 13:10:28 lab105 kernel: f6 : 1003e6b6b6b6b6b6b6b6b f7 : 0ffdd8000000000000000
Apr 13 13:10:28 lab105 kernel: f8 : 1003e0000000000003598 f9 : 1003e0000000000000118
Apr 13 13:10:28 lab105 kernel: f10 : 1003e0000000000000000 f11 : 1003e0000000000000000
Apr 13 13:10:28 lab105 kernel: r1 : a00000020235c200 r2 : e0000001e58f8b58 r3 : e00000018d748a40
Apr 13 13:10:28 lab105 kernel: r8 : e0000001e58f8ba8 r9 : e0000001e58f89f8 r10 : a000000100931338
Apr 13 13:10:28 lab105 kernel: r11 : 0000000000000001 r12 : e0000001ea8f7d00 r13 : e0000001ea8f0000
Apr 13 13:10:28 lab105 kernel: r14 : a000000100931340 r15 : e0000001ea8f0000 r16 : 0000000000000001
Apr 13 13:10:28 lab105 kernel: r17 : 0000000000000001 r18 : e0000001ea8f0f84 r19 : a000000100931348
Apr 13 13:10:28 lab105 kernel: r20 : ffffffffffffffff r21 : 0000000000000008 r22 : e00000000479c980
Apr 13 13:10:28 lab105 kernel: r23 : e0000001f5e7a920 r24 : 0000000000000080 r25 : e00000000479c99f
Apr 13 13:10:28 lab105 kernel: r26 : a0000002023ddd80 r27 : e000000187d4c1e0 r28 : e000000187d4c000
Apr 13 13:10:28 lab105 kernel: r29 : e0000001f5e7a880 r30 : e00000018d748ab8 r31 : e00000018d748a20
Apr 13 13:10:28 lab105 kernel:
Apr 13 13:10:28 lab105 kernel: Call Trace:
Apr 13 13:10:28 lab105 kernel:  [<a000000100013000>] show_stack+0x80/0xa0
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7880 bsp=e0000001ea8f1308
Apr 13 13:10:28 lab105 kernel:  [<a000000100013860>] show_regs+0x840/0x880
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7a50 bsp=e0000001ea8f12a8
Apr 13 13:10:28 lab105 kernel:  [<a000000100035a10>] die+0x1b0/0x2e0
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7a60 bsp=e0000001ea8f1260
Apr 13 13:10:28 lab105 kernel:  [<a000000100057840>] ia64_do_page_fault+0x9a0/0xb20
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7a80 bsp=e0000001ea8f11f0
Apr 13 13:10:28 lab105 kernel:  [<a00000010000bc80>] ia64_leave_kernel+0x0/0x280
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7b30 bsp=e0000001ea8f11f0
Apr 13 13:10:28 lab105 kernel:  [<a00000020235a0f0>] srp_reconnect_target+0x2b0/0x5c0 [ib_srp]
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7d00 bsp=e0000001ea8f1188
Apr 13 13:10:28 lab105 kernel:  [<a00000020235a460>] srp_reset_host+0x60/0xa0 [ib_srp]
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7dc0 bsp=e0000001ea8f1160
Apr 13 13:10:28 lab105 kernel:  [<a000000201b2f4d0>] scsi_try_host_reset+0xd0/0x240 [scsi_mod]
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7dc0 bsp=e0000001ea8f1130
Apr 13 13:10:28 lab105 kernel:  [<a000000201b320a0>] scsi_error_handler+0x1860/0x2000 [scsi_mod]
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7dc0 bsp=e0000001ea8f1040
Apr 13 13:10:28 lab105 kernel:  [<a0000001000b98e0>] kthread+0x220/0x280
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7e10 bsp=e0000001ea8f1000
Apr 13 13:10:28 lab105 kernel:  [<a000000100011440>] kernel_thread_helper+0xe0/0x100
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7e30 bsp=e0000001ea8f0fd0
Apr 13 13:10:28 lab105 kernel:  [<a000000100009140>] start_kernel_thread+0x20/0x40
Apr 13 13:10:28 lab105 kernel: sp=e0000001ea8f7e30 bsp=e0000001ea8f0fd0
Apr 13 13:10:35 lab105 kernel: <3>Slab corruption: start=e0000001e58f89f8, len=448
Apr 13 13:10:35 lab105 kernel: Redzone: 0x5a2cf071/0x5a2cf071.
Apr 13 13:10:35 lab105 kernel: Last user: [<a000000201b289f0>](scsi_put_command+0x150/0x1c0 [scsi_mod])
Apr 13 13:10:35 lab105 kernel: 1b0: 00 00 08 00 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b a5
Apr 13 13:10:35 lab105 kernel: Prev obj: start=e0000001e58f8820, len=448
Apr 13 13:10:35 lab105 kernel: Redzone: 0x5a2cf071/0x5a2cf071.
Apr 13 13:10:35 lab105 kernel: Last user: [<a000000201b289f0>](scsi_put_command+0x150/0x1c0 [scsi_mod])
Apr 13 13:10:35 lab105 kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b
Apr 13 13:10:35 lab105 kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b

_______________________________________________
openib-general mailing list
[email protected]
http://openib.org/mailman/listinfo/openib-general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
