Jon Mason wrote:
On Fri, Mar 20, 2009 at 04:17:56PM -0700, Vu Pham wrote:
Hi Jon,

I ran connectathon test -N100 and get this crash on the server. Both server/client are RHEL 5.2 x64 with connectX HCAs

Should I open a bug# on bugzilla?

If you hit a bug, you should open one in bugzilla so it can be tracked.

OK - I just open bug #1571
Do you see the same behavior on mainline or is this isolated to the
RHEL5.2 backport?

I run server on mainline kernel 2.6.27. The server fail at same place; however, it does not crash

general protection fault: 0000 [1] svcrdma: error fast registering xdr for xprt
ffff81022e4f0c00SMP

thanks,
-vu

Thanks,
Jon

thanks,
-vu

Mar 20 10:05:34 vlab-007 kernel: EXT3-fs: mounted filesystem with ordered data m
ode. general protection fault: 0000 [1] svcrdma: error fast registering xdr for xprt ffff81022e4f0c00SMP last sysfs file: /devices/pci0000:00/0000:00:00.0/local_cpus CPU 4 Modules linked in: svcrdma(U) nfsd(U) lockd(U) nfs_acl(U) auth_rpcgss(U) exportf
s(U) autofs4 hidp rfcomm l2cap bluetooth sunrpc(U) rdma_ucm(U) rdma_cm(U) iw_cm(
U) ib_addr(U) ib_ipoib(U) ipoib_helper(U) ib_cm(U) ib_sa(U) ipv6 xfrm_nalgo cryp
to_api ib_uverbs(U) ib_umad(U) mlx4_ib(U) dm_mirror dm_multipath dm_mod raid0 vi
deo sbs backlight i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc lp parport i2c_i801 mlx4_core(U) e1000e serio_raw pcspkr ib_mthca(U) shpchp i2c_
core ib_mad(U) ib_core(U) sg ata_piix libata mptsas mptscsih mptbase scsi_transp
ort_sas sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd Pid: 0, comm: swapper Tainted: G 2.6.18-92.el5 #1 RIP: 0010:[<ffffffff80149991>] [<ffffffff80149991>] mark_clean+0x50/0x77 RSP: 0018:ffff81022fc6fe18 EFLAGS: 00010202 RAX: 5b98396687b9ba94 RBX: ffff81022349d0c0 RCX: 0000000000000080 RDX: 0000140d41402000 RSI: 0140d41402000000 RDI: 0140551402001000 RBP: 0140d41402000000 R08: 0140551402001000 R09: 5b98396687b9be94 R10: ffff81022fd68038 R11: ffffffff800928d3 R12: ffff81022e4f0c00 R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff803c82e0 FS: 0000000000000000(0000) GS:ffff81022fc20d40(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 00002b5f92237000 CR3: 0000000215470000 CR4: 00000000000006e0 Process swapper (pid: 0, threadinfo ffff81022fc66000, task ffff81022fc21080) Stack: ffffffff885a45da ffff81022e4f0e60 ffff81022e4f0c00 ffff8102150d6140 ffff8102152ca1c0 ffff81022c7e6600 ffffffff885a4a8f ffff8102150d6140 ffffffff00000004 0000000000000032 ffff81022a338200 ffff81022a338af0 Call Trace: <IRQ> [<ffffffff885a45da>] :svcrdma:svc_rdma_put_frmr+0xbc/0x117 [<ffffffff885a4a8f>] :svcrdma:sq_cq_reap+0x11a/0x1a8 [<ffffffff80064a81>] _spin_lock_bh+0x9/0x14 [<ffffffff885a53f8>] :svcrdma:dto_tasklet_func+0x13a/0x17a [<ffffffff8821238d>] :mlx4_core:mlx4_eq_int+0x27e/0x28f [<ffffffff800928d3>] tasklet_action+0x62/0xac [<ffffffff80011ed2>] __do_softirq+0x5e/0xd6 [<ffffffff801549f5>] end_msi_irq_w_maskbit+0xf/0x1c [<ffffffff8005e2fc>] call_softirq+0x1c/0x28 [<ffffffff8006c571>] do_softirq+0x2c/0x85 [<ffffffff8006c3f9>] do_IRQ+0xec/0xf5 [<ffffffff80056c64>] mwait_idle+0x0/0x4a [<ffffffff8005d615>] ret_from_intr+0x0/0xa <EOI> [<ffffffff80056c9a>] mwait_idle+0x36/0x4a [<ffffffff80048b1d>] cpu_idle+0x95/0xb8 [<ffffffff80076667>] start_secondary+0x45a/0x469 Code: 49 8b 01 48 6b d2 38 48 83 e0 fc 48 01 d0 f0 0f ba 28 09 49 RIP [<ffffffff80149991>] mark_clean+0x50/0x77 RSP <ffff81022fc6fe18>


_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to