Just want to let you all know that OpenSM seems to work fine with Centos5.5 on the same HW.
Thanks, Suri > -----Original Message----- > From: [email protected] > [mailto:[email protected]] On Behalf Of Suresh > Shelvapille > Sent: Wednesday, October 13, 2010 3:07 PM > To: 'Linux RDMA list'; 'Tziporet Koren' > Subject: RE: Opensm crash with OFED 1.5 > > > I tried 1.5.2 and that did not help, same kernel oops..... > > > -----Original Message----- > > From: [email protected] > > [mailto:[email protected]] On Behalf Of > Suresh > > Shelvapille > > Sent: Tuesday, October 12, 2010 7:22 PM > > To: 'Linux RDMA list' > > Subject: Opensm crash with OFED 1.5 > > > > > > Folks: > > > > I have a multi-processor machine, running FedoraCore 12. I have installed > > OFED 1.5. Everything > seems > > to come up ok, I > > can look at the ibstat and it shows that the Mellanox card stats etc... > > > > As soon as I start opensm, I get the following kernel oops and the machine > > locks up. > > > > Any ideas.... > > > > Thanks, > > Suri > > > > -------------------------------------------------------------------------------------------------- > > > > Oct 12 17:19:38 localhost OpenSM[2617]: OpenSM 3.3.5#012 > > > > Oct 12 17:19:38 localhost OpenSM[2617]: Entering DISCOVERING state#012 > > > > Oct 12 17:20:20 localhost kernel: ib0: ib_query_gid() failed > > > > Oct 12 17:20:30 localhost kernel: ib0: ib_query_port failed > > > > Oct 12 17:20:52 localhost kernel: BUG: soft lockup - CPU#15 stuck for 61s! > > [opensm:2637] > > > > Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc > > ip6t_REJECT nf_conntrack_ipv6 > > ip6table_filter > > ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm > > iw_cm ib_addr ib_ipoib > > ib_cm ib_sa ipv6 > > ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca > > ib_mad ib_core > > dm_multipath uinput mlx4_core > > igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas > > scsi_transport_sas [last > > unloaded: microcode] > > > > Oct 12 17:20:52 localhost kernel: CPU 15: > > > > Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc > > ip6t_REJECT nf_conntrack_ipv6 > > ip6table_filter > > ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm > > iw_cm ib_addr ib_ipoib > > ib_cm ib_sa ipv6 > > ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca > > ib_mad ib_core > > dm_multipath uinput mlx4_core > > igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas > > scsi_transport_sas [last > > unloaded: microcode] > > > > Oct 12 17:20:52 localhost kernel: Pid: 2637, comm: opensm Not tainted > > 2.6.31.5-127.fc12.x86_64 #1 > > X8DTH-i/6/iF/6F > > > > Oct 12 17:20:52 localhost kernel: RIP: 0010:[<ffffffff81203558>] > > [<ffffffff81203558>] > > __bitmap_empty+0x0/0x64 > > > > Oct 12 17:20:52 localhost kernel: RSP: 0018:ffff880c174bbd90 EFLAGS: > > 00000246 > > > > Oct 12 17:20:52 localhost kernel: RAX: 0000000000000000 RBX: > > ffff880c174bbdd8 RCX: 0000000000000001 > > > > Oct 12 17:20:52 localhost kernel: RDX: ffffffff818ba920 RSI: > > 0000000000000100 RDI: ffffffff818ba918 > > > > Oct 12 17:20:52 localhost kernel: RBP: ffffffff8101286e R08: > > 0000000000000000 R09: 0000000000000004 > > > > Oct 12 17:20:52 localhost kernel: R10: 0000000000000004 R11: > > 0000000000000206 R12: ffff880c174bbdd8 > > > > Oct 12 17:20:52 localhost kernel: R13: ffffffff8101286e R14: > > ffffffff810dc920 R15: ffff880c174bbcf8 > > > > Oct 12 17:20:52 localhost kernel: FS: 00007ff2d02e7710(0000) > > GS:ffffc90001e00000(0000) > > knlGS:0000000000000000 > > > > Oct 12 17:20:52 localhost kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > > 0000000080050033 > > > > Oct 12 17:20:52 localhost kernel: CR2: 000000000041f0c0 CR3: > > 0000000c19074000 CR4: 00000000000006e0 > > > > Oct 12 17:20:52 localhost kernel: DR0: 0000000000000000 DR1: > > 0000000000000000 DR2: 0000000000000000 > > > > Oct 12 17:20:52 localhost kernel: DR3: 0000000000000000 DR6: > > 00000000ffff0ff0 DR7: 0000000000000400 > > > > Oct 12 17:20:52 localhost kernel: Call Trace: > > > > Oct 12 17:20:52 localhost kernel: [<ffffffff810383f2>] ? > > native_flush_tlb_others+0xc3/0xf2 > > > > Oct 12 17:20:52 localhost kernel: [<ffffffff8103859d>] ? > > flush_tlb_mm+0x6f/0x76 > > > > Oct 12 17:20:52 localhost kernel: [<ffffffff810debbc>] ? > > mprotect_fixup+0x480/0x611 > > > > Oct 12 17:20:52 localhost kernel: [<ffffffff810da81d>] ? > > free_pgtables+0xa9/0xcc > > > > Oct 12 17:20:52 localhost kernel: [<ffffffff810f185d>] ? > > virt_to_head_page+0xe/0x2f > > > > Oct 12 17:20:52 localhost kernel: [<ffffffff810deee9>] ? > > sys_mprotect+0x19c/0x227 > > > > Oct 12 17:20:52 localhost kernel: [<ffffffff81011cf2>] ? > > system_call_fastpath+0x16/0x1b > > > > -- > > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > > the body of a message to [email protected] > > More majordomo info at http://vger.kernel.org/majordomo-info.html > > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to [email protected] > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to [email protected] More majordomo info at http://vger.kernel.org/majordomo-info.html
