I tried 1.5.2 and that did not help, same kernel oops..... > -----Original Message----- > From: [email protected] > [mailto:[email protected]] On Behalf Of Suresh > Shelvapille > Sent: Tuesday, October 12, 2010 7:22 PM > To: 'Linux RDMA list' > Subject: Opensm crash with OFED 1.5 > > > Folks: > > I have a multi-processor machine, running FedoraCore 12. I have installed > OFED 1.5. Everything seems > to come up ok, I > can look at the ibstat and it shows that the Mellanox card stats etc... > > As soon as I start opensm, I get the following kernel oops and the machine > locks up. > > Any ideas.... > > Thanks, > Suri > > -------------------------------------------------------------------------------------------------- > > Oct 12 17:19:38 localhost OpenSM[2617]: OpenSM 3.3.5#012 > > Oct 12 17:19:38 localhost OpenSM[2617]: Entering DISCOVERING state#012 > > Oct 12 17:20:20 localhost kernel: ib0: ib_query_gid() failed > > Oct 12 17:20:30 localhost kernel: ib0: ib_query_port failed > > Oct 12 17:20:52 localhost kernel: BUG: soft lockup - CPU#15 stuck for 61s! > [opensm:2637] > > Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc ip6t_REJECT > nf_conntrack_ipv6 > ip6table_filter > ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm > iw_cm ib_addr ib_ipoib > ib_cm ib_sa ipv6 > ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca > ib_mad ib_core > dm_multipath uinput mlx4_core > igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas > scsi_transport_sas [last > unloaded: microcode] > > Oct 12 17:20:52 localhost kernel: CPU 15: > > Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc ip6t_REJECT > nf_conntrack_ipv6 > ip6table_filter > ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm > iw_cm ib_addr ib_ipoib > ib_cm ib_sa ipv6 > ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca > ib_mad ib_core > dm_multipath uinput mlx4_core > igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas > scsi_transport_sas [last > unloaded: microcode] > > Oct 12 17:20:52 localhost kernel: Pid: 2637, comm: opensm Not tainted > 2.6.31.5-127.fc12.x86_64 #1 > X8DTH-i/6/iF/6F > > Oct 12 17:20:52 localhost kernel: RIP: 0010:[<ffffffff81203558>] > [<ffffffff81203558>] > __bitmap_empty+0x0/0x64 > > Oct 12 17:20:52 localhost kernel: RSP: 0018:ffff880c174bbd90 EFLAGS: 00000246 > > Oct 12 17:20:52 localhost kernel: RAX: 0000000000000000 RBX: ffff880c174bbdd8 > RCX: 0000000000000001 > > Oct 12 17:20:52 localhost kernel: RDX: ffffffff818ba920 RSI: 0000000000000100 > RDI: ffffffff818ba918 > > Oct 12 17:20:52 localhost kernel: RBP: ffffffff8101286e R08: 0000000000000000 > R09: 0000000000000004 > > Oct 12 17:20:52 localhost kernel: R10: 0000000000000004 R11: 0000000000000206 > R12: ffff880c174bbdd8 > > Oct 12 17:20:52 localhost kernel: R13: ffffffff8101286e R14: ffffffff810dc920 > R15: ffff880c174bbcf8 > > Oct 12 17:20:52 localhost kernel: FS: 00007ff2d02e7710(0000) > GS:ffffc90001e00000(0000) > knlGS:0000000000000000 > > Oct 12 17:20:52 localhost kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > > Oct 12 17:20:52 localhost kernel: CR2: 000000000041f0c0 CR3: 0000000c19074000 > CR4: 00000000000006e0 > > Oct 12 17:20:52 localhost kernel: DR0: 0000000000000000 DR1: 0000000000000000 > DR2: 0000000000000000 > > Oct 12 17:20:52 localhost kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 > DR7: 0000000000000400 > > Oct 12 17:20:52 localhost kernel: Call Trace: > > Oct 12 17:20:52 localhost kernel: [<ffffffff810383f2>] ? > native_flush_tlb_others+0xc3/0xf2 > > Oct 12 17:20:52 localhost kernel: [<ffffffff8103859d>] ? > flush_tlb_mm+0x6f/0x76 > > Oct 12 17:20:52 localhost kernel: [<ffffffff810debbc>] ? > mprotect_fixup+0x480/0x611 > > Oct 12 17:20:52 localhost kernel: [<ffffffff810da81d>] ? > free_pgtables+0xa9/0xcc > > Oct 12 17:20:52 localhost kernel: [<ffffffff810f185d>] ? > virt_to_head_page+0xe/0x2f > > Oct 12 17:20:52 localhost kernel: [<ffffffff810deee9>] ? > sys_mprotect+0x19c/0x227 > > Oct 12 17:20:52 localhost kernel: [<ffffffff81011cf2>] ? > system_call_fastpath+0x16/0x1b > > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to [email protected] > More majordomo info at http://vger.kernel.org/majordomo-info.html
-- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to [email protected] More majordomo info at http://vger.kernel.org/majordomo-info.html
