Folks: I have a multi-processor machine running Fedora Core 12, and I have installed OFED 1.5. Everything seems to come up OK: I can run ibstat, and it shows the Mellanox card stats, etc.
As soon as I start opensm, I get the following kernel oops and the machine locks up. Any ideas?

Thanks,
Suri

--------------------------------------------------------------------------------------------------
Oct 12 17:19:38 localhost OpenSM[2617]: OpenSM 3.3.5#012
Oct 12 17:19:38 localhost OpenSM[2617]: Entering DISCOVERING state#012
Oct 12 17:20:20 localhost kernel: ib0: ib_query_gid() failed
Oct 12 17:20:30 localhost kernel: ib0: ib_query_port failed
Oct 12 17:20:52 localhost kernel: BUG: soft lockup - CPU#15 stuck for 61s! [opensm:2637]
Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm iw_cm ib_addr ib_ipoib ib_cm ib_sa ipv6 ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca ib_mad ib_core dm_multipath uinput mlx4_core igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas scsi_transport_sas [last unloaded: microcode]
Oct 12 17:20:52 localhost kernel: CPU 15:
Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm iw_cm ib_addr ib_ipoib ib_cm ib_sa ipv6 ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca ib_mad ib_core dm_multipath uinput mlx4_core igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas scsi_transport_sas [last unloaded: microcode]
Oct 12 17:20:52 localhost kernel: Pid: 2637, comm: opensm Not tainted 2.6.31.5-127.fc12.x86_64 #1 X8DTH-i/6/iF/6F
Oct 12 17:20:52 localhost kernel: RIP: 0010:[<ffffffff81203558>] [<ffffffff81203558>] __bitmap_empty+0x0/0x64
Oct 12 17:20:52 localhost kernel: RSP: 0018:ffff880c174bbd90 EFLAGS: 00000246
Oct 12 17:20:52 localhost kernel: RAX: 0000000000000000 RBX: ffff880c174bbdd8 RCX: 0000000000000001
Oct 12 17:20:52 localhost kernel: RDX: ffffffff818ba920 RSI: 0000000000000100 RDI: ffffffff818ba918
Oct 12 17:20:52 localhost kernel: RBP: ffffffff8101286e R08: 0000000000000000 R09: 0000000000000004
Oct 12 17:20:52 localhost kernel: R10: 0000000000000004 R11: 0000000000000206 R12: ffff880c174bbdd8
Oct 12 17:20:52 localhost kernel: R13: ffffffff8101286e R14: ffffffff810dc920 R15: ffff880c174bbcf8
Oct 12 17:20:52 localhost kernel: FS: 00007ff2d02e7710(0000) GS:ffffc90001e00000(0000) knlGS:0000000000000000
Oct 12 17:20:52 localhost kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 12 17:20:52 localhost kernel: CR2: 000000000041f0c0 CR3: 0000000c19074000 CR4: 00000000000006e0
Oct 12 17:20:52 localhost kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Oct 12 17:20:52 localhost kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Oct 12 17:20:52 localhost kernel: Call Trace:
Oct 12 17:20:52 localhost kernel: [<ffffffff810383f2>] ? native_flush_tlb_others+0xc3/0xf2
Oct 12 17:20:52 localhost kernel: [<ffffffff8103859d>] ? flush_tlb_mm+0x6f/0x76
Oct 12 17:20:52 localhost kernel: [<ffffffff810debbc>] ? mprotect_fixup+0x480/0x611
Oct 12 17:20:52 localhost kernel: [<ffffffff810da81d>] ? free_pgtables+0xa9/0xcc
Oct 12 17:20:52 localhost kernel: [<ffffffff810f185d>] ? virt_to_head_page+0xe/0x2f
Oct 12 17:20:52 localhost kernel: [<ffffffff810deee9>] ? sys_mprotect+0x19c/0x227
Oct 12 17:20:52 localhost kernel: [<ffffffff81011cf2>] ? system_call_fastpath+0x16/0x1b

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
