Folks:

I have a multi-processor machine, running FedoraCore 12. I have installed OFED 
1.5. Everything seems to come up ok, I
can look at the ibstat and it shows that the Mellanox card stats etc...

As soon as I start opensm, I get the following kernel oops and the machine 
locks up.

Any ideas....

Thanks,
Suri

--------------------------------------------------------------------------------------------------

Oct 12 17:19:38 localhost OpenSM[2617]: OpenSM 3.3.5#012

Oct 12 17:19:38 localhost OpenSM[2617]: Entering DISCOVERING state#012

Oct 12 17:20:20 localhost kernel: ib0: ib_query_gid() failed

Oct 12 17:20:30 localhost kernel: ib0: ib_query_port failed

Oct 12 17:20:52 localhost kernel: BUG: soft lockup - CPU#15 stuck for 61s! 
[opensm:2637]

Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc ip6t_REJECT 
nf_conntrack_ipv6 ip6table_filter
ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm 
iw_cm ib_addr ib_ipoib ib_cm ib_sa ipv6
ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca 
ib_mad ib_core dm_multipath uinput mlx4_core
igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas 
scsi_transport_sas [last unloaded: microcode]

Oct 12 17:20:52 localhost kernel: CPU 15:

Oct 12 17:20:52 localhost kernel: Modules linked in: fuse sunrpc ip6t_REJECT 
nf_conntrack_ipv6 ip6table_filter
ip6_tables cpufreq_ondemand acpi_cpufreq freq_table rdma_ucm ib_sdp rdma_cm 
iw_cm ib_addr ib_ipoib ib_cm ib_sa ipv6
ib_uverbs ib_umad iw_nes libcrc32c iw_cxgb3 cxgb3 mlx4_en mlx4_ib ib_mthca 
ib_mad ib_core dm_multipath uinput mlx4_core
igb i2c_i801 joydev dca i2c_core iTCO_wdt iTCO_vendor_support mpt2sas 
scsi_transport_sas [last unloaded: microcode]

Oct 12 17:20:52 localhost kernel: Pid: 2637, comm: opensm Not tainted 
2.6.31.5-127.fc12.x86_64 #1 X8DTH-i/6/iF/6F

Oct 12 17:20:52 localhost kernel: RIP: 0010:[<ffffffff81203558>]  
[<ffffffff81203558>] __bitmap_empty+0x0/0x64

Oct 12 17:20:52 localhost kernel: RSP: 0018:ffff880c174bbd90  EFLAGS: 00000246

Oct 12 17:20:52 localhost kernel: RAX: 0000000000000000 RBX: ffff880c174bbdd8 
RCX: 0000000000000001

Oct 12 17:20:52 localhost kernel: RDX: ffffffff818ba920 RSI: 0000000000000100 
RDI: ffffffff818ba918

Oct 12 17:20:52 localhost kernel: RBP: ffffffff8101286e R08: 0000000000000000 
R09: 0000000000000004

Oct 12 17:20:52 localhost kernel: R10: 0000000000000004 R11: 0000000000000206 
R12: ffff880c174bbdd8

Oct 12 17:20:52 localhost kernel: R13: ffffffff8101286e R14: ffffffff810dc920 
R15: ffff880c174bbcf8

Oct 12 17:20:52 localhost kernel: FS:  00007ff2d02e7710(0000) 
GS:ffffc90001e00000(0000) knlGS:0000000000000000

Oct 12 17:20:52 localhost kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 
0000000080050033

Oct 12 17:20:52 localhost kernel: CR2: 000000000041f0c0 CR3: 0000000c19074000 
CR4: 00000000000006e0

Oct 12 17:20:52 localhost kernel: DR0: 0000000000000000 DR1: 0000000000000000 
DR2: 0000000000000000

Oct 12 17:20:52 localhost kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 
DR7: 0000000000000400

Oct 12 17:20:52 localhost kernel: Call Trace:

Oct 12 17:20:52 localhost kernel: [<ffffffff810383f2>] ? 
native_flush_tlb_others+0xc3/0xf2

Oct 12 17:20:52 localhost kernel: [<ffffffff8103859d>] ? flush_tlb_mm+0x6f/0x76

Oct 12 17:20:52 localhost kernel: [<ffffffff810debbc>] ? 
mprotect_fixup+0x480/0x611

Oct 12 17:20:52 localhost kernel: [<ffffffff810da81d>] ? free_pgtables+0xa9/0xcc

Oct 12 17:20:52 localhost kernel: [<ffffffff810f185d>] ? 
virt_to_head_page+0xe/0x2f

Oct 12 17:20:52 localhost kernel: [<ffffffff810deee9>] ? 
sys_mprotect+0x19c/0x227

Oct 12 17:20:52 localhost kernel: [<ffffffff81011cf2>] ? 
system_call_fastpath+0x16/0x1b

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to