> On Jan 6, 2016, at 1:16 PM, Chuck Lever <chuck.le...@oracle.com> wrote:
> 
> Encountered the below just after booting my NFS/RDMA
> server with 4.4.0-rc6-00011-g6948cb2 (k.o/for-4.5 plus
> my NFS/RDMA for-4.5 patches). The system is up and
> ping-able via eth0, but high-level networking (like sshd
> and nfsd) does not work, and my ib0 i/f is missing.
> 
> This is an x86_64 system with one CX-3 Pro HCA.

And appears to be 100% reproducible. Any debugging
advice welcome!


> All seems well with a stock v4.4-rc4 kernel.
> 
> 
> Jan  6 12:44:13 klimt kernel: <mlx4_ib> mlx4_ib_add: mlx4_ib: Mellanox 
> ConnectX InfiniBand driver v2.2-1 (Feb 2014)
> Jan  6 12:44:13 klimt kernel: <mlx4_ib> mlx4_ib_add: counter index 0 for port 
> 1 allocated 0
> Jan  6 12:44:13 klimt kernel: BUG: unable to handle kernel NULL pointer 
> dereference at           (null)
> Jan  6 12:44:13 klimt kernel: IP: [<ffffffff81652655>] 
> __mutex_lock_slowpath+0x75/0x120
> Jan  6 12:44:13 klimt kernel: PGD 853947067 PUD 8546cb067 PMD 0 
> Jan  6 12:44:13 klimt kernel: Oops: 0002 [#1] SMP 
> Jan  6 12:44:13 klimt kernel: Modules linked in: mlx4_ib(+) mlx4_en ib_sa 
> ib_mad ib_core vxlan ip6_udp_tunnel udp_tunnel ib_addr sr_mod cdrom sd_mod 
> ast drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm 
> mlx4_core igb ahci libahci libata ptp pps_core dca i2c_algo_bit i2c_core 
> dm_mirror dm_region_hash dm_log dm_mod
> Jan  6 12:44:13 klimt kernel: CPU: 3 PID: 431 Comm: modprobe Not tainted 
> 4.4.0-rc6-00011-g6948cb2 #79
> Jan  6 12:44:13 klimt kernel: Hardware name: Supermicro Super 
> Server/X10SRL-F, BIOS 1.0c 09/09/2015
> Jan  6 12:44:13 klimt kernel: task: ffff88085571aa80 ti: ffff88084f414000 
> task.ti: ffff88084f414000
> Jan  6 12:44:13 klimt kernel: RIP: 0010:[<ffffffff81652655>]  
> [<ffffffff81652655>] __mutex_lock_slowpath+0x75/0x120
> Jan  6 12:44:13 klimt kernel: RSP: 0018:ffff88084f417810  EFLAGS: 00010282
> Jan  6 12:44:13 klimt kernel: RAX: 0000000000000000 RBX: ffff88084f633950 
> RCX: ffff88085571aa80
> Jan  6 12:44:13 klimt kernel: RDX: 0000000000000001 RSI: ffff88085571aae0 
> RDI: ffff88084f633954
> Jan  6 12:44:13 klimt kernel: RBP: ffff88084f417858 R08: 0000000000000101 
> R09: ffff880854f02f00
> Jan  6 12:44:13 klimt kernel: R10: ffffffffa0150a85 R11: ffffea002156d400 
> R12: ffff88084f633954
> Jan  6 12:44:13 klimt kernel: R13: ffff88085571aa80 R14: 00000000ffffffff 
> R15: ffff88084f633958
> Jan  6 12:44:13 klimt kernel: FS:  00007f32227c0740(0000) 
> GS:ffff88087fcc0000(0000) knlGS:0000000000000000
> Jan  6 12:44:13 klimt kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 
> 0000000080050033
> Jan  6 12:44:13 klimt kernel: CR2: 0000000000000000 CR3: 0000000853cb6000 
> CR4: 00000000001406e0
> Jan  6 12:44:13 klimt kernel: Stack:
> Jan  6 12:44:13 klimt kernel: ffff88084f633958 0000000000000000 
> ffffffff81309502 000000003b473ac0
> Jan  6 12:44:13 klimt kernel: ffff88084f633950 ffff88084f417888 
> ffff88084f633940 ffff88084f633950
> Jan  6 12:44:13 klimt kernel: ffff88084f630000 ffff88084f417870 
> ffffffff8165271f ffff88084f630000
> Jan  6 12:44:13 klimt kernel: Call Trace:
> Jan  6 12:44:13 klimt kernel: [<ffffffff81309502>] ? 
> get_from_free_list+0x42/0x50
> Jan  6 12:44:13 klimt kernel: [<ffffffff8165271f>] mutex_lock+0x1f/0x2f
> Jan  6 12:44:13 klimt kernel: [<ffffffffa02d7af7>] 
> iboe_process_mad.isra.13+0x77/0x190 [mlx4_ib]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa02da3a4>] 
> mlx4_ib_process_mad+0x4d4/0x550 [mlx4_ib]
> Jan  6 12:44:13 klimt kernel: [<ffffffff8126256a>] ? 
> kernfs_next_descendant_post+0x1a/0x50
> Jan  6 12:44:13 klimt kernel: [<ffffffff81263332>] ? 
> kernfs_add_one+0x112/0x150
> Jan  6 12:44:13 klimt kernel: [<ffffffff811cc46d>] ? 
> kmem_cache_alloc_trace+0x3d/0x1d0
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0150a85>] ? get_perf_mad+0x85/0x160 
> [ib_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0150aee>] get_perf_mad+0xee/0x160 
> [ib_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0150b98>] 
> get_counter_table+0x38/0x70 [ib_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffff811cc528>] ? 
> kmem_cache_alloc_trace+0xf8/0x1d0
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0150dc2>] ? add_port+0xc2/0x450 
> [ib_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0150e0f>] add_port+0x10f/0x450 
> [ib_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0151238>] 
> ib_device_register_sysfs+0xe8/0x160 [ib_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0152280>] 
> ib_register_device+0x320/0x500 [ib_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffff810cb13b>] ? vprintk_default+0x3b/0x40
> Jan  6 12:44:13 klimt kernel: [<ffffffff8117080f>] ? printk+0x5d/0x74
> Jan  6 12:44:13 klimt kernel: [<ffffffffa02dee69>] mlx4_ib_add+0xbb9/0xfe0 
> [mlx4_ib]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa023f000>] ? 0xffffffffa023f000
> Jan  6 12:44:13 klimt kernel: [<ffffffffa0192f6f>] mlx4_add_device+0x3f/0xb0 
> [mlx4_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa023f000>] ? 0xffffffffa023f000
> Jan  6 12:44:13 klimt kernel: [<ffffffffa01930b2>] 
> mlx4_register_interface+0xd2/0x100 [mlx4_core]
> Jan  6 12:44:13 klimt kernel: [<ffffffffa023f04c>] mlx4_ib_init+0x4c/0x1000 
> [mlx4_ib]
> Jan  6 12:44:13 klimt kernel: [<ffffffff81002183>] do_one_initcall+0x113/0x1f0
> Jan  6 12:44:13 klimt kernel: [<ffffffff811afdd7>] ? __vunmap+0xd7/0x100
> Jan  6 12:44:13 klimt kernel: [<ffffffff811cc46d>] ? 
> kmem_cache_alloc_trace+0x3d/0x1d0
> Jan  6 12:44:13 klimt kernel: [<ffffffff811709c3>] ? do_init_module+0x27/0x1e8
> Jan  6 12:44:13 klimt kernel: [<ffffffff811709fc>] do_init_module+0x60/0x1e8
> Jan  6 12:44:13 klimt kernel: [<ffffffff810f7fb7>] load_module+0x12f7/0x1950
> Jan  6 12:44:13 klimt kernel: [<ffffffff810f4020>] ? store_uevent+0x70/0x70
> Jan  6 12:44:13 klimt kernel: [<ffffffff810f49d4>] ? 
> copy_module_from_fd.isra.37+0xb4/0x160
> Jan  6 12:44:13 klimt kernel: [<ffffffff810f881f>] SyS_finit_module+0x9f/0xd0
> Jan  6 12:44:13 klimt kernel: [<ffffffff8165446e>] 
> entry_SYSCALL_64_fastpath+0x12/0x71
> Jan  6 12:44:13 klimt kernel: Code: 04 4c 89 e7 e8 9d 1a 00 00 8b 03 83 f8 01 
> 74 25 48 8b 43 10 4c 8d 7b 08 48 89 63 10 41 be ff ff ff ff 4c 89 3c 24 48 89 
> 44 24 08 <48> 89 20 4c 89 6c 24 10 eb 0b 31 c0 87 03 83 f8 01 75 d2 eb 57 
> Jan  6 12:44:13 klimt kernel: RIP  [<ffffffff81652655>] 
> __mutex_lock_slowpath+0x75/0x120
> Jan  6 12:44:13 klimt kernel: RSP <ffff88084f417810>
> Jan  6 12:44:13 klimt kernel: CR2: 0000000000000000
> Jan  6 12:44:13 klimt kernel: ---[ end trace cea4b2a7abe96d8c ]---
> 
> 
> --
> Chuck Lever
> 
> 
> 
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majord...@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
Chuck Lever




--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to