Encountered the below just after booting my NFS/RDMA server with 4.4.0-rc6-00011-g6948cb2 (k.o/for-4.5 plus my NFS/RDMA for-4.5 patches). The system is up and ping-able via eth0, but high-level networking (like sshd and nfsd) does not work, and my ib0 i/f is missing.
This is an x86_64 system with one CX-3 Pro HCA. All seems well with a stock v4.4-rc4 kernel. Jan 6 12:44:13 klimt kernel: <mlx4_ib> mlx4_ib_add: mlx4_ib: Mellanox ConnectX InfiniBand driver v2.2-1 (Feb 2014) Jan 6 12:44:13 klimt kernel: <mlx4_ib> mlx4_ib_add: counter index 0 for port 1 allocated 0 Jan 6 12:44:13 klimt kernel: BUG: unable to handle kernel NULL pointer dereference at (null) Jan 6 12:44:13 klimt kernel: IP: [<ffffffff81652655>] __mutex_lock_slowpath+0x75/0x120 Jan 6 12:44:13 klimt kernel: PGD 853947067 PUD 8546cb067 PMD 0 Jan 6 12:44:13 klimt kernel: Oops: 0002 [#1] SMP Jan 6 12:44:13 klimt kernel: Modules linked in: mlx4_ib(+) mlx4_en ib_sa ib_mad ib_core vxlan ip6_udp_tunnel udp_tunnel ib_addr sr_mod cdrom sd_mod ast drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm mlx4_core igb ahci libahci libata ptp pps_core dca i2c_algo_bit i2c_core dm_mirror dm_region_hash dm_log dm_mod Jan 6 12:44:13 klimt kernel: CPU: 3 PID: 431 Comm: modprobe Not tainted 4.4.0-rc6-00011-g6948cb2 #79 Jan 6 12:44:13 klimt kernel: Hardware name: Supermicro Super Server/X10SRL-F, BIOS 1.0c 09/09/2015 Jan 6 12:44:13 klimt kernel: task: ffff88085571aa80 ti: ffff88084f414000 task.ti: ffff88084f414000 Jan 6 12:44:13 klimt kernel: RIP: 0010:[<ffffffff81652655>] [<ffffffff81652655>] __mutex_lock_slowpath+0x75/0x120 Jan 6 12:44:13 klimt kernel: RSP: 0018:ffff88084f417810 EFLAGS: 00010282 Jan 6 12:44:13 klimt kernel: RAX: 0000000000000000 RBX: ffff88084f633950 RCX: ffff88085571aa80 Jan 6 12:44:13 klimt kernel: RDX: 0000000000000001 RSI: ffff88085571aae0 RDI: ffff88084f633954 Jan 6 12:44:13 klimt kernel: RBP: ffff88084f417858 R08: 0000000000000101 R09: ffff880854f02f00 Jan 6 12:44:13 klimt kernel: R10: ffffffffa0150a85 R11: ffffea002156d400 R12: ffff88084f633954 Jan 6 12:44:13 klimt kernel: R13: ffff88085571aa80 R14: 00000000ffffffff R15: ffff88084f633958 Jan 6 12:44:13 klimt kernel: FS: 00007f32227c0740(0000) GS:ffff88087fcc0000(0000) knlGS:0000000000000000 Jan 6 12:44:13 klimt kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jan 6 12:44:13 klimt kernel: CR2: 0000000000000000 CR3: 0000000853cb6000 CR4: 00000000001406e0 Jan 6 12:44:13 klimt kernel: Stack: Jan 6 12:44:13 klimt kernel: ffff88084f633958 0000000000000000 ffffffff81309502 000000003b473ac0 Jan 6 12:44:13 klimt kernel: ffff88084f633950 ffff88084f417888 ffff88084f633940 ffff88084f633950 Jan 6 12:44:13 klimt kernel: ffff88084f630000 ffff88084f417870 ffffffff8165271f ffff88084f630000 Jan 6 12:44:13 klimt kernel: Call Trace: Jan 6 12:44:13 klimt kernel: [<ffffffff81309502>] ? get_from_free_list+0x42/0x50 Jan 6 12:44:13 klimt kernel: [<ffffffff8165271f>] mutex_lock+0x1f/0x2f Jan 6 12:44:13 klimt kernel: [<ffffffffa02d7af7>] iboe_process_mad.isra.13+0x77/0x190 [mlx4_ib] Jan 6 12:44:13 klimt kernel: [<ffffffffa02da3a4>] mlx4_ib_process_mad+0x4d4/0x550 [mlx4_ib] Jan 6 12:44:13 klimt kernel: [<ffffffff8126256a>] ? kernfs_next_descendant_post+0x1a/0x50 Jan 6 12:44:13 klimt kernel: [<ffffffff81263332>] ? kernfs_add_one+0x112/0x150 Jan 6 12:44:13 klimt kernel: [<ffffffff811cc46d>] ? kmem_cache_alloc_trace+0x3d/0x1d0 Jan 6 12:44:13 klimt kernel: [<ffffffffa0150a85>] ? get_perf_mad+0x85/0x160 [ib_core] Jan 6 12:44:13 klimt kernel: [<ffffffffa0150aee>] get_perf_mad+0xee/0x160 [ib_core] Jan 6 12:44:13 klimt kernel: [<ffffffffa0150b98>] get_counter_table+0x38/0x70 [ib_core] Jan 6 12:44:13 klimt kernel: [<ffffffff811cc528>] ? kmem_cache_alloc_trace+0xf8/0x1d0 Jan 6 12:44:13 klimt kernel: [<ffffffffa0150dc2>] ? add_port+0xc2/0x450 [ib_core] Jan 6 12:44:13 klimt kernel: [<ffffffffa0150e0f>] add_port+0x10f/0x450 [ib_core] Jan 6 12:44:13 klimt kernel: [<ffffffffa0151238>] ib_device_register_sysfs+0xe8/0x160 [ib_core] Jan 6 12:44:13 klimt kernel: [<ffffffffa0152280>] ib_register_device+0x320/0x500 [ib_core] Jan 6 12:44:13 klimt kernel: [<ffffffff810cb13b>] ? vprintk_default+0x3b/0x40 Jan 6 12:44:13 klimt kernel: [<ffffffff8117080f>] ? printk+0x5d/0x74 Jan 6 12:44:13 klimt kernel: [<ffffffffa02dee69>] mlx4_ib_add+0xbb9/0xfe0 [mlx4_ib] Jan 6 12:44:13 klimt kernel: [<ffffffffa023f000>] ? 0xffffffffa023f000 Jan 6 12:44:13 klimt kernel: [<ffffffffa0192f6f>] mlx4_add_device+0x3f/0xb0 [mlx4_core] Jan 6 12:44:13 klimt kernel: [<ffffffffa023f000>] ? 0xffffffffa023f000 Jan 6 12:44:13 klimt kernel: [<ffffffffa01930b2>] mlx4_register_interface+0xd2/0x100 [mlx4_core] Jan 6 12:44:13 klimt kernel: [<ffffffffa023f04c>] mlx4_ib_init+0x4c/0x1000 [mlx4_ib] Jan 6 12:44:13 klimt kernel: [<ffffffff81002183>] do_one_initcall+0x113/0x1f0 Jan 6 12:44:13 klimt kernel: [<ffffffff811afdd7>] ? __vunmap+0xd7/0x100 Jan 6 12:44:13 klimt kernel: [<ffffffff811cc46d>] ? kmem_cache_alloc_trace+0x3d/0x1d0 Jan 6 12:44:13 klimt kernel: [<ffffffff811709c3>] ? do_init_module+0x27/0x1e8 Jan 6 12:44:13 klimt kernel: [<ffffffff811709fc>] do_init_module+0x60/0x1e8 Jan 6 12:44:13 klimt kernel: [<ffffffff810f7fb7>] load_module+0x12f7/0x1950 Jan 6 12:44:13 klimt kernel: [<ffffffff810f4020>] ? store_uevent+0x70/0x70 Jan 6 12:44:13 klimt kernel: [<ffffffff810f49d4>] ? copy_module_from_fd.isra.37+0xb4/0x160 Jan 6 12:44:13 klimt kernel: [<ffffffff810f881f>] SyS_finit_module+0x9f/0xd0 Jan 6 12:44:13 klimt kernel: [<ffffffff8165446e>] entry_SYSCALL_64_fastpath+0x12/0x71 Jan 6 12:44:13 klimt kernel: Code: 04 4c 89 e7 e8 9d 1a 00 00 8b 03 83 f8 01 74 25 48 8b 43 10 4c 8d 7b 08 48 89 63 10 41 be ff ff ff ff 4c 89 3c 24 48 89 44 24 08 <48> 89 20 4c 89 6c 24 10 eb 0b 31 c0 87 03 83 f8 01 75 d2 eb 57 Jan 6 12:44:13 klimt kernel: RIP [<ffffffff81652655>] __mutex_lock_slowpath+0x75/0x120 Jan 6 12:44:13 klimt kernel: RSP <ffff88084f417810> Jan 6 12:44:13 klimt kernel: CR2: 0000000000000000 Jan 6 12:44:13 klimt kernel: ---[ end trace cea4b2a7abe96d8c ]--- -- Chuck Lever -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html