[Kernel-packages] [Bug 1704437] Re: [Bug] Crystal Ridge - libnvdimm: debug create-namespace sysfs collision report from Jeff
** Description changed: Description If you run ndctl create-namespace simultaneously on every cpu on a 72 cpu system, for exactly the same namespace, things break. Shocking, I know. I've attached the script and the panic. Not high priority, I don't think. I tried to attach the shell script, but your email server rejected it. So, it's inline below. Search for thrash-ns.sh. Cheers, Jeff [ 71.912227] ---[ cut here ]--- [ 71.935600] WARNING: CPU: 39 PID: 1039 at fs/sysfs/group.c:237 sysfs_remove_group+0x8e/0x90 [ 71.978797] sysfs group 'power' not found for kobject 'btt0.1' [ 72.004940] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul vfat ghash_clmulni_intel fat pcbc aesni_intel nd_pmem crypto_simd ipmi_si glue_helper dax_pmem iTCO_wdt nd_btt iTCO_vendor_support dax cryptd ipmi_devintf hpwdt lpc_ich hpilo pcspkr i2c_i801 ioatdma sg ipmi_msghandler mfd_core wmi shpchp dca nfit acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs sr_mod cdrom sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea [ 72.326921] sysfillrect sysimgblt fb_sys_fops ttm bnx2x mdio ahci tg3 drm hpsa libahci ptp nvme libata pps_core i2c_core libcrc32c crc32c_intel scsi_transport_sas nvme_core dm_mirror dm_region_hash dm_log dm_mod [ 72.411729] CPU: 39 PID: 1039 Comm: kworker/u145:2 Not tainted 4.11.0-rc1+ #25 [ 72.444221] Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 10/05/2016 [ 72.486852] Workqueue: events_unbound async_run_entry_fn [ 72.512078] Call Trace: [ 72.523223] dump_stack+0x63/0x87 [ 72.538230] __warn+0xd1/0xf0 [ 72.551520] warn_slowpath_fmt+0x5f/0x80 [ 72.569169] ? kernfs_find_and_get_ns+0x48/0x60 [ 72.589568] sysfs_remove_group+0x8e/0x90 [ 72.607495] dpm_sysfs_remove+0x57/0x60 [ 72.624715] device_del+0x111/0x350 [ 72.640292] ? pick_next_task_fair+0x122/0x540 [ 72.660482] device_unregister+0x1a/0x60 [ 72.678191] nd_async_device_unregister+0x22/0x30 [ 72.699306] async_run_entry_fn+0x39/0x170 [ 72.717642] process_one_work+0x165/0x410 [ 72.735566] worker_thread+0x137/0x4c0 [ 72.752370] kthread+0x101/0x140 [ 72.766449] ? rescuer_thread+0x3b0/0x3b0 [ 72.784357] ? kthread_park+0x90/0x90 [ 72.800810] ret_from_fork+0x2c/0x40 [ 72.816960] --[ end trace cb734114d5364186 ]-- [ 72.837661] BUG: unable to handle kernel NULL pointer dereference at 0020 [ 72.872912] IP: klist_put+0x1b/0x90 [ 72.31] PGD 0 [ 72.31] [ 72.904390] Oops: 1 SMP [ 72.918698] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul vfat ghash_clmulni_intel fat pcbc aesni_intel nd_pmem crypto_simd ipmi_si glue_helper dax_pmem iTCO_wdt nd_btt iTCO_vendor_support dax cryptd ipmi_devintf hpwdt lpc_ich hpilo pcspkr i2c_i801 ioatdma sg ipmi_msghandler mfd_core wmi shpchp dca nfit acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs sr_mod cdrom sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea [ 73.245742] sysfillrect sysimgblt fb_sys_fops ttm bnx2x mdio ahci tg3 drm hpsa libahci ptp nvme libata pps_core i2c_core libcrc32c crc32c_intel scsi_transport_sas nvme_core dm_mirror dm_region_hash dm_log dm_mod [ 73.331485] CPU: 39 PID: 1039 Comm: kworker/u145:2 Tainted: G W 4.11.0-rc1+ #25 [ 73.370525] Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 10/05/2016 [ 73.408108] Workqueue: events_unbound async_run_entry_fn [ 73.432199] task: 88046523 task.stack: c900061f4000 [ 73.458817] RIP: 0010:klist_put+0x1b/0x90 [ 73.479280] RSP: 0018:c900061f7d68 EFLAGS: 00010246 [ 73.507441] RAX: 88017ccd4b40 RBX: RCX: [ 73.539498] RDX: RSI: 0001 RDI: [ 73.571576] RBP: c900061f7d88 R08: fffe R09: 00047417 [ 73.603526] R10: 0005 R11: 00047416 R12: 88017ccd4b68 [ 73.635479] R13: 88017d558800 R14: 0001 R15: 880469cf44b0 [ 73.667484] FS: ()
[Kernel-packages] [Bug 1704437] Re: [Bug] Crystal Ridge - libnvdimm: debug create-namespace sysfs collision report from Jeff
** Description changed: Description If you run ndctl create-namespace simultaneously on every cpu on a 72 cpu system, for exactly the same namespace, things break. Shocking, I know. I've attached the script and the panic. Not high priority, I don't think. I tried to attach the shell script, but your email server rejected it. So, it's inline below. Search for thrash-ns.sh. Cheers, Jeff [ 71.912227] ---[ cut here ]--- [ 71.935600] WARNING: CPU: 39 PID: 1039 at fs/sysfs/group.c:237 sysfs_remove_group+0x8e/0x90 [ 71.978797] sysfs group 'power' not found for kobject 'btt0.1' [ 72.004940] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul vfat ghash_clmulni_intel fat pcbc aesni_intel nd_pmem crypto_simd ipmi_si glue_helper dax_pmem iTCO_wdt nd_btt iTCO_vendor_support dax cryptd ipmi_devintf hpwdt lpc_ich hpilo pcspkr i2c_i801 ioatdma sg ipmi_msghandler mfd_core wmi shpchp dca nfit acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs sr_mod cdrom sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea [ 72.326921] sysfillrect sysimgblt fb_sys_fops ttm bnx2x mdio ahci tg3 drm hpsa libahci ptp nvme libata pps_core i2c_core libcrc32c crc32c_intel scsi_transport_sas nvme_core dm_mirror dm_region_hash dm_log dm_mod [ 72.411729] CPU: 39 PID: 1039 Comm: kworker/u145:2 Not tainted 4.11.0-rc1+ #25 [ 72.444221] Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 10/05/2016 [ 72.486852] Workqueue: events_unbound async_run_entry_fn [ 72.512078] Call Trace: [ 72.523223] dump_stack+0x63/0x87 [ 72.538230] __warn+0xd1/0xf0 [ 72.551520] warn_slowpath_fmt+0x5f/0x80 [ 72.569169] ? kernfs_find_and_get_ns+0x48/0x60 [ 72.589568] sysfs_remove_group+0x8e/0x90 [ 72.607495] dpm_sysfs_remove+0x57/0x60 [ 72.624715] device_del+0x111/0x350 [ 72.640292] ? pick_next_task_fair+0x122/0x540 [ 72.660482] device_unregister+0x1a/0x60 [ 72.678191] nd_async_device_unregister+0x22/0x30 [ 72.699306] async_run_entry_fn+0x39/0x170 [ 72.717642] process_one_work+0x165/0x410 [ 72.735566] worker_thread+0x137/0x4c0 [ 72.752370] kthread+0x101/0x140 [ 72.766449] ? rescuer_thread+0x3b0/0x3b0 [ 72.784357] ? kthread_park+0x90/0x90 [ 72.800810] ret_from_fork+0x2c/0x40 [ 72.816960] --[ end trace cb734114d5364186 ]-- [ 72.837661] BUG: unable to handle kernel NULL pointer dereference at 0020 [ 72.872912] IP: klist_put+0x1b/0x90 [ 72.31] PGD 0 [ 72.31] [ 72.904390] Oops: 1 SMP [ 72.918698] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul vfat ghash_clmulni_intel fat pcbc aesni_intel nd_pmem crypto_simd ipmi_si glue_helper dax_pmem iTCO_wdt nd_btt iTCO_vendor_support dax cryptd ipmi_devintf hpwdt lpc_ich hpilo pcspkr i2c_i801 ioatdma sg ipmi_msghandler mfd_core wmi shpchp dca nfit acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs sr_mod cdrom sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea [ 73.245742] sysfillrect sysimgblt fb_sys_fops ttm bnx2x mdio ahci tg3 drm hpsa libahci ptp nvme libata pps_core i2c_core libcrc32c crc32c_intel scsi_transport_sas nvme_core dm_mirror dm_region_hash dm_log dm_mod [ 73.331485] CPU: 39 PID: 1039 Comm: kworker/u145:2 Tainted: G W 4.11.0-rc1+ #25 [ 73.370525] Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 10/05/2016 [ 73.408108] Workqueue: events_unbound async_run_entry_fn [ 73.432199] task: 88046523 task.stack: c900061f4000 [ 73.458817] RIP: 0010:klist_put+0x1b/0x90 [ 73.479280] RSP: 0018:c900061f7d68 EFLAGS: 00010246 [ 73.507441] RAX: 88017ccd4b40 RBX: RCX: [ 73.539498] RDX: RSI: 0001 RDI: [ 73.571576] RBP: c900061f7d88 R08: fffe R09: 00047417 [ 73.603526] R10: 0005 R11: 00047416 R12: 88017ccd4b68 [ 73.635479] R13: 88017d558800 R14: 0001 R15: 880469cf44b0 [ 73.667484] FS: ()
[Kernel-packages] [Bug 1704437] Re: [Bug] Crystal Ridge - libnvdimm: debug create-namespace sysfs collision report from Jeff
** Package changed: ubuntu => linux (Ubuntu) ** Description changed: Description If you run ndctl create-namespace simultaneously on every cpu on a 72 cpu system, for exactly the same namespace, things break. Shocking, I know. I've attached the script and the panic. Not high priority, I don't think. I tried to attach the shell script, but your email server rejected it. So, it's inline below. Search for thrash-ns.sh. Cheers, Jeff [ 71.912227] ---[ cut here ]--- [ 71.935600] WARNING: CPU: 39 PID: 1039 at fs/sysfs/group.c:237 sysfs_remove_group+0x8e/0x90 [ 71.978797] sysfs group 'power' not found for kobject 'btt0.1' [ 72.004940] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul vfat ghash_clmulni_intel fat pcbc aesni_intel nd_pmem crypto_simd ipmi_si glue_helper dax_pmem iTCO_wdt nd_btt iTCO_vendor_support dax cryptd ipmi_devintf hpwdt lpc_ich hpilo pcspkr i2c_i801 ioatdma sg ipmi_msghandler mfd_core wmi shpchp dca nfit acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs sr_mod cdrom sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea [ 72.326921] sysfillrect sysimgblt fb_sys_fops ttm bnx2x mdio ahci tg3 drm hpsa libahci ptp nvme libata pps_core i2c_core libcrc32c crc32c_intel scsi_transport_sas nvme_core dm_mirror dm_region_hash dm_log dm_mod [ 72.411729] CPU: 39 PID: 1039 Comm: kworker/u145:2 Not tainted 4.11.0-rc1+ #25 [ 72.444221] Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 10/05/2016 [ 72.486852] Workqueue: events_unbound async_run_entry_fn [ 72.512078] Call Trace: [ 72.523223] dump_stack+0x63/0x87 [ 72.538230] __warn+0xd1/0xf0 [ 72.551520] warn_slowpath_fmt+0x5f/0x80 [ 72.569169] ? kernfs_find_and_get_ns+0x48/0x60 [ 72.589568] sysfs_remove_group+0x8e/0x90 [ 72.607495] dpm_sysfs_remove+0x57/0x60 [ 72.624715] device_del+0x111/0x350 [ 72.640292] ? pick_next_task_fair+0x122/0x540 [ 72.660482] device_unregister+0x1a/0x60 [ 72.678191] nd_async_device_unregister+0x22/0x30 [ 72.699306] async_run_entry_fn+0x39/0x170 [ 72.717642] process_one_work+0x165/0x410 [ 72.735566] worker_thread+0x137/0x4c0 [ 72.752370] kthread+0x101/0x140 [ 72.766449] ? rescuer_thread+0x3b0/0x3b0 [ 72.784357] ? kthread_park+0x90/0x90 [ 72.800810] ret_from_fork+0x2c/0x40 [ 72.816960] --[ end trace cb734114d5364186 ]-- [ 72.837661] BUG: unable to handle kernel NULL pointer dereference at 0020 [ 72.872912] IP: klist_put+0x1b/0x90 [ 72.31] PGD 0 [ 72.31] [ 72.904390] Oops: 1 SMP [ 72.918698] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul vfat ghash_clmulni_intel fat pcbc aesni_intel nd_pmem crypto_simd ipmi_si glue_helper dax_pmem iTCO_wdt nd_btt iTCO_vendor_support dax cryptd ipmi_devintf hpwdt lpc_ich hpilo pcspkr i2c_i801 ioatdma sg ipmi_msghandler mfd_core wmi shpchp dca nfit acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs sr_mod cdrom sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea [ 73.245742] sysfillrect sysimgblt fb_sys_fops ttm bnx2x mdio ahci tg3 drm hpsa libahci ptp nvme libata pps_core i2c_core libcrc32c crc32c_intel scsi_transport_sas nvme_core dm_mirror dm_region_hash dm_log dm_mod [ 73.331485] CPU: 39 PID: 1039 Comm: kworker/u145:2 Tainted: G W 4.11.0-rc1+ #25 [ 73.370525] Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 10/05/2016 [ 73.408108] Workqueue: events_unbound async_run_entry_fn [ 73.432199] task: 88046523 task.stack: c900061f4000 [ 73.458817] RIP: 0010:klist_put+0x1b/0x90 [ 73.479280] RSP: 0018:c900061f7d68 EFLAGS: 00010246 [ 73.507441] RAX: 88017ccd4b40 RBX: RCX: [ 73.539498] RDX: RSI: 0001 RDI: [ 73.571576] RBP: c900061f7d88 R08: fffe R09: 00047417 [ 73.603526] R10: 0005 R11: 00047416 R12: 88017ccd4b68 [ 73.635479] R13: 88017d558800 R14: 0001 R15: 880469cf44b0 [ 73.667484]