[Kernel-packages] [Bug 1811470] Re: [witherspoon] removing module nouveau causes cpu hard lockup
** Tags removed: targetmilestone-inin18043 ** Tags added: targetmilestone-inin--- -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1811470 Title: [witherspoon] removing module nouveau causes cpu hard lockup Status in The Ubuntu-power-systems project: Won't Fix Status in linux package in Ubuntu: Won't Fix Bug description: Installed 18.04 and upgraded kernel to linux-image-generic-hwe-18.04 (4.18.0-13-generic #14~18.04.1-Ubuntu SMP Thu Dec 6 14:03:47). Copied gv100 firmware to /lib/firmware/nvidia, removed and reloaded nouveau (modprobe -r and modprobe). Tried to remove nouveau again using modprobe -r and I see the trace below. After a while the modprobe -r command completed the module was removed successfully. [ 618.185258] nouveau 0035:04:00.0: DRM: failed to idle channel 1 [DRM] [ 630.314599] watchdog: CPU 4 self-detected hard LOCKUP @ ioread32+0x2c/0x170 [ 630.314601] watchdog: CPU 4 TB:415266697100, last heartbeat TB:410146341428 (1ms ago) [ 630.314601] Modules linked in: nouveau(-) ofpart at24 cmdlinepart uio_pdrv_genirq ipmi_powernv ipmi_devintf powernv_flash uio mtd opal_prd ipmi_msghandler ibmpowernv vmx_crypto sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm crct10dif_vpmsum ahci crc32c_vpmsum tg3 libahci drm_panel_orientation_quirks [last unloaded: nouveau] [ 630.314629] CPU: 4 PID: 6623 Comm: modprobe Not tainted 4.18.0-13-generic #14~18.04.1-Ubuntu [ 630.314630] NIP: c0729afc LR: c0080f766990 CTR: c0729ad0 [ 630.314630] REGS: c03fffd87d80 TRAP: 0900 Not tainted (4.18.0-13-generic) [ 630.314631] MSR: 9280b033 CR: 44002824 XER: [ 630.314638] CFAR: c0080f7b2ac4 IRQMASK: 1 [ 630.314639] GPR00: c0080f7ad3f8 c03f6c1bb850 c178c200 c00c8e7e [ 630.314642] GPR04: 0008 [ 630.314644] GPR08: c03fa800 7fff c00c8e7e c0080f7b2ab0 [ 630.314647] GPR12: c0729ad0 c03fa800 0001 [ 630.314649] GPR16: 039a7cc60074 039a7cc3c978 [ 630.314652] GPR20: 039a7cc60070 78448f88 [ 630.314654] GPR24: 039a8bf40ee8 c0080f82dca0 c03fed072098 [ 630.314657] GPR28: c000203968a2c290 c000203968840908 c000203968840900 0001 [ 630.314660] NIP [c0729afc] ioread32+0x2c/0x170 [ 630.314660] LR [c0080f766990] nouveau_bo_rd32+0x48/0x70 [nouveau] [ 630.314661] Call Trace: [ 630.314662] [c03f6c1bb850] [c03f6c1bb890] 0xc03f6c1bb890 (unreliable) [ 630.314663] [c03f6c1bb880] [c03f6c1bb8b0] 0xc03f6c1bb8b0 [ 630.314664] [c03f6c1bb8a0] [c0080f7ad3f8] nv84_fence_read+0x40/0x60 [nouveau] [ 630.314666] [c03f6c1bb8c0] [c0080f7aab3c] nouveau_fence_update+0x44/0x100 [nouveau] [ 630.314667] [c03f6c1bb900] [c0080f7ab5d8] nouveau_fence_done+0x100/0x180 [nouveau] [ 630.314668] [c03f6c1bb940] [c0080f7ab8c8] nouveau_fence_wait+0x90/0x150 [nouveau] [ 630.314669] [c03f6c1bb970] [c0080f7a8f90] nouveau_channel_idle+0xd8/0x140 [nouveau] [ 630.314670] [c03f6c1bba00] [c0080f75f75c] nouveau_accel_fini+0x74/0xe0 [nouveau] [ 630.314671] [c03f6c1bba30] [c0080f75f8e8] nouveau_drm_unload+0x60/0x130 [nouveau] [ 630.314672] [c03f6c1bba60] [c00813b1b118] drm_dev_unregister+0x70/0x160 [drm] [ 630.314673] [c03f6c1bbaa0] [c00813b1b3f0] drm_put_dev+0x48/0xa0 [drm] [ 630.314675] [c03f6c1bbb10] [c0080f760f8c] nouveau_drm_device_remove+0x54/0x90 [nouveau] [ 630.314676] [c03f6c1bbb50] [c07a746c] pci_device_remove+0x6c/0x120 [ 630.314677] [c03f6c1bbb90] [c08a7014] device_release_driver_internal+0x294/0x380 [ 630.314678] [c03f6c1bbbe0] [c08a719c] driver_detach+0x7c/0x140 [ 630.314679] [c03f6c1bbc20] [c08a5304] bus_remove_driver+0x84/0x170 [ 630.314680] [c03f6c1bbc90] [c08a7ef8] driver_unregister+0x48/0x90 [ 630.314681] [c03f6c1bbd00] [c07a52b8] pci_unregister_driver+0x38/0x150 [ 630.314682] [c03f6c1bbd50] [c0080f7af048] nouveau_drm_exit+0x30/0xfc08 [nouveau] [ 630.314683] [c03f6c1bbd70] [c01e6b14] sys_delete_module+0x1d4/0x310 [ 630.314684] [c03f6c1bbe30] [c000b288] system_call+0x5c/0x70 [ 630.314685] Instruction dump: [ 630.314686]
[Kernel-packages] [Bug 1811470] Re: [witherspoon] removing module nouveau causes cpu hard lockup
** Changed in: linux (Ubuntu) Status: Incomplete => Won't Fix ** Changed in: ubuntu-power-systems Status: Triaged => Won't Fix -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1811470 Title: [witherspoon] removing module nouveau causes cpu hard lockup Status in The Ubuntu-power-systems project: Won't Fix Status in linux package in Ubuntu: Won't Fix Bug description: Installed 18.04 and upgraded kernel to linux-image-generic-hwe-18.04 (4.18.0-13-generic #14~18.04.1-Ubuntu SMP Thu Dec 6 14:03:47). Copied gv100 firmware to /lib/firmware/nvidia, removed and reloaded nouveau (modprobe -r and modprobe). Tried to remove nouveau again using modprobe -r and I see the trace below. After a while the modprobe -r command completed the module was removed successfully. [ 618.185258] nouveau 0035:04:00.0: DRM: failed to idle channel 1 [DRM] [ 630.314599] watchdog: CPU 4 self-detected hard LOCKUP @ ioread32+0x2c/0x170 [ 630.314601] watchdog: CPU 4 TB:415266697100, last heartbeat TB:410146341428 (1ms ago) [ 630.314601] Modules linked in: nouveau(-) ofpart at24 cmdlinepart uio_pdrv_genirq ipmi_powernv ipmi_devintf powernv_flash uio mtd opal_prd ipmi_msghandler ibmpowernv vmx_crypto sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm crct10dif_vpmsum ahci crc32c_vpmsum tg3 libahci drm_panel_orientation_quirks [last unloaded: nouveau] [ 630.314629] CPU: 4 PID: 6623 Comm: modprobe Not tainted 4.18.0-13-generic #14~18.04.1-Ubuntu [ 630.314630] NIP: c0729afc LR: c0080f766990 CTR: c0729ad0 [ 630.314630] REGS: c03fffd87d80 TRAP: 0900 Not tainted (4.18.0-13-generic) [ 630.314631] MSR: 9280b033 CR: 44002824 XER: [ 630.314638] CFAR: c0080f7b2ac4 IRQMASK: 1 [ 630.314639] GPR00: c0080f7ad3f8 c03f6c1bb850 c178c200 c00c8e7e [ 630.314642] GPR04: 0008 [ 630.314644] GPR08: c03fa800 7fff c00c8e7e c0080f7b2ab0 [ 630.314647] GPR12: c0729ad0 c03fa800 0001 [ 630.314649] GPR16: 039a7cc60074 039a7cc3c978 [ 630.314652] GPR20: 039a7cc60070 78448f88 [ 630.314654] GPR24: 039a8bf40ee8 c0080f82dca0 c03fed072098 [ 630.314657] GPR28: c000203968a2c290 c000203968840908 c000203968840900 0001 [ 630.314660] NIP [c0729afc] ioread32+0x2c/0x170 [ 630.314660] LR [c0080f766990] nouveau_bo_rd32+0x48/0x70 [nouveau] [ 630.314661] Call Trace: [ 630.314662] [c03f6c1bb850] [c03f6c1bb890] 0xc03f6c1bb890 (unreliable) [ 630.314663] [c03f6c1bb880] [c03f6c1bb8b0] 0xc03f6c1bb8b0 [ 630.314664] [c03f6c1bb8a0] [c0080f7ad3f8] nv84_fence_read+0x40/0x60 [nouveau] [ 630.314666] [c03f6c1bb8c0] [c0080f7aab3c] nouveau_fence_update+0x44/0x100 [nouveau] [ 630.314667] [c03f6c1bb900] [c0080f7ab5d8] nouveau_fence_done+0x100/0x180 [nouveau] [ 630.314668] [c03f6c1bb940] [c0080f7ab8c8] nouveau_fence_wait+0x90/0x150 [nouveau] [ 630.314669] [c03f6c1bb970] [c0080f7a8f90] nouveau_channel_idle+0xd8/0x140 [nouveau] [ 630.314670] [c03f6c1bba00] [c0080f75f75c] nouveau_accel_fini+0x74/0xe0 [nouveau] [ 630.314671] [c03f6c1bba30] [c0080f75f8e8] nouveau_drm_unload+0x60/0x130 [nouveau] [ 630.314672] [c03f6c1bba60] [c00813b1b118] drm_dev_unregister+0x70/0x160 [drm] [ 630.314673] [c03f6c1bbaa0] [c00813b1b3f0] drm_put_dev+0x48/0xa0 [drm] [ 630.314675] [c03f6c1bbb10] [c0080f760f8c] nouveau_drm_device_remove+0x54/0x90 [nouveau] [ 630.314676] [c03f6c1bbb50] [c07a746c] pci_device_remove+0x6c/0x120 [ 630.314677] [c03f6c1bbb90] [c08a7014] device_release_driver_internal+0x294/0x380 [ 630.314678] [c03f6c1bbbe0] [c08a719c] driver_detach+0x7c/0x140 [ 630.314679] [c03f6c1bbc20] [c08a5304] bus_remove_driver+0x84/0x170 [ 630.314680] [c03f6c1bbc90] [c08a7ef8] driver_unregister+0x48/0x90 [ 630.314681] [c03f6c1bbd00] [c07a52b8] pci_unregister_driver+0x38/0x150 [ 630.314682] [c03f6c1bbd50] [c0080f7af048] nouveau_drm_exit+0x30/0xfc08 [nouveau] [ 630.314683] [c03f6c1bbd70] [c01e6b14] sys_delete_module+0x1d4/0x310 [ 630.314684] [c03f6c1bbe30] [c000b288]
[Kernel-packages] [Bug 1811470] Re: [witherspoon] removing module nouveau causes cpu hard lockup
** Also affects: linux (Ubuntu) Importance: Undecided Status: New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1811470 Title: [witherspoon] removing module nouveau causes cpu hard lockup Status in The Ubuntu-power-systems project: Triaged Status in linux package in Ubuntu: Incomplete Bug description: Installed 18.04 and upgraded kernel to linux-image-generic-hwe-18.04 (4.18.0-13-generic #14~18.04.1-Ubuntu SMP Thu Dec 6 14:03:47). Copied gv100 firmware to /lib/firmware/nvidia, removed and reloaded nouveau (modprobe -r and modprobe). Tried to remove nouveau again using modprobe -r and I see the trace below. After a while the modprobe -r command completed the module was removed successfully. [ 618.185258] nouveau 0035:04:00.0: DRM: failed to idle channel 1 [DRM] [ 630.314599] watchdog: CPU 4 self-detected hard LOCKUP @ ioread32+0x2c/0x170 [ 630.314601] watchdog: CPU 4 TB:415266697100, last heartbeat TB:410146341428 (1ms ago) [ 630.314601] Modules linked in: nouveau(-) ofpart at24 cmdlinepart uio_pdrv_genirq ipmi_powernv ipmi_devintf powernv_flash uio mtd opal_prd ipmi_msghandler ibmpowernv vmx_crypto sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm crct10dif_vpmsum ahci crc32c_vpmsum tg3 libahci drm_panel_orientation_quirks [last unloaded: nouveau] [ 630.314629] CPU: 4 PID: 6623 Comm: modprobe Not tainted 4.18.0-13-generic #14~18.04.1-Ubuntu [ 630.314630] NIP: c0729afc LR: c0080f766990 CTR: c0729ad0 [ 630.314630] REGS: c03fffd87d80 TRAP: 0900 Not tainted (4.18.0-13-generic) [ 630.314631] MSR: 9280b033 CR: 44002824 XER: [ 630.314638] CFAR: c0080f7b2ac4 IRQMASK: 1 [ 630.314639] GPR00: c0080f7ad3f8 c03f6c1bb850 c178c200 c00c8e7e [ 630.314642] GPR04: 0008 [ 630.314644] GPR08: c03fa800 7fff c00c8e7e c0080f7b2ab0 [ 630.314647] GPR12: c0729ad0 c03fa800 0001 [ 630.314649] GPR16: 039a7cc60074 039a7cc3c978 [ 630.314652] GPR20: 039a7cc60070 78448f88 [ 630.314654] GPR24: 039a8bf40ee8 c0080f82dca0 c03fed072098 [ 630.314657] GPR28: c000203968a2c290 c000203968840908 c000203968840900 0001 [ 630.314660] NIP [c0729afc] ioread32+0x2c/0x170 [ 630.314660] LR [c0080f766990] nouveau_bo_rd32+0x48/0x70 [nouveau] [ 630.314661] Call Trace: [ 630.314662] [c03f6c1bb850] [c03f6c1bb890] 0xc03f6c1bb890 (unreliable) [ 630.314663] [c03f6c1bb880] [c03f6c1bb8b0] 0xc03f6c1bb8b0 [ 630.314664] [c03f6c1bb8a0] [c0080f7ad3f8] nv84_fence_read+0x40/0x60 [nouveau] [ 630.314666] [c03f6c1bb8c0] [c0080f7aab3c] nouveau_fence_update+0x44/0x100 [nouveau] [ 630.314667] [c03f6c1bb900] [c0080f7ab5d8] nouveau_fence_done+0x100/0x180 [nouveau] [ 630.314668] [c03f6c1bb940] [c0080f7ab8c8] nouveau_fence_wait+0x90/0x150 [nouveau] [ 630.314669] [c03f6c1bb970] [c0080f7a8f90] nouveau_channel_idle+0xd8/0x140 [nouveau] [ 630.314670] [c03f6c1bba00] [c0080f75f75c] nouveau_accel_fini+0x74/0xe0 [nouveau] [ 630.314671] [c03f6c1bba30] [c0080f75f8e8] nouveau_drm_unload+0x60/0x130 [nouveau] [ 630.314672] [c03f6c1bba60] [c00813b1b118] drm_dev_unregister+0x70/0x160 [drm] [ 630.314673] [c03f6c1bbaa0] [c00813b1b3f0] drm_put_dev+0x48/0xa0 [drm] [ 630.314675] [c03f6c1bbb10] [c0080f760f8c] nouveau_drm_device_remove+0x54/0x90 [nouveau] [ 630.314676] [c03f6c1bbb50] [c07a746c] pci_device_remove+0x6c/0x120 [ 630.314677] [c03f6c1bbb90] [c08a7014] device_release_driver_internal+0x294/0x380 [ 630.314678] [c03f6c1bbbe0] [c08a719c] driver_detach+0x7c/0x140 [ 630.314679] [c03f6c1bbc20] [c08a5304] bus_remove_driver+0x84/0x170 [ 630.314680] [c03f6c1bbc90] [c08a7ef8] driver_unregister+0x48/0x90 [ 630.314681] [c03f6c1bbd00] [c07a52b8] pci_unregister_driver+0x38/0x150 [ 630.314682] [c03f6c1bbd50] [c0080f7af048] nouveau_drm_exit+0x30/0xfc08 [nouveau] [ 630.314683] [c03f6c1bbd70] [c01e6b14] sys_delete_module+0x1d4/0x310 [ 630.314684] [c03f6c1bbe30] [c000b288] system_call+0x5c/0x70 [ 630.314685] Instruction dump: [ 630.314686]
[Kernel-packages] [Bug 1811470] Re: [witherspoon] removing module nouveau causes cpu hard lockup
** Also affects: linux Importance: Undecided Status: New ** Changed in: linux (Ubuntu) Importance: Undecided => Medium ** Changed in: linux (Ubuntu) Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) ** No longer affects: linux -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1811470 Title: [witherspoon] removing module nouveau causes cpu hard lockup Status in The Ubuntu-power-systems project: Triaged Status in linux package in Ubuntu: Incomplete Bug description: Installed 18.04 and upgraded kernel to linux-image-generic-hwe-18.04 (4.18.0-13-generic #14~18.04.1-Ubuntu SMP Thu Dec 6 14:03:47). Copied gv100 firmware to /lib/firmware/nvidia, removed and reloaded nouveau (modprobe -r and modprobe). Tried to remove nouveau again using modprobe -r and I see the trace below. After a while the modprobe -r command completed the module was removed successfully. [ 618.185258] nouveau 0035:04:00.0: DRM: failed to idle channel 1 [DRM] [ 630.314599] watchdog: CPU 4 self-detected hard LOCKUP @ ioread32+0x2c/0x170 [ 630.314601] watchdog: CPU 4 TB:415266697100, last heartbeat TB:410146341428 (1ms ago) [ 630.314601] Modules linked in: nouveau(-) ofpart at24 cmdlinepart uio_pdrv_genirq ipmi_powernv ipmi_devintf powernv_flash uio mtd opal_prd ipmi_msghandler ibmpowernv vmx_crypto sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm crct10dif_vpmsum ahci crc32c_vpmsum tg3 libahci drm_panel_orientation_quirks [last unloaded: nouveau] [ 630.314629] CPU: 4 PID: 6623 Comm: modprobe Not tainted 4.18.0-13-generic #14~18.04.1-Ubuntu [ 630.314630] NIP: c0729afc LR: c0080f766990 CTR: c0729ad0 [ 630.314630] REGS: c03fffd87d80 TRAP: 0900 Not tainted (4.18.0-13-generic) [ 630.314631] MSR: 9280b033 CR: 44002824 XER: [ 630.314638] CFAR: c0080f7b2ac4 IRQMASK: 1 [ 630.314639] GPR00: c0080f7ad3f8 c03f6c1bb850 c178c200 c00c8e7e [ 630.314642] GPR04: 0008 [ 630.314644] GPR08: c03fa800 7fff c00c8e7e c0080f7b2ab0 [ 630.314647] GPR12: c0729ad0 c03fa800 0001 [ 630.314649] GPR16: 039a7cc60074 039a7cc3c978 [ 630.314652] GPR20: 039a7cc60070 78448f88 [ 630.314654] GPR24: 039a8bf40ee8 c0080f82dca0 c03fed072098 [ 630.314657] GPR28: c000203968a2c290 c000203968840908 c000203968840900 0001 [ 630.314660] NIP [c0729afc] ioread32+0x2c/0x170 [ 630.314660] LR [c0080f766990] nouveau_bo_rd32+0x48/0x70 [nouveau] [ 630.314661] Call Trace: [ 630.314662] [c03f6c1bb850] [c03f6c1bb890] 0xc03f6c1bb890 (unreliable) [ 630.314663] [c03f6c1bb880] [c03f6c1bb8b0] 0xc03f6c1bb8b0 [ 630.314664] [c03f6c1bb8a0] [c0080f7ad3f8] nv84_fence_read+0x40/0x60 [nouveau] [ 630.314666] [c03f6c1bb8c0] [c0080f7aab3c] nouveau_fence_update+0x44/0x100 [nouveau] [ 630.314667] [c03f6c1bb900] [c0080f7ab5d8] nouveau_fence_done+0x100/0x180 [nouveau] [ 630.314668] [c03f6c1bb940] [c0080f7ab8c8] nouveau_fence_wait+0x90/0x150 [nouveau] [ 630.314669] [c03f6c1bb970] [c0080f7a8f90] nouveau_channel_idle+0xd8/0x140 [nouveau] [ 630.314670] [c03f6c1bba00] [c0080f75f75c] nouveau_accel_fini+0x74/0xe0 [nouveau] [ 630.314671] [c03f6c1bba30] [c0080f75f8e8] nouveau_drm_unload+0x60/0x130 [nouveau] [ 630.314672] [c03f6c1bba60] [c00813b1b118] drm_dev_unregister+0x70/0x160 [drm] [ 630.314673] [c03f6c1bbaa0] [c00813b1b3f0] drm_put_dev+0x48/0xa0 [drm] [ 630.314675] [c03f6c1bbb10] [c0080f760f8c] nouveau_drm_device_remove+0x54/0x90 [nouveau] [ 630.314676] [c03f6c1bbb50] [c07a746c] pci_device_remove+0x6c/0x120 [ 630.314677] [c03f6c1bbb90] [c08a7014] device_release_driver_internal+0x294/0x380 [ 630.314678] [c03f6c1bbbe0] [c08a719c] driver_detach+0x7c/0x140 [ 630.314679] [c03f6c1bbc20] [c08a5304] bus_remove_driver+0x84/0x170 [ 630.314680] [c03f6c1bbc90] [c08a7ef8] driver_unregister+0x48/0x90 [ 630.314681] [c03f6c1bbd00] [c07a52b8] pci_unregister_driver+0x38/0x150 [ 630.314682] [c03f6c1bbd50] [c0080f7af048] nouveau_drm_exit+0x30/0xfc08 [nouveau] [ 630.314683]
[Kernel-packages] [Bug 1811470] Re: [witherspoon] removing module nouveau causes cpu hard lockup
** Changed in: ubuntu-power-systems Assignee: bugproxy (bugproxy) => Canonical Kernel Team (canonical-kernel-team) ** Changed in: ubuntu-power-systems Status: New => Triaged -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1811470 Title: [witherspoon] removing module nouveau causes cpu hard lockup Status in The Ubuntu-power-systems project: Triaged Status in linux package in Ubuntu: Incomplete Bug description: Installed 18.04 and upgraded kernel to linux-image-generic-hwe-18.04 (4.18.0-13-generic #14~18.04.1-Ubuntu SMP Thu Dec 6 14:03:47). Copied gv100 firmware to /lib/firmware/nvidia, removed and reloaded nouveau (modprobe -r and modprobe). Tried to remove nouveau again using modprobe -r and I see the trace below. After a while the modprobe -r command completed the module was removed successfully. [ 618.185258] nouveau 0035:04:00.0: DRM: failed to idle channel 1 [DRM] [ 630.314599] watchdog: CPU 4 self-detected hard LOCKUP @ ioread32+0x2c/0x170 [ 630.314601] watchdog: CPU 4 TB:415266697100, last heartbeat TB:410146341428 (1ms ago) [ 630.314601] Modules linked in: nouveau(-) ofpart at24 cmdlinepart uio_pdrv_genirq ipmi_powernv ipmi_devintf powernv_flash uio mtd opal_prd ipmi_msghandler ibmpowernv vmx_crypto sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm crct10dif_vpmsum ahci crc32c_vpmsum tg3 libahci drm_panel_orientation_quirks [last unloaded: nouveau] [ 630.314629] CPU: 4 PID: 6623 Comm: modprobe Not tainted 4.18.0-13-generic #14~18.04.1-Ubuntu [ 630.314630] NIP: c0729afc LR: c0080f766990 CTR: c0729ad0 [ 630.314630] REGS: c03fffd87d80 TRAP: 0900 Not tainted (4.18.0-13-generic) [ 630.314631] MSR: 9280b033 CR: 44002824 XER: [ 630.314638] CFAR: c0080f7b2ac4 IRQMASK: 1 [ 630.314639] GPR00: c0080f7ad3f8 c03f6c1bb850 c178c200 c00c8e7e [ 630.314642] GPR04: 0008 [ 630.314644] GPR08: c03fa800 7fff c00c8e7e c0080f7b2ab0 [ 630.314647] GPR12: c0729ad0 c03fa800 0001 [ 630.314649] GPR16: 039a7cc60074 039a7cc3c978 [ 630.314652] GPR20: 039a7cc60070 78448f88 [ 630.314654] GPR24: 039a8bf40ee8 c0080f82dca0 c03fed072098 [ 630.314657] GPR28: c000203968a2c290 c000203968840908 c000203968840900 0001 [ 630.314660] NIP [c0729afc] ioread32+0x2c/0x170 [ 630.314660] LR [c0080f766990] nouveau_bo_rd32+0x48/0x70 [nouveau] [ 630.314661] Call Trace: [ 630.314662] [c03f6c1bb850] [c03f6c1bb890] 0xc03f6c1bb890 (unreliable) [ 630.314663] [c03f6c1bb880] [c03f6c1bb8b0] 0xc03f6c1bb8b0 [ 630.314664] [c03f6c1bb8a0] [c0080f7ad3f8] nv84_fence_read+0x40/0x60 [nouveau] [ 630.314666] [c03f6c1bb8c0] [c0080f7aab3c] nouveau_fence_update+0x44/0x100 [nouveau] [ 630.314667] [c03f6c1bb900] [c0080f7ab5d8] nouveau_fence_done+0x100/0x180 [nouveau] [ 630.314668] [c03f6c1bb940] [c0080f7ab8c8] nouveau_fence_wait+0x90/0x150 [nouveau] [ 630.314669] [c03f6c1bb970] [c0080f7a8f90] nouveau_channel_idle+0xd8/0x140 [nouveau] [ 630.314670] [c03f6c1bba00] [c0080f75f75c] nouveau_accel_fini+0x74/0xe0 [nouveau] [ 630.314671] [c03f6c1bba30] [c0080f75f8e8] nouveau_drm_unload+0x60/0x130 [nouveau] [ 630.314672] [c03f6c1bba60] [c00813b1b118] drm_dev_unregister+0x70/0x160 [drm] [ 630.314673] [c03f6c1bbaa0] [c00813b1b3f0] drm_put_dev+0x48/0xa0 [drm] [ 630.314675] [c03f6c1bbb10] [c0080f760f8c] nouveau_drm_device_remove+0x54/0x90 [nouveau] [ 630.314676] [c03f6c1bbb50] [c07a746c] pci_device_remove+0x6c/0x120 [ 630.314677] [c03f6c1bbb90] [c08a7014] device_release_driver_internal+0x294/0x380 [ 630.314678] [c03f6c1bbbe0] [c08a719c] driver_detach+0x7c/0x140 [ 630.314679] [c03f6c1bbc20] [c08a5304] bus_remove_driver+0x84/0x170 [ 630.314680] [c03f6c1bbc90] [c08a7ef8] driver_unregister+0x48/0x90 [ 630.314681] [c03f6c1bbd00] [c07a52b8] pci_unregister_driver+0x38/0x150 [ 630.314682] [c03f6c1bbd50] [c0080f7af048] nouveau_drm_exit+0x30/0xfc08 [nouveau] [ 630.314683] [c03f6c1bbd70] [c01e6b14] sys_delete_module+0x1d4/0x310 [ 630.314684]