[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang
Please check whether this patch fixes the issue: [1] [1] https://lore.kernel.org/lkml/1558711908-15688-1-git-send-email- suzuki.poul...@arm.com/ -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu. https://bugs.launchpad.net/bugs/1828131 Title: Qemu causes system hang Status in nvidia-graphics-drivers-418 package in Ubuntu: Confirmed Status in qemu package in Ubuntu: Incomplete Bug description: I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It opens the window, then the system freezes. I don't know if it's a bug in QEMU, libvirt, or virt-manager, or nvidia. Nvidia is working fine otherwise, Version: 430.09. This is easily reproduced every time I try starting a qemu machine, so I can run any diagnostics while this happens if that helps. This is from journalctl at the time of the freeze: May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: Class 0x0 Subchannel 0x0 Mismatch May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x4041b0=0x20 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x404000=0x8002 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ChID 0008, Class 902d, Offset 0860, Data May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 0008 intr 0200 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is of type FAULT_PTE ACCESS_TYPE_READ May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error recovery was successful. May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 000b intr 0200 May 07 19:36:03 ap kernel: Asynchronous wait on fence NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 [i915]) May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to keepalive timeout May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at f9a5c3ff8030 May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault] May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 May 07 19:36:41 ap kernel: Oops: [#1] SMP PTI May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U W OE 5.1.0-050100-generic #201905052130 May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, BIOS 1.10.1 04/26/2019 May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890 May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0 May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286 May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 003d May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI: RDI: a03dfc7cd1e0 May 07 19:36:41 ap kernel: RBP: b352435b3538 R08: R09: a03dfc7d5d00 May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 800ffe00 May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: f9a5c3ff8000 May 07 19:36:41 ap kernel: FS: 7f28c17fa700() GS:a03ddc3c() knlGS: May 07 19:36:41 ap kernel: CS: 0010 DS: ES: CR0: 80050033 May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 003626e0 May 07 19:36:41 ap kernel: DR0: DR1: DR2: May 07 19:36:41 ap kernel: DR3: DR6: fffe0ff0 DR7: 0400 May 07 19:36:41 ap kernel: Call Trace: May 07 19:36:41 ap kernel: migrate_pages+0x107/0xb40 May 07 19:36:41 ap kernel: ? move_freelist_tail+0xd0/0xd0 May 07 19:36:41 ap kernel: ? isolate_freepages_block+0x370/0x370 May 07 19:36:41 ap kernel: compact_zone+0x752/0xd70 May 07 19:36:41 ap kernel: compact_zone_order+0xd8/0x120 May 07 19:36:41 ap kernel: try_to_compact_pages+0xb0/0x260 May 07 19:36:41 ap
[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang
I don't think this is nvidia-related. I have the same on Arch with Intel card only. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu. https://bugs.launchpad.net/bugs/1828131 Title: Qemu causes system hang Status in nvidia-graphics-drivers-418 package in Ubuntu: Confirmed Status in qemu package in Ubuntu: Incomplete Bug description: I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It opens the window, then the system freezes. I don't know if it's a bug in QEMU, libvirt, or virt-manager, or nvidia. Nvidia is working fine otherwise, Version: 430.09. This is easily reproduced every time I try starting a qemu machine, so I can run any diagnostics while this happens if that helps. This is from journalctl at the time of the freeze: May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: Class 0x0 Subchannel 0x0 Mismatch May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x4041b0=0x20 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x404000=0x8002 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ChID 0008, Class 902d, Offset 0860, Data May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 0008 intr 0200 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is of type FAULT_PTE ACCESS_TYPE_READ May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error recovery was successful. May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 000b intr 0200 May 07 19:36:03 ap kernel: Asynchronous wait on fence NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 [i915]) May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to keepalive timeout May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at f9a5c3ff8030 May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault] May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 May 07 19:36:41 ap kernel: Oops: [#1] SMP PTI May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U W OE 5.1.0-050100-generic #201905052130 May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, BIOS 1.10.1 04/26/2019 May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890 May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0 May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286 May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 003d May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI: RDI: a03dfc7cd1e0 May 07 19:36:41 ap kernel: RBP: b352435b3538 R08: R09: a03dfc7d5d00 May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 800ffe00 May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: f9a5c3ff8000 May 07 19:36:41 ap kernel: FS: 7f28c17fa700() GS:a03ddc3c() knlGS: May 07 19:36:41 ap kernel: CS: 0010 DS: ES: CR0: 80050033 May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 003626e0 May 07 19:36:41 ap kernel: DR0: DR1: DR2: May 07 19:36:41 ap kernel: DR3: DR6: fffe0ff0 DR7: 0400 May 07 19:36:41 ap kernel: Call Trace: May 07 19:36:41 ap kernel: migrate_pages+0x107/0xb40 May 07 19:36:41 ap kernel: ? move_freelist_tail+0xd0/0xd0 May 07 19:36:41 ap kernel: ? isolate_freepages_block+0x370/0x370 May 07 19:36:41 ap kernel: compact_zone+0x752/0xd70 May 07 19:36:41 ap kernel: compact_zone_order+0xd8/0x120 May 07 19:36:41 ap kernel: try_to_compact_pages+0xb0/0x260 May 07 19:36:41 ap kernel: __alloc_pages_direct_compact+0x8c/0x170 May 07
[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: nvidia-graphics-drivers-418 (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu. https://bugs.launchpad.net/bugs/1828131 Title: Qemu causes system hang Status in nvidia-graphics-drivers-418 package in Ubuntu: Confirmed Status in qemu package in Ubuntu: Incomplete Bug description: I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It opens the window, then the system freezes. I don't know if it's a bug in QEMU, libvirt, or virt-manager, or nvidia. Nvidia is working fine otherwise, Version: 430.09. This is easily reproduced every time I try starting a qemu machine, so I can run any diagnostics while this happens if that helps. This is from journalctl at the time of the freeze: May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: Class 0x0 Subchannel 0x0 Mismatch May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x4041b0=0x20 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x404000=0x8002 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ChID 0008, Class 902d, Offset 0860, Data May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 0008 intr 0200 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is of type FAULT_PTE ACCESS_TYPE_READ May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error recovery was successful. May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 000b intr 0200 May 07 19:36:03 ap kernel: Asynchronous wait on fence NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 [i915]) May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to keepalive timeout May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at f9a5c3ff8030 May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault] May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 May 07 19:36:41 ap kernel: Oops: [#1] SMP PTI May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U W OE 5.1.0-050100-generic #201905052130 May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, BIOS 1.10.1 04/26/2019 May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890 May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0 May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286 May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 003d May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI: RDI: a03dfc7cd1e0 May 07 19:36:41 ap kernel: RBP: b352435b3538 R08: R09: a03dfc7d5d00 May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 800ffe00 May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: f9a5c3ff8000 May 07 19:36:41 ap kernel: FS: 7f28c17fa700() GS:a03ddc3c() knlGS: May 07 19:36:41 ap kernel: CS: 0010 DS: ES: CR0: 80050033 May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 003626e0 May 07 19:36:41 ap kernel: DR0: DR1: DR2: May 07 19:36:41 ap kernel: DR3: DR6: fffe0ff0 DR7: 0400 May 07 19:36:41 ap kernel: Call Trace: May 07 19:36:41 ap kernel: migrate_pages+0x107/0xb40 May 07 19:36:41 ap kernel: ? move_freelist_tail+0xd0/0xd0 May 07 19:36:41 ap kernel: ? isolate_freepages_block+0x370/0x370 May 07 19:36:41 ap kernel: compact_zone+0x752/0xd70 May 07 19:36:41 ap kernel: compact_zone_order+0xd8/0x120 May 07 19:36:41 ap kernel: try_to_compact_pages+0xb0/0x260 May 07
[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang
Yes, had the same issue on 418. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu. https://bugs.launchpad.net/bugs/1828131 Title: Qemu causes system hang Status in nvidia-graphics-drivers-418 package in Ubuntu: New Status in qemu package in Ubuntu: Incomplete Bug description: I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It opens the window, then the system freezes. I don't know if it's a bug in QEMU, libvirt, or virt-manager, or nvidia. Nvidia is working fine otherwise, Version: 430.09. This is easily reproduced every time I try starting a qemu machine, so I can run any diagnostics while this happens if that helps. This is from journalctl at the time of the freeze: May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: Class 0x0 Subchannel 0x0 Mismatch May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x4041b0=0x20 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x404000=0x8002 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ChID 0008, Class 902d, Offset 0860, Data May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 0008 intr 0200 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is of type FAULT_PTE ACCESS_TYPE_READ May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error recovery was successful. May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 000b intr 0200 May 07 19:36:03 ap kernel: Asynchronous wait on fence NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 [i915]) May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to keepalive timeout May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at f9a5c3ff8030 May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault] May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 May 07 19:36:41 ap kernel: Oops: [#1] SMP PTI May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U W OE 5.1.0-050100-generic #201905052130 May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, BIOS 1.10.1 04/26/2019 May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890 May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0 May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286 May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 003d May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI: RDI: a03dfc7cd1e0 May 07 19:36:41 ap kernel: RBP: b352435b3538 R08: R09: a03dfc7d5d00 May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 800ffe00 May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: f9a5c3ff8000 May 07 19:36:41 ap kernel: FS: 7f28c17fa700() GS:a03ddc3c() knlGS: May 07 19:36:41 ap kernel: CS: 0010 DS: ES: CR0: 80050033 May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 003626e0 May 07 19:36:41 ap kernel: DR0: DR1: DR2: May 07 19:36:41 ap kernel: DR3: DR6: fffe0ff0 DR7: 0400 May 07 19:36:41 ap kernel: Call Trace: May 07 19:36:41 ap kernel: migrate_pages+0x107/0xb40 May 07 19:36:41 ap kernel: ? move_freelist_tail+0xd0/0xd0 May 07 19:36:41 ap kernel: ? isolate_freepages_block+0x370/0x370 May 07 19:36:41 ap kernel: compact_zone+0x752/0xd70 May 07 19:36:41 ap kernel: compact_zone_order+0xd8/0x120 May 07 19:36:41 ap kernel: try_to_compact_pages+0xb0/0x260 May 07 19:36:41 ap kernel: __alloc_pages_direct_compact+0x8c/0x170 May 07 19:36:41 ap kernel: __alloc_pages_slowpath+0x4b3/0xeb0 May
[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang
Since 430 is still from a PPA https://launchpad.net/~graphics- drivers/+archive/ubuntu/ I used 418 being the closest one. Do you hit the same with 418? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu. https://bugs.launchpad.net/bugs/1828131 Title: Qemu causes system hang Status in nvidia-graphics-drivers-418 package in Ubuntu: New Status in qemu package in Ubuntu: Incomplete Bug description: I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It opens the window, then the system freezes. I don't know if it's a bug in QEMU, libvirt, or virt-manager, or nvidia. Nvidia is working fine otherwise, Version: 430.09. This is easily reproduced every time I try starting a qemu machine, so I can run any diagnostics while this happens if that helps. This is from journalctl at the time of the freeze: May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: Class 0x0 Subchannel 0x0 Mismatch May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x4041b0=0x20 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x404000=0x8002 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ChID 0008, Class 902d, Offset 0860, Data May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 0008 intr 0200 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is of type FAULT_PTE ACCESS_TYPE_READ May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error recovery was successful. May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 000b intr 0200 May 07 19:36:03 ap kernel: Asynchronous wait on fence NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 [i915]) May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to keepalive timeout May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at f9a5c3ff8030 May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault] May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 May 07 19:36:41 ap kernel: Oops: [#1] SMP PTI May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U W OE 5.1.0-050100-generic #201905052130 May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, BIOS 1.10.1 04/26/2019 May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890 May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0 May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286 May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 003d May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI: RDI: a03dfc7cd1e0 May 07 19:36:41 ap kernel: RBP: b352435b3538 R08: R09: a03dfc7d5d00 May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 800ffe00 May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: f9a5c3ff8000 May 07 19:36:41 ap kernel: FS: 7f28c17fa700() GS:a03ddc3c() knlGS: May 07 19:36:41 ap kernel: CS: 0010 DS: ES: CR0: 80050033 May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 003626e0 May 07 19:36:41 ap kernel: DR0: DR1: DR2: May 07 19:36:41 ap kernel: DR3: DR6: fffe0ff0 DR7: 0400 May 07 19:36:41 ap kernel: Call Trace: May 07 19:36:41 ap kernel: migrate_pages+0x107/0xb40 May 07 19:36:41 ap kernel: ? move_freelist_tail+0xd0/0xd0 May 07 19:36:41 ap kernel: ? isolate_freepages_block+0x370/0x370 May 07 19:36:41 ap kernel: compact_zone+0x752/0xd70 May 07 19:36:41 ap kernel: compact_zone_order+0xd8/0x120 May 07 19:36:41 ap kernel: try_to_compact_pages+0xb0/0x260 May 07 19:36:41 ap
[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang
So I guess we should then file the bug against nvidia-driver-430 then? ** Also affects: nvidia-graphics-drivers-418 (Ubuntu) Importance: Undecided Status: New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu. https://bugs.launchpad.net/bugs/1828131 Title: Qemu causes system hang Status in nvidia-graphics-drivers-418 package in Ubuntu: New Status in qemu package in Ubuntu: Incomplete Bug description: I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It opens the window, then the system freezes. I don't know if it's a bug in QEMU, libvirt, or virt-manager, or nvidia. Nvidia is working fine otherwise, Version: 430.09. This is easily reproduced every time I try starting a qemu machine, so I can run any diagnostics while this happens if that helps. This is from journalctl at the time of the freeze: May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: Class 0x0 Subchannel 0x0 Mismatch May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x4041b0=0x20 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ESR 0x404000=0x8002 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics Exception: ChID 0008, Class 902d, Offset 0860, Data May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 0008 intr 0200 May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is of type FAULT_PTE ACCESS_TYPE_READ May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error recovery was successful. May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): recover... May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 000b intr 0200 May 07 19:36:03 ap kernel: Asynchronous wait on fence NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 [i915]) May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to keepalive timeout May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at f9a5c3ff8030 May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault] May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 May 07 19:36:41 ap kernel: Oops: [#1] SMP PTI May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U W OE 5.1.0-050100-generic #201905052130 May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, BIOS 1.10.1 04/26/2019 May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890 May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0 May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286 May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 003d May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI: RDI: a03dfc7cd1e0 May 07 19:36:41 ap kernel: RBP: b352435b3538 R08: R09: a03dfc7d5d00 May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 800ffe00 May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: f9a5c3ff8000 May 07 19:36:41 ap kernel: FS: 7f28c17fa700() GS:a03ddc3c() knlGS: May 07 19:36:41 ap kernel: CS: 0010 DS: ES: CR0: 80050033 May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 003626e0 May 07 19:36:41 ap kernel: DR0: DR1: DR2: May 07 19:36:41 ap kernel: DR3: DR6: fffe0ff0 DR7: 0400 May 07 19:36:41 ap kernel: Call Trace: May 07 19:36:41 ap kernel: migrate_pages+0x107/0xb40 May 07 19:36:41 ap kernel: ? move_freelist_tail+0xd0/0xd0 May 07 19:36:41 ap kernel: ? isolate_freepages_block+0x370/0x370 May 07 19:36:41 ap kernel: compact_zone+0x752/0xd70 May 07 19:36:41 ap kernel: compact_zone_order+0xd8/0x120 May 07 19:36:41 ap kernel: try_to_compact_pages+0xb0/0x260