[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang

2019-05-25 Thread post-factum
Please check whether this patch fixes the issue: [1]

[1] https://lore.kernel.org/lkml/1558711908-15688-1-git-send-email-
suzuki.poul...@arm.com/

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu.
https://bugs.launchpad.net/bugs/1828131

Title:
  Qemu causes system hang

Status in nvidia-graphics-drivers-418 package in Ubuntu:
  Confirmed
Status in qemu package in Ubuntu:
  Incomplete

Bug description:
  I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a
  Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It
  opens the window, then the system freezes.

  I don't know if it's a bug in QEMU, libvirt, or virt-manager, or
  nvidia. Nvidia is working fine otherwise, Version: 430.09.

  This is easily reproduced every time I try starting a qemu machine, so
  I can run any diagnostics while this happens if that helps.

  This is from journalctl at the time of the freeze:

  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: 
GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: Class 0x0 Subchannel 0x0 Mismatch
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x4041b0=0x20
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x404000=0x8002
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ChID 0008, Class 902d, Offset 0860, Data 
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
0008 intr 0200
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 
5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is 
of type FAULT_PTE ACCESS_TYPE_READ
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error 
recovery was successful.
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
000b intr 0200
  May 07 19:36:03 ap kernel: Asynchronous wait on fence 
NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 
[i915])
  May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to 
keepalive timeout
  May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at 
f9a5c3ff8030
  May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault]
  May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 
  May 07 19:36:41 ap kernel: Oops:  [#1] SMP PTI
  May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U  W 
 OE 5.1.0-050100-generic #201905052130
  May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, 
BIOS 1.10.1 04/26/2019
  May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890
  May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b 
e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 
44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0
  May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286
  May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 
003d
  May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI:  RDI: 
a03dfc7cd1e0
  May 07 19:36:41 ap kernel: RBP: b352435b3538 R08:  R09: 
a03dfc7d5d00
  May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 
800ffe00
  May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: 
f9a5c3ff8000
  May 07 19:36:41 ap kernel: FS:  7f28c17fa700() 
GS:a03ddc3c() knlGS:
  May 07 19:36:41 ap kernel: CS:  0010 DS:  ES:  CR0: 80050033
  May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 
003626e0
  May 07 19:36:41 ap kernel: DR0:  DR1:  DR2: 

  May 07 19:36:41 ap kernel: DR3:  DR6: fffe0ff0 DR7: 
0400
  May 07 19:36:41 ap kernel: Call Trace:
  May 07 19:36:41 ap kernel:  migrate_pages+0x107/0xb40
  May 07 19:36:41 ap kernel:  ? move_freelist_tail+0xd0/0xd0
  May 07 19:36:41 ap kernel:  ? isolate_freepages_block+0x370/0x370
  May 07 19:36:41 ap kernel:  compact_zone+0x752/0xd70
  May 07 19:36:41 ap kernel:  compact_zone_order+0xd8/0x120
  May 07 19:36:41 ap kernel:  try_to_compact_pages+0xb0/0x260
  May 07 19:36:41 ap 

[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang

2019-05-12 Thread post-factum
I don't think this is nvidia-related. I have the same on Arch with Intel
card only.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu.
https://bugs.launchpad.net/bugs/1828131

Title:
  Qemu causes system hang

Status in nvidia-graphics-drivers-418 package in Ubuntu:
  Confirmed
Status in qemu package in Ubuntu:
  Incomplete

Bug description:
  I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a
  Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It
  opens the window, then the system freezes.

  I don't know if it's a bug in QEMU, libvirt, or virt-manager, or
  nvidia. Nvidia is working fine otherwise, Version: 430.09.

  This is easily reproduced every time I try starting a qemu machine, so
  I can run any diagnostics while this happens if that helps.

  This is from journalctl at the time of the freeze:

  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: 
GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: Class 0x0 Subchannel 0x0 Mismatch
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x4041b0=0x20
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x404000=0x8002
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ChID 0008, Class 902d, Offset 0860, Data 
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
0008 intr 0200
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 
5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is 
of type FAULT_PTE ACCESS_TYPE_READ
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error 
recovery was successful.
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
000b intr 0200
  May 07 19:36:03 ap kernel: Asynchronous wait on fence 
NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 
[i915])
  May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to 
keepalive timeout
  May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at 
f9a5c3ff8030
  May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault]
  May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 
  May 07 19:36:41 ap kernel: Oops:  [#1] SMP PTI
  May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U  W 
 OE 5.1.0-050100-generic #201905052130
  May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, 
BIOS 1.10.1 04/26/2019
  May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890
  May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b 
e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 
44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0
  May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286
  May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 
003d
  May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI:  RDI: 
a03dfc7cd1e0
  May 07 19:36:41 ap kernel: RBP: b352435b3538 R08:  R09: 
a03dfc7d5d00
  May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 
800ffe00
  May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: 
f9a5c3ff8000
  May 07 19:36:41 ap kernel: FS:  7f28c17fa700() 
GS:a03ddc3c() knlGS:
  May 07 19:36:41 ap kernel: CS:  0010 DS:  ES:  CR0: 80050033
  May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 
003626e0
  May 07 19:36:41 ap kernel: DR0:  DR1:  DR2: 

  May 07 19:36:41 ap kernel: DR3:  DR6: fffe0ff0 DR7: 
0400
  May 07 19:36:41 ap kernel: Call Trace:
  May 07 19:36:41 ap kernel:  migrate_pages+0x107/0xb40
  May 07 19:36:41 ap kernel:  ? move_freelist_tail+0xd0/0xd0
  May 07 19:36:41 ap kernel:  ? isolate_freepages_block+0x370/0x370
  May 07 19:36:41 ap kernel:  compact_zone+0x752/0xd70
  May 07 19:36:41 ap kernel:  compact_zone_order+0xd8/0x120
  May 07 19:36:41 ap kernel:  try_to_compact_pages+0xb0/0x260
  May 07 19:36:41 ap kernel:  __alloc_pages_direct_compact+0x8c/0x170
  May 07 

[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang

2019-05-12 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users.

** Changed in: nvidia-graphics-drivers-418 (Ubuntu)
   Status: New => Confirmed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu.
https://bugs.launchpad.net/bugs/1828131

Title:
  Qemu causes system hang

Status in nvidia-graphics-drivers-418 package in Ubuntu:
  Confirmed
Status in qemu package in Ubuntu:
  Incomplete

Bug description:
  I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a
  Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It
  opens the window, then the system freezes.

  I don't know if it's a bug in QEMU, libvirt, or virt-manager, or
  nvidia. Nvidia is working fine otherwise, Version: 430.09.

  This is easily reproduced every time I try starting a qemu machine, so
  I can run any diagnostics while this happens if that helps.

  This is from journalctl at the time of the freeze:

  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: 
GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: Class 0x0 Subchannel 0x0 Mismatch
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x4041b0=0x20
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x404000=0x8002
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ChID 0008, Class 902d, Offset 0860, Data 
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
0008 intr 0200
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 
5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is 
of type FAULT_PTE ACCESS_TYPE_READ
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error 
recovery was successful.
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
000b intr 0200
  May 07 19:36:03 ap kernel: Asynchronous wait on fence 
NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 
[i915])
  May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to 
keepalive timeout
  May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at 
f9a5c3ff8030
  May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault]
  May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 
  May 07 19:36:41 ap kernel: Oops:  [#1] SMP PTI
  May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U  W 
 OE 5.1.0-050100-generic #201905052130
  May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, 
BIOS 1.10.1 04/26/2019
  May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890
  May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b 
e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 
44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0
  May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286
  May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 
003d
  May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI:  RDI: 
a03dfc7cd1e0
  May 07 19:36:41 ap kernel: RBP: b352435b3538 R08:  R09: 
a03dfc7d5d00
  May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 
800ffe00
  May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: 
f9a5c3ff8000
  May 07 19:36:41 ap kernel: FS:  7f28c17fa700() 
GS:a03ddc3c() knlGS:
  May 07 19:36:41 ap kernel: CS:  0010 DS:  ES:  CR0: 80050033
  May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 
003626e0
  May 07 19:36:41 ap kernel: DR0:  DR1:  DR2: 

  May 07 19:36:41 ap kernel: DR3:  DR6: fffe0ff0 DR7: 
0400
  May 07 19:36:41 ap kernel: Call Trace:
  May 07 19:36:41 ap kernel:  migrate_pages+0x107/0xb40
  May 07 19:36:41 ap kernel:  ? move_freelist_tail+0xd0/0xd0
  May 07 19:36:41 ap kernel:  ? isolate_freepages_block+0x370/0x370
  May 07 19:36:41 ap kernel:  compact_zone+0x752/0xd70
  May 07 19:36:41 ap kernel:  compact_zone_order+0xd8/0x120
  May 07 19:36:41 ap kernel:  try_to_compact_pages+0xb0/0x260
  May 07 

[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang

2019-05-10 Thread Avi Eis
Yes, had the same issue on 418.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu.
https://bugs.launchpad.net/bugs/1828131

Title:
  Qemu causes system hang

Status in nvidia-graphics-drivers-418 package in Ubuntu:
  New
Status in qemu package in Ubuntu:
  Incomplete

Bug description:
  I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a
  Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It
  opens the window, then the system freezes.

  I don't know if it's a bug in QEMU, libvirt, or virt-manager, or
  nvidia. Nvidia is working fine otherwise, Version: 430.09.

  This is easily reproduced every time I try starting a qemu machine, so
  I can run any diagnostics while this happens if that helps.

  This is from journalctl at the time of the freeze:

  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: 
GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: Class 0x0 Subchannel 0x0 Mismatch
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x4041b0=0x20
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x404000=0x8002
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ChID 0008, Class 902d, Offset 0860, Data 
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
0008 intr 0200
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 
5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is 
of type FAULT_PTE ACCESS_TYPE_READ
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error 
recovery was successful.
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
000b intr 0200
  May 07 19:36:03 ap kernel: Asynchronous wait on fence 
NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 
[i915])
  May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to 
keepalive timeout
  May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at 
f9a5c3ff8030
  May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault]
  May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 
  May 07 19:36:41 ap kernel: Oops:  [#1] SMP PTI
  May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U  W 
 OE 5.1.0-050100-generic #201905052130
  May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, 
BIOS 1.10.1 04/26/2019
  May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890
  May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b 
e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 
44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0
  May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286
  May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 
003d
  May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI:  RDI: 
a03dfc7cd1e0
  May 07 19:36:41 ap kernel: RBP: b352435b3538 R08:  R09: 
a03dfc7d5d00
  May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 
800ffe00
  May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: 
f9a5c3ff8000
  May 07 19:36:41 ap kernel: FS:  7f28c17fa700() 
GS:a03ddc3c() knlGS:
  May 07 19:36:41 ap kernel: CS:  0010 DS:  ES:  CR0: 80050033
  May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 
003626e0
  May 07 19:36:41 ap kernel: DR0:  DR1:  DR2: 

  May 07 19:36:41 ap kernel: DR3:  DR6: fffe0ff0 DR7: 
0400
  May 07 19:36:41 ap kernel: Call Trace:
  May 07 19:36:41 ap kernel:  migrate_pages+0x107/0xb40
  May 07 19:36:41 ap kernel:  ? move_freelist_tail+0xd0/0xd0
  May 07 19:36:41 ap kernel:  ? isolate_freepages_block+0x370/0x370
  May 07 19:36:41 ap kernel:  compact_zone+0x752/0xd70
  May 07 19:36:41 ap kernel:  compact_zone_order+0xd8/0x120
  May 07 19:36:41 ap kernel:  try_to_compact_pages+0xb0/0x260
  May 07 19:36:41 ap kernel:  __alloc_pages_direct_compact+0x8c/0x170
  May 07 19:36:41 ap kernel:  __alloc_pages_slowpath+0x4b3/0xeb0
  May 

[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang

2019-05-10 Thread Christian Ehrhardt 
Since 430 is still from a PPA https://launchpad.net/~graphics-
drivers/+archive/ubuntu/ I used 418 being the closest one.

Do you hit the same with 418?

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu.
https://bugs.launchpad.net/bugs/1828131

Title:
  Qemu causes system hang

Status in nvidia-graphics-drivers-418 package in Ubuntu:
  New
Status in qemu package in Ubuntu:
  Incomplete

Bug description:
  I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a
  Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It
  opens the window, then the system freezes.

  I don't know if it's a bug in QEMU, libvirt, or virt-manager, or
  nvidia. Nvidia is working fine otherwise, Version: 430.09.

  This is easily reproduced every time I try starting a qemu machine, so
  I can run any diagnostics while this happens if that helps.

  This is from journalctl at the time of the freeze:

  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: 
GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: Class 0x0 Subchannel 0x0 Mismatch
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x4041b0=0x20
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x404000=0x8002
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ChID 0008, Class 902d, Offset 0860, Data 
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
0008 intr 0200
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 
5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is 
of type FAULT_PTE ACCESS_TYPE_READ
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error 
recovery was successful.
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
000b intr 0200
  May 07 19:36:03 ap kernel: Asynchronous wait on fence 
NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 
[i915])
  May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to 
keepalive timeout
  May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at 
f9a5c3ff8030
  May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault]
  May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 
  May 07 19:36:41 ap kernel: Oops:  [#1] SMP PTI
  May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U  W 
 OE 5.1.0-050100-generic #201905052130
  May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, 
BIOS 1.10.1 04/26/2019
  May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890
  May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b 
e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 
44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0
  May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286
  May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 
003d
  May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI:  RDI: 
a03dfc7cd1e0
  May 07 19:36:41 ap kernel: RBP: b352435b3538 R08:  R09: 
a03dfc7d5d00
  May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 
800ffe00
  May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: 
f9a5c3ff8000
  May 07 19:36:41 ap kernel: FS:  7f28c17fa700() 
GS:a03ddc3c() knlGS:
  May 07 19:36:41 ap kernel: CS:  0010 DS:  ES:  CR0: 80050033
  May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 
003626e0
  May 07 19:36:41 ap kernel: DR0:  DR1:  DR2: 

  May 07 19:36:41 ap kernel: DR3:  DR6: fffe0ff0 DR7: 
0400
  May 07 19:36:41 ap kernel: Call Trace:
  May 07 19:36:41 ap kernel:  migrate_pages+0x107/0xb40
  May 07 19:36:41 ap kernel:  ? move_freelist_tail+0xd0/0xd0
  May 07 19:36:41 ap kernel:  ? isolate_freepages_block+0x370/0x370
  May 07 19:36:41 ap kernel:  compact_zone+0x752/0xd70
  May 07 19:36:41 ap kernel:  compact_zone_order+0xd8/0x120
  May 07 19:36:41 ap kernel:  try_to_compact_pages+0xb0/0x260
  May 07 19:36:41 ap 

[Kernel-packages] [Bug 1828131] Re: Qemu causes system hang

2019-05-10 Thread Christian Ehrhardt 
So I guess we should then file the bug against nvidia-driver-430 then?

** Also affects: nvidia-graphics-drivers-418 (Ubuntu)
   Importance: Undecided
   Status: New

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-418 in Ubuntu.
https://bugs.launchpad.net/bugs/1828131

Title:
  Qemu causes system hang

Status in nvidia-graphics-drivers-418 package in Ubuntu:
  New
Status in qemu package in Ubuntu:
  Incomplete

Bug description:
  I'm trying to use https://virt-manager.org/ / QEMU. I migrated over a
  Ubuntu Guest from Virtualbox, added it to virt-manager, and launch. It
  opens the window, then the system freezes.

  I don't know if it's a bug in QEMU, libvirt, or virt-manager, or
  nvidia. Nvidia is working fine otherwise, Version: 430.09.

  This is easily reproduced every time I try starting a qemu machine, so
  I can run any diagnostics while this happens if that helps.

  This is from journalctl at the time of the freeze:

  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: GPU at PCI::01:00: 
GPU-d9fbb72e-29cb-d4db-ad8f-af242c1a6c15
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: Class 0x0 Subchannel 0x0 Mismatch
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x4041b0=0x20
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ESR 0x404000=0x8002
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 13, Graphics 
Exception: ChID 0008, Class 902d, Offset 0860, Data 
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
0008 intr 0200
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 31, Ch 0009, intr 
5000. MMU Fault: ENGINE CE0 HUBCLIENT_CE0 faulted @ 0x1_005a. Fault is 
of type FAULT_PTE ACCESS_TYPE_READ
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (II) NVIDIA(0): Error 
recovery was successful.
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): The 
NVIDIA X driver has encountered an error; attempting to
  May 07 19:35:52 ap /usr/lib/gdm3/gdm-x-session[3400]: (EE) NVIDIA(0): 
recover...
  May 07 19:35:52 ap kernel: NVRM: Xid (PCI::01:00): 32, Channel ID 
000b intr 0200
  May 07 19:36:03 ap kernel: Asynchronous wait on fence 
NVIDIA:nvidia.prime:10ad97 timed out (hint:intel_atomic_commit_ready+0x0/0x50 
[i915])
  May 07 19:36:21 ap libvirtd[1863]: internal error: connection closed due to 
keepalive timeout
  May 07 19:36:41 ap kernel: BUG: unable to handle kernel paging request at 
f9a5c3ff8030
  May 07 19:36:41 ap kernel: #PF error: [normal kernel read fault]
  May 07 19:36:41 ap kernel: PGD 85c9cc067 P4D 85c9cc067 PUD 85c9cb067 PMD 0 
  May 07 19:36:41 ap kernel: Oops:  [#1] SMP PTI
  May 07 19:36:41 ap kernel: CPU: 7 PID: 31632 Comm: worker Tainted: P U  W 
 OE 5.1.0-050100-generic #201905052130
  May 07 19:36:41 ap kernel: Hardware name: Dell Inc. Precision 5530/0FP2W2, 
BIOS 1.10.1 04/26/2019
  May 07 19:36:41 ap kernel: RIP: 0010:compaction_alloc+0x589/0x890
  May 07 19:36:41 ap kernel: Code: 7d b0 41 83 e6 1f 41 83 c6 01 4d 39 fc 73 7b 
e9 57 01 00 00 4d 89 e2 49 c1 e2 06 4c 03 15 57 b9 18 01 4d 89 d7 4d 85 ff 74 
44 <41> 8b 47 30 25 80 00 00 f0 3d 00 00 00 f0 0
  May 07 19:36:41 ap kernel: RSP: 0018:b352435b34a0 EFLAGS: 00010286
  May 07 19:36:41 ap kernel: RAX: a03dfc7d5d00 RBX: b352435b36a0 RCX: 
003d
  May 07 19:36:41 ap kernel: RDX: 800ffe00 RSI:  RDI: 
a03dfc7cd1e0
  May 07 19:36:41 ap kernel: RBP: b352435b3538 R08:  R09: 
a03dfc7d5d00
  May 07 19:36:41 ap kernel: R10: f9a5c3ff8000 R11: b352435b3719 R12: 
800ffe00
  May 07 19:36:41 ap kernel: R13: 8010 R14: 0020 R15: 
f9a5c3ff8000
  May 07 19:36:41 ap kernel: FS:  7f28c17fa700() 
GS:a03ddc3c() knlGS:
  May 07 19:36:41 ap kernel: CS:  0010 DS:  ES:  CR0: 80050033
  May 07 19:36:41 ap kernel: CR2: f9a5c3ff8030 CR3: 00069e550003 CR4: 
003626e0
  May 07 19:36:41 ap kernel: DR0:  DR1:  DR2: 

  May 07 19:36:41 ap kernel: DR3:  DR6: fffe0ff0 DR7: 
0400
  May 07 19:36:41 ap kernel: Call Trace:
  May 07 19:36:41 ap kernel:  migrate_pages+0x107/0xb40
  May 07 19:36:41 ap kernel:  ? move_freelist_tail+0xd0/0xd0
  May 07 19:36:41 ap kernel:  ? isolate_freepages_block+0x370/0x370
  May 07 19:36:41 ap kernel:  compact_zone+0x752/0xd70
  May 07 19:36:41 ap kernel:  compact_zone_order+0xd8/0x120
  May 07 19:36:41 ap kernel:  try_to_compact_pages+0xb0/0x260