[Desktop-packages] [Bug 2037641] Re: amdgpu: [gfxhub0] no-retry page fault

2023-10-05 Thread Mario Limonciello
Ok in this case can you please open an upstream mesa bug?

** Changed in: mesa (Ubuntu)
   Status: Fix Released => Triaged

-- 
You received this bug notification because you are a member of Desktop
Packages, which is subscribed to mesa in Ubuntu.
https://bugs.launchpad.net/bugs/2037641

Title:
  amdgpu: [gfxhub0] no-retry page fault

Status in linux package in Ubuntu:
  Invalid
Status in mesa package in Ubuntu:
  Triaged

Bug description:
  Whenever I use Intellij IDEA, after a couple of minutes the screen
  locks up and pretty much renders the machine unusable. I can still ssh
  to it, but the UI is pretty much gone and doesn't respond to
  Ctrl+Alt+F* keys.

  The setup: the machine is connected to an external monitor via USB-C.
  Running kernel 6.5 on Ubuntu 23.10.

  There's the following in kern.log:

  ```
  2023-09-28T13:58:23.077679+03:00 mavi-ThinkPad-T14s kernel: [  422.206433] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077694+03:00 mavi-ThinkPad-T14s kernel: [  422.206450] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb15852923000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077695+03:00 mavi-ThinkPad-T14s kernel: [  422.206460] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077696+03:00 mavi-ThinkPad-T14s kernel: [  422.206466] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077698+03:00 mavi-ThinkPad-T14s kernel: [  422.206471] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206476] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206481] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077700+03:00 mavi-ThinkPad-T14s kernel: [  422.206485] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206490] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206497] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077702+03:00 mavi-ThinkPad-T14s kernel: [  422.206506] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb07248096000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077703+03:00 mavi-ThinkPad-T14s kernel: [  422.206514] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206519] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206524] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206528] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206533] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077706+03:00 mavi-ThinkPad-T14s kernel: [  422.206538] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077707+03:00 mavi-ThinkPad-T14s kernel: [  422.206542] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077708+03:00 mavi-ThinkPad-T14s kernel: [  422.206549] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077709+03:00 mavi-ThinkPad-T14s kernel: [  422.206556] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xaf8c3d808000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206564] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206569] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077711+03:00 mavi-ThinkPad-T14s kernel: [  422.206574] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206578] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206583] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077713+03:00 mavi-ThinkPad-T14s kernel: [  422.206588] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077713+03:00 mavi-ThinkPad-T14s kernel: [  422.206592] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23

[Desktop-packages] [Bug 2037641] Re: amdgpu: [gfxhub0] no-retry page fault

2023-10-04 Thread Martin Vysny
The process is Intellij IDEA, running on Java 17 in Xwayland.

-- 
You received this bug notification because you are a member of Desktop
Packages, which is subscribed to mesa in Ubuntu.
https://bugs.launchpad.net/bugs/2037641

Title:
  amdgpu: [gfxhub0] no-retry page fault

Status in linux package in Ubuntu:
  Invalid
Status in mesa package in Ubuntu:
  Fix Released

Bug description:
  Whenever I use Intellij IDEA, after a couple of minutes the screen
  locks up and pretty much renders the machine unusable. I can still ssh
  to it, but the UI is pretty much gone and doesn't respond to
  Ctrl+Alt+F* keys.

  The setup: the machine is connected to an external monitor via USB-C.
  Running kernel 6.5 on Ubuntu 23.10.

  There's the following in kern.log:

  ```
  2023-09-28T13:58:23.077679+03:00 mavi-ThinkPad-T14s kernel: [  422.206433] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077694+03:00 mavi-ThinkPad-T14s kernel: [  422.206450] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb15852923000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077695+03:00 mavi-ThinkPad-T14s kernel: [  422.206460] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077696+03:00 mavi-ThinkPad-T14s kernel: [  422.206466] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077698+03:00 mavi-ThinkPad-T14s kernel: [  422.206471] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206476] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206481] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077700+03:00 mavi-ThinkPad-T14s kernel: [  422.206485] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206490] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206497] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077702+03:00 mavi-ThinkPad-T14s kernel: [  422.206506] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb07248096000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077703+03:00 mavi-ThinkPad-T14s kernel: [  422.206514] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206519] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206524] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206528] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206533] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077706+03:00 mavi-ThinkPad-T14s kernel: [  422.206538] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077707+03:00 mavi-ThinkPad-T14s kernel: [  422.206542] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077708+03:00 mavi-ThinkPad-T14s kernel: [  422.206549] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077709+03:00 mavi-ThinkPad-T14s kernel: [  422.206556] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xaf8c3d808000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206564] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206569] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077711+03:00 mavi-ThinkPad-T14s kernel: [  422.206574] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206578] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206583] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077713+03:00 mavi-ThinkPad-T14s kernel: [  422.206588] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077713+03:00 mavi-ThinkPad-T14s kernel: [  422.206592] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077726+03:00 mavi-ThinkPad-T14s kernel: [  422.206597] 
amd

[Desktop-packages] [Bug 2037641] Re: amdgpu: [gfxhub0] no-retry page fault

2023-10-04 Thread Martin Vysny
Unfortunately the problem is still reproducible even with the newest
mesa, even though it looks like it's much less frequent. Yesterday
evening I got another crash, with mesa 23.2.1-1ubuntu2:

```
2023-10-04T22:33:15.415854+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119146] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:4 pasid:32770, for process Xwayland pid 4940 thread Xwayland:cs0 pid 5015)
2023-10-04T22:33:15.415871+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119168] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xe5ea2326e000 
from IH client 0x1b (UTCL2)
2023-10-04T22:33:15.415873+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119180] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00400430
2023-10-04T22:33:15.415874+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119187] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
2023-10-04T22:33:15.415875+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119194] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x0
2023-10-04T22:33:15.415876+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119200] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
2023-10-04T22:33:15.415878+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119206] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
2023-10-04T22:33:15.415878+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119212] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
2023-10-04T22:33:15.415879+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119218] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
2023-10-04T22:33:15.415881+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119750] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:4 pasid:32770, for process Xwayland pid 4940 thread Xwayland:cs0 pid 5015)
2023-10-04T22:33:15.415881+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119768] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xe5ea2326f000 
from IH client 0x1b (UTCL2)
2023-10-04T22:33:15.415883+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119780] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00400430
2023-10-04T22:33:15.415884+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119787] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
2023-10-04T22:33:15.415885+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119793] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x0
2023-10-04T22:33:15.415885+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119800] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
2023-10-04T22:33:15.415886+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119806] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
2023-10-04T22:33:15.415887+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119812] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
2023-10-04T22:33:15.415888+03:00 mavi-ThinkPad-T14s kernel: [ 1076.119818] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
2023-10-04T22:33:25.115577+03:00 mavi-ThinkPad-T14s kernel: [ 1085.820288] 
[drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_low timeout, signaled 
seq=75561, emitted seq=75563
2023-10-04T22:33:25.115600+03:00 mavi-ThinkPad-T14s kernel: [ 1085.821169] 
[drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process 
Xwayland pid 4940 thread Xwayland:cs0 pid 5015
2023-10-04T22:33:25.115602+03:00 mavi-ThinkPad-T14s kernel: [ 1085.822031] 
amdgpu :06:00.0: amdgpu: GPU reset begin!
2023-10-04T22:33:25.347547+03:00 mavi-ThinkPad-T14s kernel: [ 1086.052065] 
[drm] psp gfx command UNLOAD_TA(0x2) failed and response status is (0x117)
2023-10-04T22:33:25.375820+03:00 mavi-ThinkPad-T14s kernel: [ 1086.078048] 
amdgpu :06:00.0: amdgpu: MODE2 reset
2023-10-04T22:33:25.375835+03:00 mavi-ThinkPad-T14s kernel: [ 1086.078296] 
amdgpu :06:00.0: amdgpu: GPU reset succeeded, trying to resume
2023-10-04T22:33:25.375837+03:00 mavi-ThinkPad-T14s kernel: [ 1086.078498] 
[drm] PCIE GART of 1024M enabled.
2023-10-04T22:33:25.375838+03:00 mavi-ThinkPad-T14s kernel: [ 1086.078503] 
[drm] PTB located at 0x00F43FC0
2023-10-04T22:33:25.375839+03:00 mavi-ThinkPad-T14s kernel: [ 1086.078625] 
[drm] PSP is resuming...
2023-10-04T22:33:26.075542+03:00 mavi-ThinkPad-T14s kernel: [ 1086.780162] 
[drm] reserve 0x40 from 0xf43f80 for PSP TMR
2023-10-04T22:33:26.363527+03:00 mavi-ThinkPad-T14s kernel: [ 1087.066834] 
amdgpu :06:00.0: amdgpu: RAS: optional ras ta ucode is not available
2023-10-04T22:33:26.375796+03:00 mavi-ThinkPad-T14s kernel: [ 1087.078189] 
amdgpu :06:00.0: amdgpu: RAP: optional rap ta ucode is not available
2023-10-04T22:33:26.375806+03:00 mavi-ThinkPad-T14s kernel: [ 1087.078196] 
amdgpu :06:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not 
available
2023-10-04T22:33:26.375808+03:00 mavi-ThinkPad-T14s kernel: [ 1087.078204] 
amdgpu :06:00.0: amdgpu: SMU is resuming...
2023-10-04T22:33:26.375809+03:00 mavi-ThinkPad-T14s kernel: [ 1087.079063] 
amdgpu :06:00.0: amdgpu: SMU is resumed succ

[Desktop-packages] [Bug 2037641] Re: amdgpu: [gfxhub0] no-retry page fault

2023-10-04 Thread Mario Limonciello
I'd say that's very likely. That mesa upgrade just landed in the archive
a few days ago and the trace you reported looks more like how a mesa bug
manifests.

** Changed in: linux (Ubuntu)
   Status: Confirmed => Invalid

** Changed in: mesa (Ubuntu)
   Status: New => Fix Released

-- 
You received this bug notification because you are a member of Desktop
Packages, which is subscribed to mesa in Ubuntu.
https://bugs.launchpad.net/bugs/2037641

Title:
  amdgpu: [gfxhub0] no-retry page fault

Status in linux package in Ubuntu:
  Invalid
Status in mesa package in Ubuntu:
  Fix Released

Bug description:
  Whenever I use Intellij IDEA, after a couple of minutes the screen
  locks up and pretty much renders the machine unusable. I can still ssh
  to it, but the UI is pretty much gone and doesn't respond to
  Ctrl+Alt+F* keys.

  The setup: the machine is connected to an external monitor via USB-C.
  Running kernel 6.5 on Ubuntu 23.10.

  There's the following in kern.log:

  ```
  2023-09-28T13:58:23.077679+03:00 mavi-ThinkPad-T14s kernel: [  422.206433] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077694+03:00 mavi-ThinkPad-T14s kernel: [  422.206450] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb15852923000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077695+03:00 mavi-ThinkPad-T14s kernel: [  422.206460] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077696+03:00 mavi-ThinkPad-T14s kernel: [  422.206466] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077698+03:00 mavi-ThinkPad-T14s kernel: [  422.206471] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206476] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206481] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077700+03:00 mavi-ThinkPad-T14s kernel: [  422.206485] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206490] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206497] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077702+03:00 mavi-ThinkPad-T14s kernel: [  422.206506] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb07248096000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077703+03:00 mavi-ThinkPad-T14s kernel: [  422.206514] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206519] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206524] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206528] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206533] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077706+03:00 mavi-ThinkPad-T14s kernel: [  422.206538] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077707+03:00 mavi-ThinkPad-T14s kernel: [  422.206542] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077708+03:00 mavi-ThinkPad-T14s kernel: [  422.206549] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077709+03:00 mavi-ThinkPad-T14s kernel: [  422.206556] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xaf8c3d808000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206564] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206569] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077711+03:00 mavi-ThinkPad-T14s kernel: [  422.206574] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206578] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206583] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077713+03:00 mavi-ThinkPad-T14s kernel: [  422.206588] 
amdgpu :06:00.0: amdgpu:   

[Desktop-packages] [Bug 2037641] Re: amdgpu: [gfxhub0] no-retry page fault

2023-10-04 Thread Martin Vysny
Thanks Mario! I checked, and I have the newest mesa:

```
$ apt search mesa-vdpau
Sorting... Done
Full Text Search... Done
mesa-vdpau-drivers/mantic,now 23.2.1-1ubuntu2 amd64 [installed,automatic]
  Mesa VDPAU video acceleration drivers
```

Funny thing is that the bug is no longer reproducible - everything is
working properly, yay! Could it be that mesa was upgraded a couple of
days ago, resolving the issue?

Anyways, the issue looks to be fixed now. Thanks again!

-- 
You received this bug notification because you are a member of Desktop
Packages, which is subscribed to mesa in Ubuntu.
https://bugs.launchpad.net/bugs/2037641

Title:
  amdgpu: [gfxhub0] no-retry page fault

Status in linux package in Ubuntu:
  Confirmed
Status in mesa package in Ubuntu:
  New

Bug description:
  Whenever I use Intellij IDEA, after a couple of minutes the screen
  locks up and pretty much renders the machine unusable. I can still ssh
  to it, but the UI is pretty much gone and doesn't respond to
  Ctrl+Alt+F* keys.

  The setup: the machine is connected to an external monitor via USB-C.
  Running kernel 6.5 on Ubuntu 23.10.

  There's the following in kern.log:

  ```
  2023-09-28T13:58:23.077679+03:00 mavi-ThinkPad-T14s kernel: [  422.206433] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077694+03:00 mavi-ThinkPad-T14s kernel: [  422.206450] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb15852923000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077695+03:00 mavi-ThinkPad-T14s kernel: [  422.206460] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077696+03:00 mavi-ThinkPad-T14s kernel: [  422.206466] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077698+03:00 mavi-ThinkPad-T14s kernel: [  422.206471] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206476] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206481] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077700+03:00 mavi-ThinkPad-T14s kernel: [  422.206485] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206490] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206497] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077702+03:00 mavi-ThinkPad-T14s kernel: [  422.206506] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb07248096000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077703+03:00 mavi-ThinkPad-T14s kernel: [  422.206514] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206519] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206524] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206528] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206533] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077706+03:00 mavi-ThinkPad-T14s kernel: [  422.206538] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077707+03:00 mavi-ThinkPad-T14s kernel: [  422.206542] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077708+03:00 mavi-ThinkPad-T14s kernel: [  422.206549] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077709+03:00 mavi-ThinkPad-T14s kernel: [  422.206556] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xaf8c3d808000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206564] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206569] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077711+03:00 mavi-ThinkPad-T14s kernel: [  422.206574] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206578] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206583] 

[Desktop-packages] [Bug 2037641] Re: amdgpu: [gfxhub0] no-retry page fault

2023-10-03 Thread Mario Limonciello
Are you up to date on the current version of mesa in 23.10?
23.2.1-1ubuntu2?

** Also affects: mesa (Ubuntu)
   Importance: Undecided
   Status: New

-- 
You received this bug notification because you are a member of Desktop
Packages, which is subscribed to mesa in Ubuntu.
https://bugs.launchpad.net/bugs/2037641

Title:
  amdgpu: [gfxhub0] no-retry page fault

Status in linux package in Ubuntu:
  Confirmed
Status in mesa package in Ubuntu:
  New

Bug description:
  Whenever I use Intellij IDEA, after a couple of minutes the screen
  locks up and pretty much renders the machine unusable. I can still ssh
  to it, but the UI is pretty much gone and doesn't respond to
  Ctrl+Alt+F* keys.

  The setup: the machine is connected to an external monitor via USB-C.
  Running kernel 6.5 on Ubuntu 23.10.

  There's the following in kern.log:

  ```
  2023-09-28T13:58:23.077679+03:00 mavi-ThinkPad-T14s kernel: [  422.206433] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077694+03:00 mavi-ThinkPad-T14s kernel: [  422.206450] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb15852923000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077695+03:00 mavi-ThinkPad-T14s kernel: [  422.206460] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077696+03:00 mavi-ThinkPad-T14s kernel: [  422.206466] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077698+03:00 mavi-ThinkPad-T14s kernel: [  422.206471] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206476] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077699+03:00 mavi-ThinkPad-T14s kernel: [  422.206481] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077700+03:00 mavi-ThinkPad-T14s kernel: [  422.206485] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206490] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077701+03:00 mavi-ThinkPad-T14s kernel: [  422.206497] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077702+03:00 mavi-ThinkPad-T14s kernel: [  422.206506] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xb07248096000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077703+03:00 mavi-ThinkPad-T14s kernel: [  422.206514] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206519] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077704+03:00 mavi-ThinkPad-T14s kernel: [  422.206524] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206528] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077705+03:00 mavi-ThinkPad-T14s kernel: [  422.206533] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077706+03:00 mavi-ThinkPad-T14s kernel: [  422.206538] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077707+03:00 mavi-ThinkPad-T14s kernel: [  422.206542] 
amdgpu :06:00.0: amdgpu:  RW: 0x0
  2023-09-28T13:58:23.077708+03:00 mavi-ThinkPad-T14s kernel: [  422.206549] 
amdgpu :06:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:24 
vmid:6 pasid:32773, for process Xwayland pid 7640 thread Xwayland:cs0 pid 7663)
  2023-09-28T13:58:23.077709+03:00 mavi-ThinkPad-T14s kernel: [  422.206556] 
amdgpu :06:00.0: amdgpu:   in page starting at address 0xaf8c3d808000 
from IH client 0x1b (UTCL2)
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206564] 
amdgpu :06:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00600431
  2023-09-28T13:58:23.077710+03:00 mavi-ThinkPad-T14s kernel: [  422.206569] 
amdgpu :06:00.0: amdgpu:  Faulty UTCL2 client ID: IA (0x2)
  2023-09-28T13:58:23.077711+03:00 mavi-ThinkPad-T14s kernel: [  422.206574] 
amdgpu :06:00.0: amdgpu:  MORE_FAULTS: 0x1
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206578] 
amdgpu :06:00.0: amdgpu:  WALKER_ERROR: 0x0
  2023-09-28T13:58:23.077712+03:00 mavi-ThinkPad-T14s kernel: [  422.206583] 
amdgpu :06:00.0: amdgpu:  PERMISSION_FAULTS: 0x3
  2023-09-28T13:58:23.077713+03:00 mavi-ThinkPad-T14s kernel: [  422.206588] 
amdgpu :06:00.0: amdgpu:  MAPPING_ERROR: 0x0
  2023-09-28T13:58:23.077713+03:00 mavi-ThinkPad-T14s kernel: [  422.206592] 
amdgpu :06:00.0: amdgpu:  RW: 0