Hello,

Could you please try the kernel update which is currently in -proposed
(version 5.4.0-81.91) to see if it fixes the issue you are having?

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on
how to enable and use -proposed. Thank you!

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1939417

Title:
  Ubuntu GUI crashes (w and w/o Wayland), VM_L2_PROTECTION_FAULT,
  amdgpu 0000:09:00.0: [gfxhub0] retry page fault

Status in linux package in Ubuntu:
  New

Bug description:
  Ubuntu GUI crashes (w and w/o Wayland)
  on
  AMD Ryzen 5 3400G with Radeon Vega Graphics (family: 0x17, model: 0x18, stepping: 0x1)

  sometimes total OS crash, sometimes ssh access still possible.

  1) The release of Ubuntu you are using
  Distributor ID:       Ubuntu
  Description:  Ubuntu 20.04.2 LTS
  Release:      20.04
  Codename:     focal

  2) The version of the package you are using
  The GUI crashes at different times and in different situations; no specific application or package has been identified.

  3) What you expected to happen
  Ubuntu working, applications like LibreOffice do their job.

  4) What happened instead
  After some time of successful work (from seconds to an hour), keyboard input suddenly stops working and the OS crashes: the GUI freezes, mouse movement usually freezes too (sometimes not), and on one occasion the GUI flickered.
  Sometimes a remote reboot over ssh is possible. Sometimes the network interface does not even respond to ping.

  Tried without success: starting the "Ubuntu" session without Wayland shows the same issue as "Ubuntu with Wayland"; several reboots did not help either.
  No issues occur when accessing the machine remotely via ssh. Using applications like LibreOffice, OracleVM or Nemo over remote ssh (IPv6) did not show this issue.

  dmesg output at the time of the crash:

  [ 4926.673857] [drm] enabling link 0 failed: 15
  [ 6478.430465] gmc_v9_0_process_interrupt: 15 callbacks suppressed
  [ 6478.430474] amdgpu 0000:09:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:3 pasid:32772, for process Xorg pid 7853 thread Xorg:cs0 pid 7854)
  [ 6478.430481] amdgpu 0000:09:00.0:   in page starting at address 0x000080010ea95000 from client 27
  [ 6478.430484] amdgpu 0000:09:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00301031
  [ 6478.430487] amdgpu 0000:09:00.0:      MORE_FAULTS: 0x1
  [ 6478.430490] amdgpu 0000:09:00.0:      WALKER_ERROR: 0x0
  [ 6478.430492] amdgpu 0000:09:00.0:      PERMISSION_FAULTS: 0x3
  [ 6478.430495] amdgpu 0000:09:00.0:      MAPPING_ERROR: 0x0
  [ 6478.430497] amdgpu 0000:09:00.0:      RW: 0x0
  [ 6478.430506] amdgpu 0000:09:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:3 pasid:32772, for process Xorg pid 7853 thread Xorg:cs0 pid 7854)
  [ 6478.430509] amdgpu 0000:09:00.0:   in page starting at address 0x000080010eac9000 from client 27
  [ 6478.430511] amdgpu 0000:09:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00301031
  [ 6478.430514] amdgpu 0000:09:00.0:      MORE_FAULTS: 0x1
  [ 6478.430516] amdgpu 0000:09:00.0:      WALKER_ERROR: 0x0
  [ 6478.430518] amdgpu 0000:09:00.0:      PERMISSION_FAULTS: 0x3
  [ 6478.430520] amdgpu 0000:09:00.0:      MAPPING_ERROR: 0x0
  [ 6478.430522] amdgpu 0000:09:00.0:      RW: 0x0
  [ 6478.430530] amdgpu 0000:09:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:3 pasid:32772, for process Xorg pid 7853 thread Xorg:cs0 pid 7854)

  ....

  [ 6488.436541] amdgpu 0000:09:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:3 pasid:32772, for process Xorg pid 7853 thread Xorg:cs0 pid 7854)
  [ 6488.436542] amdgpu 0000:09:00.0:   in page starting at address 0x000080010eae3000 from client 27
  [ 6488.436544] amdgpu 0000:09:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00301031
  [ 6488.436545] amdgpu 0000:09:00.0:      MORE_FAULTS: 0x1
  [ 6488.436546] amdgpu 0000:09:00.0:      WALKER_ERROR: 0x0
  [ 6488.436547] amdgpu 0000:09:00.0:      PERMISSION_FAULTS: 0x3
  [ 6488.436548] amdgpu 0000:09:00.0:      MAPPING_ERROR: 0x0
  [ 6488.436549] amdgpu 0000:09:00.0:      RW: 0x0
  [ 6488.615178] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
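
  The decoded fields in the log above can be reproduced from the raw VM_L2_PROTECTION_FAULT_STATUS value. The following is a minimal sketch, not the driver's code: the bit positions in FIELDS are an assumption about the register layout on this GPU generation, checked only against the values printed in this log, and the names FIELDS and decode_fault_status are hypothetical.

```python
# Assumed (shift, width) of each field inside VM_L2_PROTECTION_FAULT_STATUS.
# This layout is an assumption; it is only checked against the dmesg output above.
FIELDS = {
    "MORE_FAULTS": (0, 1),
    "WALKER_ERROR": (1, 3),
    "PERMISSION_FAULTS": (4, 4),
    "MAPPING_ERROR": (8, 1),
    "RW": (18, 1),
}


def decode_fault_status(status):
    """Extract each named bit field from the raw 32-bit status value."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    }


# Raw value taken from the log lines above.
decoded = decode_fault_status(0x00301031)
for name, value in decoded.items():
    print(f"{name}: {value:#x}")
```

  Under that assumed layout, 0x00301031 decodes to the same MORE_FAULTS, WALKER_ERROR, PERMISSION_FAULTS, MAPPING_ERROR and RW values that the kernel printed.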

  
-----------------------------------------------------------------------------------

  [    0.000000] Linux version 5.4.0-80-generic (buildd@lcy01-amd64-030) (gcc version 9.3.0 (Ubuntu 9.3.0-17ubuntu1~20.04)) #90-Ubuntu SMP Fri Jul 9 22:49:44 UTC 2021 (Ubuntu 5.4.0-80.90-generic 5.4.124)
  [    0.000000] Command line: BOOT_IMAGE=/vmlinuz-5.4.0-80-generic root=UUID=2fac2ccc-b353-4ced-a8e5-7e5a7f0fe5f3 ro
  [    0.000000] KERNEL supported cpus:
  [    0.000000]   Intel GenuineIntel
  [    0.000000]   AMD AuthenticAMD
  [    0.000000]   Hygon HygonGenuine
  [    0.000000]   Centaur CentaurHauls
  [    0.000000]   zhaoxin   Shanghai
  [    0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
  [    0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
  [    0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
  [    0.000000] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256
  [    0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'compacted' format.
  [    0.000000] BIOS-provided physical RAM map:
  [    0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009d3ff] usable
  [    0.000000] BIOS-e820: [mem 0x000000000009d400-0x000000000009ffff] reserved
  [    0.000000] BIOS-e820: [mem 0x00000000000e0000-0x00000000000fffff] reserved
  [    0.000000] BIOS-e820: [mem 0x0000000000100000-0x0000000009d01fff] usable
  [    0.000000] BIOS-e820: [mem 0x0000000009d02000-0x0000000009ffffff] reserved
  [    0.000000] BIOS-e820: [mem 0x000000000a000000-0x000000000a1fffff] usable
  [    0.000000] BIOS-e820: [mem 0x000000000a200000-0x000000000a20afff] ACPI NVS
  [    0.000000] BIOS-e820: [mem 0x000000000a20b000-0x000000000affffff] usable
  [    0.000000] BIOS-e820: [mem 0x000000000b000000-0x000000000b01ffff] reserved
  [    0.000000] BIOS-e820: [mem 0x000000000b020000-0x000000003aeccfff] usable
  [    0.000000] BIOS-e820: [mem 0x000000003aecd000-0x000000003c3adfff] reserved
  [    0.000000] BIOS-e820: [mem 0x000000003c3ae000-0x000000003c52dfff] usable
  [    0.000000] BIOS-e820: [mem 0x000000003c52e000-0x000000003c93bfff] ACPI NVS
  [    0.000000] BIOS-e820: [mem 0x000000003c93c000-0x000000003d767fff] reserved
  [    0.000000] BIOS-e820: [mem 0x000000003d768000-0x000000003effffff] usable
  [    0.000000] BIOS-e820: [mem 0x000000003f000000-0x00000000bfffffff] reserved
  [    0.000000] BIOS-e820: [mem 0x00000000e0000000-0x00000000efffffff] reserved
  [    0.000000] BIOS-e820: [mem 0x00000000fd000000-0x00000000ffffffff] reserved
  [    0.000000] BIOS-e820: [mem 0x0000000100000000-0x000000043f33ffff] usable
  [    0.000000] NX (Execute Disable) protection: active
  [    0.000000] SMBIOS 3.2.1 present.
  [    0.000000] DMI: To Be Filled By O.E.M. To Be Filled By O.E.M./B450 Pro4, BIOS P4.20 06/18/2020
  [    0.000000] tsc: Fast TSC calibration using PIT
  [    0.000000] tsc: Detected 3692.878 MHz processor
  [    0.000759] e820: update [mem 0x00000000-0x00000fff] usable ==> reserved
  [    0.000760] e820: remove [mem 0x000a0000-0x000fffff] usable
  [    0.000764] last_pfn = 0x43f340 max_arch_pfn = 0x400000000
  [    0.000769] MTRR default type: uncachable
  [    0.000769] MTRR fixed ranges enabled:
  [    0.000770]   00000-9FFFF write-back
  [    0.000770]   A0000-BFFFF write-through
  [    0.000771]   C0000-FFFFF write-protect
  [    0.000772] MTRR variable ranges enabled:
  [    0.000773]   0 base 000000000000 mask FFFF80000000 write-back
  [    0.000773]   1 base 000080000000 mask FFFFC0000000 write-back
  [    0.000774]   2 disabled
  [    0.000774]   3 disabled
  [    0.000774]   4 disabled
  [    0.000775]   5 disabled
  [    0.000775]   6 disabled
  [    0.000775]   7 disabled
  [    0.000776] TOM2: 0000000440000000 aka 17408M
  [    0.001037] x86/PAT: Configuration [0-7]: WB  WC  UC- UC  WB  WP  UC- WT
  [    0.001198] e820: update [mem 0xc0000000-0xffffffff] usable ==> reserved
  [    0.001203] last_pfn = 0x3f000 max_arch_pfn = 0x400000000
  [    0.004468] check: Scanning 1 areas for low memory corruption
  [    0.004477] Using GB pages for direct mapping
  [    0.004794] RAMDISK: [mem 0x2dd4d000-0x32e9dfff]
  [    0.004799] ACPI: Early table checksum verification disabled
  [    0.004802] ACPI: RSDP 0x00000000000F05A0 000024 (v02 ALASKA)
  [    0.004804] ACPI: XSDT 0x000000003C8B90A0 0000BC (v01 ALASKA A M I    01072009 AMI  00010013)
  [    0.004809] ACPI: FACP 0x000000003C8BFC50 000114 (v06 ALASKA A M I    01072009 AMI  00010013)
  [    0.004813] ACPI: DSDT 0x000000003C8B91F0 006A60 (v02 ALASKA A M I    01072009 INTL 20120913)
  [    0.004815] ACPI: FACS 0x000000003C925E00 000040
  [    0.004817] ACPI: APIC 0x000000003C8BFD68 00015E (v03 ALASKA A M I    01072009 AMI  00010013)
  [    0.004819] ACPI: FPDT 0x000000003C8BFEC8 000044 (v01 ALASKA A M I    01072009 AMI  00010013)
  [    0.004821] ACPI: FIDT 0x000000003C8BFF10 00009C (v01 ALASKA A M I    01072009 AMI  00010013)
  [    0.004823] ACPI: SSDT 0x000000003C8BFFB0 000094 (v02 ALASKA CPUSSDT  01072009 AMI  01072009)
  [    0.004825] ACPI: SSDT 0x000000003C8C0048 005419 (v02 AMD    AmdTable 00000002 MSFT 04000000)
  [    0.004827] ACPI: SSDT 0x000000003C8C5468 00378A (v01 AMD    AMD AOD  00000001 INTL 20120913)
  [    0.004830] ACPI: MCFG 0x000000003C8C8BF8 00003C (v01 ALASKA A M I    01072009 MSFT 00010013)
  [    0.004831] ACPI: AAFT 0x000000003C8C8C38 000463 (v01 ALASKA OEMAAFT  01072009 MSFT 00000097)
  [    0.004834] ACPI: HPET 0x000000003C8C90A0 000038 (v01 ALASKA A M I    01072009 AMI  00000005)
  [    0.004836] ACPI: UEFI 0x000000003C8C90D8 000042 (v01 ALASKA A M I    00000002      01000013)
  [    0.004838] ACPI: IVRS 0x000000003C8C9120 0000D0 (v02 AMD    AMD IVRS 00000001 AMD  00000000)
  [    0.004840] ACPI: SSDT 0x000000003C8C91F0 000E0C (v01 AMD    AMD CPU  00000001 AMD  00000001)
  [    0.004842] ACPI: CRAT 0x000000003C8CA000 000810 (v01 AMD    AMD CRAT 00000001 AMD  00000001)
  [    0.004844] ACPI: CDIT 0x000000003C8CA810 000029 (v01 AMD    AMD CDIT 00000001 AMD  00000001)
  [    0.004846] ACPI: SSDT 0x000000003C8CA840 001D34 (v01 AMD    AmdTable 00000001 INTL 20120913)
  [    0.004848] ACPI: SSDT 0x000000003C8CC578 0000BF (v01 AMD    AMD PT   00001000 INTL 20120913)
  [    0.004850] ACPI: WSMT 0x000000003C8CC638 000028 (v01 ALASKA A M I    01072009 AMI  00010013)
  [    0.004852] ACPI: SSDT 0x000000003C8CC660 0010AF (v01 AMD    AmdTable 00000001 INTL 20120913)
  [    0.004854] ACPI: Reserving FACP table memory at [mem 0x3c8bfc50-0x3c8bfd63]
  [    0.004855] ACPI: Reserving DSDT table memory at [mem 0x3c8b91f0-0x3c8bfc4f]
  [    0.004855] ACPI: Reserving FACS table memory at [mem 0x3c925e00-0x3c925e3f]
  [    0.004856] ACPI: Reserving APIC table memory at [mem 0x3c8bfd68-0x3c8bfec5]
  [    0.004857] ACPI: Reserving FPDT table memory at [mem 0x3c8bfec8-0x3c8bff0b]
  [    0.004857] ACPI: Reserving FIDT table memory at [mem 0x3c8bff10-0x3c8bffab]
  [    0.004858] ACPI: Reserving SSDT table memory at [mem 0x3c8bffb0-0x3c8c0043]
  [    0.004859] ACPI: Reserving SSDT table memory at [mem 0x3c8c0048-0x3c8c5460]
  [    0.004859] ACPI: Reserving SSDT table memory at [mem 0x3c8c5468-0x3c8c8bf1]
  [    0.004860] ACPI: Reserving MCFG table memory at [mem 0x3c8c8bf8-0x3c8c8c33]
  [    0.004861] ACPI: Reserving AAFT table memory at [mem 0x3c8c8c38-0x3c8c909a]
  [    0.004861] ACPI: Reserving HPET table memory at [mem 0x3c8c90a0-0x3c8c90d7]
  [    0.004862] ACPI: Reserving UEFI table memory at [mem 0x3c8c90d8-0x3c8c9119]
  [    0.004863] ACPI: Reserving IVRS table memory at [mem 0x3c8c9120-0x3c8c91ef]
  [    0.004863] ACPI: Reserving SSDT table memory at [mem 0x3c8c91f0-0x3c8c9ffb]
  [    0.004864] ACPI: Reserving CRAT table memory at [mem 0x3c8ca000-0x3c8ca80f]
  [    0.004865] ACPI: Reserving CDIT table memory at [mem 0x3c8ca810-0x3c8ca838]
  [    0.004865] ACPI: Reserving SSDT table memory at [mem 0x3c8ca840-0x3c8cc573]
  [    0.004866] ACPI: Reserving SSDT table memory at [mem 0x3c8cc578-0x3c8cc636]
  [    0.004867] ACPI: Reserving WSMT table memory at [mem 0x3c8cc638-0x3c8cc65f]
  [    0.004867] ACPI: Reserving SSDT table memory at [mem 0x3c8cc660-0x3c8cd70e]
  [    0.004879] ACPI: Local APIC address 0xfee00000
  [    0.004979] No NUMA configuration found
  [    0.004980] Faking a node at [mem 0x0000000000000000-0x000000043f33ffff]
  [    0.004987] NODE_DATA(0) allocated [mem 0x43f315000-0x43f33ffff]
  [    0.005139] Zone ranges:
  [    0.005140]   DMA      [mem 0x0000000000001000-0x0000000000ffffff]
  [    0.005141]   DMA32    [mem 0x0000000001000000-0x00000000ffffffff]
  [    0.005142]   Normal   [mem 0x0000000100000000-0x000000043f33ffff]
  [    0.005142]   Device   empty
  [    0.005143] Movable zone start for each node
  [    0.005145] Early memory node ranges
  [    0.005146]   node   0: [mem 0x0000000000001000-0x000000000009cfff]
  [    0.005147]   node   0: [mem 0x0000000000100000-0x0000000009d01fff]
  [    0.005148]   node   0: [mem 0x000000000a000000-0x000000000a1fffff]
  [    0.005148]   node   0: [mem 0x000000000a20b000-0x000000000affffff]
  [    0.005149]   node   0: [mem 0x000000000b020000-0x000000003aeccfff]
  [    0.005150]   node   0: [mem 0x000000003c3ae000-0x000000003c52dfff]
  [    0.005150]   node   0: [mem 0x000000003d768000-0x000000003effffff]
  [    0.005151]   node   0: [mem 0x0000000100000000-0x000000043f33ffff]
  [    0.005238] Zeroed struct page in unavailable ranges: 18280 pages
  [    0.005239] Initmem setup node 0 [mem 0x0000000000001000-0x000000043f33ffff]
  [    0.005240] On node 0 totalpages: 3651736
  [    0.005241]   DMA zone: 64 pages used for memmap
  [    0.005241]   DMA zone: 21 pages reserved
  [    0.005242]   DMA zone: 3996 pages, LIFO batch:0
  [    0.005278]   DMA32 zone: 3799 pages used for memmap
  [    0.005279]   DMA32 zone: 243132 pages, LIFO batch:63
  [    0.009896]   Normal zone: 53197 pages used for memmap
  [    0.009897]   Normal zone: 3404608 pages, LIFO batch:63
  [    0.042167] ACPI: PM-Timer IO Port: 0x808
  [    0.042170] ACPI: Local APIC address 0xfee00000
  [    0.042177] ACPI: LAPIC_NMI (acpi_id[0xff] high edge lint[0x1])
  [    0.042192] IOAPIC[0]: apic_id 9, version 33, address 0xfec00000, GSI 0-23
  [    0.042199] IOAPIC[1]: apic_id 10, version 33, address 0xfec01000, GSI 24-55
  [    0.042201] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
  [    0.042202] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 low level)
  [    0.042203] ACPI: IRQ0 used by override.
  [    0.042203] ACPI: IRQ9 used by override.
  [    0.042205] Using ACPI (MADT) for SMP configuration information
  [    0.042206] ACPI: HPET id: 0x10228201 base: 0xfed00000
  [    0.042210] smpboot: Allowing 32 CPUs, 24 hotplug CPUs
  ........

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1939417/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
