Having this issue on AMD Ryzen 7 7800X3D 8-Core Processor.  iGPU is
enabled.

[    8.237617] kernel: ------------[ cut here ]------------
[    8.237636] kernel: UBSAN: array-index-out-of-bounds in 
/build/linux-tazIIz/linux-6.14.0/drivers/gpu/drm/amd/amdgpu/../display/dc/dml2/dml21/src/dml2_core/dml2_core_dcn4_calcs.c:11590:83
[    8.237652] kernel: index 4 is out of range for type 'double [2][8]'
[    8.237658] kernel: CPU: 3 UID: 0 PID: 558 Comm: plymouthd Kdump: loaded Not 
tainted 6.14.0-33-generic #33-Ubuntu
[    8.237664] kernel: Hardware name: ASUS System Product Name/ProArt 
X870E-CREATOR WIFI, BIOS 1203 03/04/2025
[    8.237670] kernel: Call Trace:
[    8.237675] kernel:  <TASK>
[    8.237681] kernel:  show_stack+0x49/0x60
[    8.237686] kernel:  dump_stack_lvl+0x5f/0x90
[    8.237696] kernel:  dump_stack+0x10/0x18
[    8.237701] kernel:  ubsan_epilogue+0x9/0x40
[    8.237706] kernel:  __ubsan_handle_out_of_bounds.cold+0x44/0x49
[    8.237712] kernel:  dml_core_mode_programming.cold+0xfe5/0x1cfb [amdgpu]
[    8.237717] kernel:  dml2_core_calcs_mode_programming_ex+0x2a/0x70 [amdgpu]
[    8.237723] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237728] kernel:  ? dml2_core_calcs_mode_programming_ex+0x2a/0x70 [amdgpu]
[    8.237733] kernel:  core_dcn4_mode_programming+0xdf/0xda0 [amdgpu]
[    8.237739] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237746] kernel:  dml2_top_soc15_build_mode_programming+0x3f3/0x6a0 
[amdgpu]
[    8.237751] kernel:  dml2_build_mode_programming+0x1a/0x30 [amdgpu]
[    8.237757] kernel:  dml21_mode_check_and_programming+0x13b/0x230 [amdgpu]
[    8.237762] kernel:  dml21_validate+0x25/0x40 [amdgpu]
[    8.237767] kernel:  dml2_validate+0xbf/0xe0 [amdgpu]
[    8.237773] kernel:  dcn401_validate_bandwidth+0x4e/0x70 [amdgpu]
[    8.237778] kernel:  update_planes_and_stream_state+0x26f/0x5d0 [amdgpu]
[    8.237783] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237789] kernel:  update_planes_and_stream_v3+0x52/0x1e0 [amdgpu]
[    8.237820] kernel:  dc_update_planes_and_stream+0x5d/0x110 [amdgpu]
[    8.237826] kernel:  amdgpu_dm_commit_planes+0x5bf/0x1670 [amdgpu]
[    8.237831] kernel:  amdgpu_dm_atomic_commit_tail+0xa32/0x1230 [amdgpu]
[    8.237836] kernel:  commit_tail+0xc6/0x1b0
[    8.237842] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237847] kernel:  drm_atomic_helper_commit+0x132/0x160
[    8.237854] kernel:  drm_atomic_commit+0xab/0xf0
[    8.237859] kernel:  ? __pfx___drm_printfn_info+0x10/0x10
[    8.237864] kernel:  drm_client_modeset_commit_atomic+0x200/0x240
[    8.237872] kernel:  drm_client_modeset_commit_locked+0x5b/0x170
[    8.237877] kernel:  ? mutex_lock+0x12/0x50
[    8.237882] kernel:  drm_client_modeset_commit+0x26/0x50
[    8.237889] kernel:  __drm_fb_helper_restore_fbdev_mode_unlocked+0xc2/0x100
[    8.237895] kernel:  drm_fb_helper_lastclose+0x17/0x30
[    8.237900] kernel:  drm_fbdev_client_restore+0x11/0x20
[    8.237905] kernel:  drm_client_dev_restore+0x78/0xe0
[    8.237911] kernel:  drm_release+0x113/0x130
[    8.237916] kernel:  __fput+0xea/0x2d0
[    8.237923] kernel:  __fput_sync+0x59/0x80
[    8.237929] kernel:  __x64_sys_close+0x3d/0x90
[    8.237934] kernel:  x64_sys_call+0x1786/0x2310
[    8.237939] kernel:  do_syscall_64+0x7e/0x170
[    8.237945] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237952] kernel:  ? handle_pte_fault+0x1db/0x200
[    8.237957] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237962] kernel:  ? __handle_mm_fault+0x3d2/0x7a0
[    8.237969] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237974] kernel:  ? __count_memcg_events+0xd3/0x1a0
[    8.237981] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237987] kernel:  ? count_memcg_events.constprop.0+0x2a/0x50
[    8.237992] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.237997] kernel:  ? handle_mm_fault+0x1bb/0x2d0
[    8.238008] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.238013] kernel:  ? do_user_addr_fault+0x5e9/0x7e0
[    8.238018] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.238026] kernel:  ? arch_exit_to_user_mode_prepare.isra.0+0x22/0xd0
[    8.238032] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.238037] kernel:  ? irqentry_exit_to_user_mode+0x2d/0x1d0
[    8.238045] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.238050] kernel:  ? irqentry_exit+0x43/0x50
[    8.238055] kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
[    8.238060] kernel:  ? exc_page_fault+0x96/0x1e0
[    8.238066] kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
[    8.238073] kernel: RIP: 0033:0x7eb9c26eafe2
[    8.238081] kernel: Code: 08 0f 85 71 3a ff ff 49 89 fb 48 89 f0 48 89 d7 48 
89 ce 4c 89 c2 4d 89 ca 4c 8b 44 24 08 4c 8b 4c 24 10 4c 89 5c 24 08 0f 05 <c3> 
66 2e 0f 1f 84 00 00 00 00 00 66 2e 0f 1f 84 00 00 00 00 00 66
[    8.238092] kernel: RSP: 002b:00007ffc5d2c67e8 EFLAGS: 00000246 ORIG_RAX: 
0000000000000003
[    8.238100] kernel: RAX: ffffffffffffffda RBX: 00007eb9c247d1c0 RCX: 
00007eb9c26eafe2
[    8.238105] kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 
000000000000000d
[    8.238115] kernel: RBP: 00007ffc5d2c6810 R08: 0000000000000000 R09: 
0000000000000000
[    8.238120] kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 
00007ffc5d2c6950
[    8.238126] kernel: R13: 00007eb9c247d148 R14: 000059ebebb99c80 R15: 
00007eb9c247d148
[    8.238131] kernel:  </TASK>
[    8.238136] kernel: ---[ end trace ]---

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-hwe-6.14 in Ubuntu.
https://bugs.launchpad.net/bugs/2126854

Title:
  AMDGPU firmware load timeout and system freeze on kernel 6.14.0-33 (RX
  9060 XT, RDNA3)

Status in linux-hwe-6.14 package in Ubuntu:
  Incomplete

Bug description:
  All the information is directly from my PC although I am not very
  experience in this kind bug reporting and I want to be as helpful as
  possible. The information that follows was only organized by chatGPT
  for me so maybe whoever reads this can understand it better. If I can
  help with anymore info or testing please let me know.

  
  AMDGPU Regression Report — Linux Mint (Ubuntu 24.04 Base)

  Summary:
  System hard-locks shortly after boot or during normal operation when running 
kernel 6.14.0-33-generic. 
  Issue is fully resolved when downgrading to 6.14.0-32-generic, indicating a 
regression in the newer kernel’s AMDGPU module or firmware handling.

  System Information:
    Distro:          Linux Mint 21.3 (Ubuntu 24.04 base)
    Working Kernel:  6.14.0-32-generic
    Failing Kernel:  6.14.0-33-generic
    GPU:             AMD Radeon RX 9060 XT (gfx1200, RDNA3)
    Driver:          amdgpu (in-kernel)
    linux-firmware:  20240318.git3b128b60-0ubuntu2.17

  Problem Description:
    • On kernel 6.14.0-33, the system freezes completely (no mouse, keyboard, 
or SSH access) within minutes of login.
    • One freeze occurred during boot, several others randomly after boot.
    • Rebooting into kernel 6.14.0-32 restores full stability.
    • Reinstalling linux-firmware did not resolve the issue.

  Relevant Log Excerpts (from journalctl -b -1 on failing kernel):
    amdgpu 0000:0e:00.0: amdgpu: Failed to load firmware 
"amdgpu/gfx1200_mec2.bin"
    amdgpu 0000:0e:00.0: amdgpu: [gfxhub] timeout 0x00000010
    amdgpu 0000:0e:00.0: amdgpu: Fatal error during GPU init
    WARNING: CPU: 8 PID: 202 at amdgpu_irq_put+0x9c/0xb0 [amdgpu]
    UBSAN: array-index-out-of-bounds in dml2_core_dcn4_calcs.c
    I/O error, dev sda, sector 5992552
    watchdog: task blocked for more than 122 seconds

  Steps to Reproduce:
    1. Boot kernel 6.14.0-33-generic with AMD RX 9060 XT.
    2. Log into desktop and wait a few minutes or open GPU-accelerated 
applications.
    3. System locks up completely (no recovery except power cycle).
    4. Boot into 6.14.0-32-generic → no freezes.

  Expected vs Actual:
    Expected: GPU initializes normally, stable desktop operation.
    Actual: GPU firmware load timeout (-110), kernel bug in amdgpu display 
code, and full system hang.

  Workaround:
    Booting kernel 6.14.0-32-generic avoids the problem entirely.

  Notes:
    The issue persists even after reinstalling the linux-firmware package and 
regenerating initramfs. 
    Appears to be a regression in amdgpu initialization code introduced between 
-32 and -33.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-hwe-6.14/+bug/2126854/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to