Having this issue on AMD Ryzen 7 7800X3D 8-Core Processor. iGPU is enabled.
[ 8.237617] kernel: ------------[ cut here ]------------ [ 8.237636] kernel: UBSAN: array-index-out-of-bounds in /build/linux-tazIIz/linux-6.14.0/drivers/gpu/drm/amd/amdgpu/../display/dc/dml2/dml21/src/dml2_core/dml2_core_dcn4_calcs.c:11590:83 [ 8.237652] kernel: index 4 is out of range for type 'double [2][8]' [ 8.237658] kernel: CPU: 3 UID: 0 PID: 558 Comm: plymouthd Kdump: loaded Not tainted 6.14.0-33-generic #33-Ubuntu [ 8.237664] kernel: Hardware name: ASUS System Product Name/ProArt X870E-CREATOR WIFI, BIOS 1203 03/04/2025 [ 8.237670] kernel: Call Trace: [ 8.237675] kernel: <TASK> [ 8.237681] kernel: show_stack+0x49/0x60 [ 8.237686] kernel: dump_stack_lvl+0x5f/0x90 [ 8.237696] kernel: dump_stack+0x10/0x18 [ 8.237701] kernel: ubsan_epilogue+0x9/0x40 [ 8.237706] kernel: __ubsan_handle_out_of_bounds.cold+0x44/0x49 [ 8.237712] kernel: dml_core_mode_programming.cold+0xfe5/0x1cfb [amdgpu] [ 8.237717] kernel: dml2_core_calcs_mode_programming_ex+0x2a/0x70 [amdgpu] [ 8.237723] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237728] kernel: ? dml2_core_calcs_mode_programming_ex+0x2a/0x70 [amdgpu] [ 8.237733] kernel: core_dcn4_mode_programming+0xdf/0xda0 [amdgpu] [ 8.237739] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237746] kernel: dml2_top_soc15_build_mode_programming+0x3f3/0x6a0 [amdgpu] [ 8.237751] kernel: dml2_build_mode_programming+0x1a/0x30 [amdgpu] [ 8.237757] kernel: dml21_mode_check_and_programming+0x13b/0x230 [amdgpu] [ 8.237762] kernel: dml21_validate+0x25/0x40 [amdgpu] [ 8.237767] kernel: dml2_validate+0xbf/0xe0 [amdgpu] [ 8.237773] kernel: dcn401_validate_bandwidth+0x4e/0x70 [amdgpu] [ 8.237778] kernel: update_planes_and_stream_state+0x26f/0x5d0 [amdgpu] [ 8.237783] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237789] kernel: update_planes_and_stream_v3+0x52/0x1e0 [amdgpu] [ 8.237820] kernel: dc_update_planes_and_stream+0x5d/0x110 [amdgpu] [ 8.237826] kernel: amdgpu_dm_commit_planes+0x5bf/0x1670 [amdgpu] [ 8.237831] kernel: amdgpu_dm_atomic_commit_tail+0xa32/0x1230 [amdgpu] [ 8.237836] kernel: commit_tail+0xc6/0x1b0 [ 8.237842] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237847] kernel: drm_atomic_helper_commit+0x132/0x160 [ 8.237854] kernel: drm_atomic_commit+0xab/0xf0 [ 8.237859] kernel: ? __pfx___drm_printfn_info+0x10/0x10 [ 8.237864] kernel: drm_client_modeset_commit_atomic+0x200/0x240 [ 8.237872] kernel: drm_client_modeset_commit_locked+0x5b/0x170 [ 8.237877] kernel: ? mutex_lock+0x12/0x50 [ 8.237882] kernel: drm_client_modeset_commit+0x26/0x50 [ 8.237889] kernel: __drm_fb_helper_restore_fbdev_mode_unlocked+0xc2/0x100 [ 8.237895] kernel: drm_fb_helper_lastclose+0x17/0x30 [ 8.237900] kernel: drm_fbdev_client_restore+0x11/0x20 [ 8.237905] kernel: drm_client_dev_restore+0x78/0xe0 [ 8.237911] kernel: drm_release+0x113/0x130 [ 8.237916] kernel: __fput+0xea/0x2d0 [ 8.237923] kernel: __fput_sync+0x59/0x80 [ 8.237929] kernel: __x64_sys_close+0x3d/0x90 [ 8.237934] kernel: x64_sys_call+0x1786/0x2310 [ 8.237939] kernel: do_syscall_64+0x7e/0x170 [ 8.237945] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237952] kernel: ? handle_pte_fault+0x1db/0x200 [ 8.237957] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237962] kernel: ? __handle_mm_fault+0x3d2/0x7a0 [ 8.237969] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237974] kernel: ? __count_memcg_events+0xd3/0x1a0 [ 8.237981] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237987] kernel: ? count_memcg_events.constprop.0+0x2a/0x50 [ 8.237992] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.237997] kernel: ? handle_mm_fault+0x1bb/0x2d0 [ 8.238008] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.238013] kernel: ? do_user_addr_fault+0x5e9/0x7e0 [ 8.238018] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.238026] kernel: ? arch_exit_to_user_mode_prepare.isra.0+0x22/0xd0 [ 8.238032] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.238037] kernel: ? irqentry_exit_to_user_mode+0x2d/0x1d0 [ 8.238045] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.238050] kernel: ? irqentry_exit+0x43/0x50 [ 8.238055] kernel: ? srso_alias_return_thunk+0x5/0xfbef5 [ 8.238060] kernel: ? exc_page_fault+0x96/0x1e0 [ 8.238066] kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e [ 8.238073] kernel: RIP: 0033:0x7eb9c26eafe2 [ 8.238081] kernel: Code: 08 0f 85 71 3a ff ff 49 89 fb 48 89 f0 48 89 d7 48 89 ce 4c 89 c2 4d 89 ca 4c 8b 44 24 08 4c 8b 4c 24 10 4c 89 5c 24 08 0f 05 <c3> 66 2e 0f 1f 84 00 00 00 00 00 66 2e 0f 1f 84 00 00 00 00 00 66 [ 8.238092] kernel: RSP: 002b:00007ffc5d2c67e8 EFLAGS: 00000246 ORIG_RAX: 0000000000000003 [ 8.238100] kernel: RAX: ffffffffffffffda RBX: 00007eb9c247d1c0 RCX: 00007eb9c26eafe2 [ 8.238105] kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 000000000000000d [ 8.238115] kernel: RBP: 00007ffc5d2c6810 R08: 0000000000000000 R09: 0000000000000000 [ 8.238120] kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 00007ffc5d2c6950 [ 8.238126] kernel: R13: 00007eb9c247d148 R14: 000059ebebb99c80 R15: 00007eb9c247d148 [ 8.238131] kernel: </TASK> [ 8.238136] kernel: ---[ end trace ]--- -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-hwe-6.14 in Ubuntu. https://bugs.launchpad.net/bugs/2126854 Title: AMDGPU firmware load timeout and system freeze on kernel 6.14.0-33 (RX 9060 XT, RDNA3) Status in linux-hwe-6.14 package in Ubuntu: Incomplete Bug description: All the information is directly from my PC although I am not very experience in this kind bug reporting and I want to be as helpful as possible. The information that follows was only organized by chatGPT for me so maybe whoever reads this can understand it better. If I can help with anymore info or testing please let me know. AMDGPU Regression Report — Linux Mint (Ubuntu 24.04 Base) Summary: System hard-locks shortly after boot or during normal operation when running kernel 6.14.0-33-generic. Issue is fully resolved when downgrading to 6.14.0-32-generic, indicating a regression in the newer kernel’s AMDGPU module or firmware handling. System Information: Distro: Linux Mint 21.3 (Ubuntu 24.04 base) Working Kernel: 6.14.0-32-generic Failing Kernel: 6.14.0-33-generic GPU: AMD Radeon RX 9060 XT (gfx1200, RDNA3) Driver: amdgpu (in-kernel) linux-firmware: 20240318.git3b128b60-0ubuntu2.17 Problem Description: • On kernel 6.14.0-33, the system freezes completely (no mouse, keyboard, or SSH access) within minutes of login. • One freeze occurred during boot, several others randomly after boot. • Rebooting into kernel 6.14.0-32 restores full stability. • Reinstalling linux-firmware did not resolve the issue. Relevant Log Excerpts (from journalctl -b -1 on failing kernel): amdgpu 0000:0e:00.0: amdgpu: Failed to load firmware "amdgpu/gfx1200_mec2.bin" amdgpu 0000:0e:00.0: amdgpu: [gfxhub] timeout 0x00000010 amdgpu 0000:0e:00.0: amdgpu: Fatal error during GPU init WARNING: CPU: 8 PID: 202 at amdgpu_irq_put+0x9c/0xb0 [amdgpu] UBSAN: array-index-out-of-bounds in dml2_core_dcn4_calcs.c I/O error, dev sda, sector 5992552 watchdog: task blocked for more than 122 seconds Steps to Reproduce: 1. Boot kernel 6.14.0-33-generic with AMD RX 9060 XT. 2. Log into desktop and wait a few minutes or open GPU-accelerated applications. 3. System locks up completely (no recovery except power cycle). 4. Boot into 6.14.0-32-generic → no freezes. Expected vs Actual: Expected: GPU initializes normally, stable desktop operation. Actual: GPU firmware load timeout (-110), kernel bug in amdgpu display code, and full system hang. Workaround: Booting kernel 6.14.0-32-generic avoids the problem entirely. Notes: The issue persists even after reinstalling the linux-firmware package and regenerating initramfs. Appears to be a regression in amdgpu initialization code introduced between -32 and -33. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux-hwe-6.14/+bug/2126854/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : [email protected] Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp

