Control: tags -1 + upstream moreinfo

Hi Amit,

On Tue, Dec 16, 2025 at 11:26:26AM +0000, Amit Gurdasani wrote:
> Package: linux-image-6.17.11+deb14-amd64
> Version: 6.17.11-1
> Severity: important
> X-Debbugs-Cc: [email protected]
> User: [email protected]
> Usertags: amd64
>
> Dear Maintainer,
>
> I had my Debian testing desktop reboot after an unattended upgrade where the
> kernel was updated from linux-image-6.17.9+deb14-amd64 to
> linux-image-6.17.11+deb14-amd64. After reboot overnight, I found the machine
> unresponsive in the morning. Rebooting revealed many successive kernel oops
> in amdgpu. Oops text below.
>
> The GPU in use is an AMD Radeon R9 270X ("Curacao XT"), from the GCN 1.0
> generation. I _am_ using the following argument on the kernel command line
> to gain some performance:
>
> amdgpu.ppfeaturemask=0xffffffff
>
> This kernel oops was not occurring in kernels up to and including 6.17.9
> (Debian-packaged).
>
> I have not tried to boot 6.17.11 without the amdgpu.ppfeaturemask=0xffffffff
> kernel command-line argument to see if the oops still occurs.
>
> I did find that there was some work done on amdgpu in November:
>
> https://lists.freedesktop.org/archives/amd-gfx/2025-November/133356.html
>
> I don't know enough to know whether that work could cause this kernel oops.
>
> Downgrading back to 6.17.9 has eliminated the kernel oops.
>
> Thanks,
>
> Amit Gurdasani
>
> Oops text:
>
> 2025-12-16T10:12:43.249625+00:00 athena kernel: amdgpu 0000:01:00.0:
> [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* IB test failed on uvd (-110).
> 2025-12-16T10:12:43.249625+00:00 athena kernel: amdgpu 0000:01:00.0: amdgpu:
> ib ring test failed (-110).
> 2025-12-16T10:12:43.249627+00:00 athena kernel: BUG: kernel NULL pointer
> dereference, address: 0000000000000000
> 2025-12-16T10:12:43.249627+00:00 athena kernel: #PF: supervisor instruction
> fetch in kernel mode
> 2025-12-16T10:12:43.249627+00:00 athena kernel: #PF: error_code(0x0010) -
> not-present page
> 2025-12-16T10:12:43.249627+00:00 athena kernel: PGD 0 P4D 0
> 2025-12-16T10:12:43.249627+00:00 athena kernel: Oops: Oops: 0010 [#1] SMP
> NOPTI
> 2025-12-16T10:12:43.249627+00:00 athena kernel: CPU: 2 UID: 0 PID: 564 Comm:
> kworker/2:2 Tainted: G S U 6.17.11+deb14-amd64 #1
> PREEMPT(lazy) Debian 6.17.11-1
> 2025-12-16T10:12:43.249628+00:00 athena kernel: Tainted: [S]=CPU_OUT_OF_SPEC,
> [U]=USER
> 2025-12-16T10:12:43.249629+00:00 athena kernel: Hardware name: Gigabyte
> Technology Co., Ltd. Z690 UD DDR4/Z690 UD DDR4, BIOS F29 09/27/2024
> 2025-12-16T10:12:43.249629+00:00 athena kernel: Workqueue: events
> amdgpu_tlb_fence_work [amdgpu]
> 2025-12-16T10:12:43.249629+00:00 athena kernel: RIP: 0010:0x0
> 2025-12-16T10:12:43.249630+00:00 athena kernel: Code: Unable to access opcode
> bytes at 0xffffffffffffffd6.
> 2025-12-16T10:12:43.249630+00:00 athena kernel: RSP: 0018:ffffca9902dbfde0
> EFLAGS: 00010246
> 2025-12-16T10:12:43.249630+00:00 athena kernel: RAX: 0000000000000000 RBX:
> 0000000000008000 RCX: 0000000000000001
> 2025-12-16T10:12:43.249632+00:00 athena kernel: RDX: 0000000000000002 RSI:
> 0000000000008000 RDI: ffff8a34a8a00000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: RBP: 0000000000000001 R08:
> 0000000000000000 R09: 0000000000000001
> 2025-12-16T10:12:43.249633+00:00 athena kernel: R10: 0000000000000002 R11:
> 0000000000000000 R12: 0000000000000000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: R13: 0000000000000002 R14:
> 0000000000000000 R15: ffff8a34a8a00000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: FS: 0000000000000000(0000)
> GS:ffff8a3c8a288000(0000) knlGS:0000000000000000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: CS: 0010 DS: 0000 ES: 0000
> CR0: 0000000080050033
> 2025-12-16T10:12:43.249635+00:00 athena kernel: CR2: ffffffffffffffd6 CR3:
> 000000018082c000 CR4: 0000000000f50ef0
> 2025-12-16T10:12:43.249635+00:00 athena kernel: PKRU: 55555554
> 2025-12-16T10:12:43.249635+00:00 athena kernel: Call Trace:
> 2025-12-16T10:12:43.249635+00:00 athena kernel: <TASK>
> 2025-12-16T10:12:43.249635+00:00 athena kernel:
> amdgpu_gmc_flush_gpu_tlb_pasid+0xd6/0x400 [amdgpu]
> 2025-12-16T10:12:43.249635+00:00 athena kernel:
> amdgpu_tlb_fence_work+0x6e/0xe0 [amdgpu]
> 2025-12-16T10:12:43.249636+00:00 athena kernel: process_one_work+0x18f/0x350
> 2025-12-16T10:12:43.249638+00:00 athena kernel: worker_thread+0x25a/0x3a0
> 2025-12-16T10:12:43.249638+00:00 athena kernel: ?
> __pfx_worker_thread+0x10/0x10
> 2025-12-16T10:12:43.249639+00:00 athena kernel: kthread+0xf9/0x240
> 2025-12-16T10:12:43.249639+00:00 athena kernel: ? __pfx_kthread+0x10/0x10
> 2025-12-16T10:12:43.249639+00:00 athena kernel: ? __pfx_kthread+0x10/0x10
> 2025-12-16T10:12:43.249639+00:00 athena kernel: ret_from_fork+0x194/0x1c0
> 2025-12-16T10:12:43.249640+00:00 athena kernel: ? __pfx_kthread+0x10/0x10
> 2025-12-16T10:12:43.249641+00:00 athena kernel: ret_from_fork_asm+0x1a/0x30
> 2025-12-16T10:12:43.249641+00:00 athena kernel: </TASK>
> 2025-12-16T10:12:43.249641+00:00 athena kernel: Modules linked in:
> dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio raid1 dm_raid raid456
> async_raid6_recov async_memcpy async_pq async_xor async_tx md_mod xor
> hid_generic usbhid hid raid6_pq amdgpu(+) dm_mod xe uas usb_storage
> intel_vsec sr_mod drm_gpuvm configfs amdxcp cdrom sd_mod
> drm_panel_backlight_quirks drm_gpusvm_helper gpu_sched crc16 i915 radeon
> drm_ttm_helper drm_buddy ttm drm_exec drm_suballoc_helper i2c_algo_bit
> drm_display_helper cec rc_core drm_client_lib ahci drm_kms_helper iTCO_wdt
> intel_pmc_bxt libahci xhci_pci_renesas iTCO_vendor_support drm xhci_pci
> libata watchdog mxm_wmi nvme xhci_hcd r8169 nvme_core realtek mdio_devres
> scsi_mod usbcore libphy video nvme_keyring intel_lpss_pci i2c_i801 mdio_bus
> intel_lpss nvme_auth wmi fan i2c_smbus scsi_common button pinctrl_alderlake
> usb_common idma64 efivarfs
> 2025-12-16T10:12:43.249643+00:00 athena kernel: CR2: 0000000000000000
> 2025-12-16T10:12:43.249646+00:00 athena kernel: ---[ end trace
> 0000000000000000 ]---
> 2025-12-16T10:12:43.249646+00:00 athena kernel: RIP: 0010:0x0
> 2025-12-16T10:12:43.249647+00:00 athena kernel: Code: Unable to access opcode
> bytes at 0xffffffffffffffd6.
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RSP: 0018:ffffca9902dbfde0
> EFLAGS: 00010246
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RAX: 0000000000000000 RBX:
> 0000000000008000 RCX: 0000000000000001
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RDX: 0000000000000002 RSI:
> 0000000000008000 RDI: ffff8a34a8a00000
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RBP: 0000000000000001 R08:
> 0000000000000000 R09: 0000000000000001
> 2025-12-16T10:12:43.249649+00:00 athena kernel: R10: 0000000000000002 R11:
> 0000000000000000 R12: 0000000000000000
> 2025-12-16T10:12:43.249649+00:00 athena kernel: R13: 0000000000000002 R14:
> 0000000000000000 R15: ffff8a34a8a00000
> 2025-12-16T10:12:43.249649+00:00 athena kernel: FS: 0000000000000000(0000)
> GS:ffff8a3c8a288000(0000) knlGS:0000000000000000
> 2025-12-16T10:12:43.249649+00:00 athena kernel: CS: 0010 DS: 0000 ES: 0000
> CR0: 0000000080050033
> 2025-12-16T10:12:43.249649+00:00 athena kernel: CR2: ffffffffffffffd6 CR3:
> 000000018082c000 CR4: 0000000000f50ef0
> 2025-12-16T10:12:43.249649+00:00 athena kernel: PKRU: 55555554
> 2025-12-16T10:12:43.249651+00:00 athena kernel: note: kworker/2:2[564] exited
> with irqs disabled

Thanks for the report. As this is a regression between the two versions,
can you please bisect the changes to identify which commit breaks it? That
would involve compiling and testing a few kernels:

    git clone --single-branch -b linux-6.17.y https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable.git
    cd linux-stable
    git checkout v6.17.9
    cp /boot/config-$(uname -r) .config
    yes '' | make localmodconfig
    make savedefconfig
    mv defconfig arch/x86/configs/my_defconfig

    # test 6.17.9 to ensure this is "good"
    make my_defconfig
    make -j $(nproc) bindeb-pkg
    ... install the resulting .deb package and confirm it successfully boots / problem does not exist

    # test 6.17.11 to ensure this is "bad"
    git checkout v6.17.11
    make my_defconfig
    make -j $(nproc) bindeb-pkg
    ... install the resulting .deb package and confirm it fails to boot / problem exists

With that confirmed, the bisection can start:

    git bisect start
    git bisect good v6.17.9
    git bisect bad v6.17.11

In each bisection step git checks out a state between the oldest
known-bad and the newest known-good commit. In each step, test using:

    make my_defconfig
    make -j $(nproc) bindeb-pkg
    ... install, try to boot / verify if problem exists

and if the problem is hit, run:

    git bisect bad

and if the problem doesn't trigger, run:

    git bisect good

Please make sure to always select the just-built kernel for booting; it
won't always be the default kernel picked up by grub.

Iterate until git announces that it has identified the first bad commit.

Then provide the output of

    git bisect log

In the course of the bisection you might have to uninstall previous
kernels again so as not to exhaust the disk space in /boot. Also, at the
end, uninstall all self-built kernels again.

Regards,
Salvatore
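
For convenience, a single bisection step from the instructions above could be wrapped roughly as follows. This is only a sketch: the function names `build_step` and `mark_step` and the `DRY_RUN` switch are made up for illustration and are not part of git or the kernel build system. Installing the resulting package and boot-testing it still has to be done by hand between the two calls.

```shell
#!/bin/sh
# Hypothetical helper for one bisection step; all names here are illustrative.
# DRY_RUN=1 (the default) only prints the commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '+ %s\n' "$*"
    else
        "$@"
    fi
}

# Build the currently checked-out bisection state as a .deb package,
# using the my_defconfig saved earlier.
build_step() {
    run make my_defconfig &&
    run make -j "$(nproc)" bindeb-pkg
}

# Record the verdict for the kernel that was just boot-tested.
# $1 is "good" (no oops) or "bad" (oops reproduced).
mark_step() {
    case "$1" in
        good|bad) run git bisect "$1" ;;
        *) echo "usage: mark_step good|bad" >&2; return 2 ;;
    esac
}
```

After sourcing this from within the linux-stable checkout with `DRY_RUN=0` set, `build_step` would build the current state, and `mark_step good` or `mark_step bad` would record the result once the kernel has been installed and tested.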

