Control: tags -1 + upstream moreinfo

Hi Amit,

On Tue, Dec 16, 2025 at 11:26:26AM +0000, Amit Gurdasani wrote:
> Package: linux-image-6.17.11+deb14-amd64
> Version: 6.17.11-1
> Severity: important
> X-Debbugs-Cc: [email protected]
> User: [email protected]
> Usertags: amd64
> 
> Dear Maintainer,
> 
> I had my Debian testing desktop reboot after an unattended upgrade where the
> kernel was updated from linux-image-6.17.9+deb14-amd64 to
> linux-image-6.17.11+deb14-amd64. After reboot overnight, I found the machine
> unresponsive in the morning. Rebooting revealed many successive kernel oops
> in amdgpu. Oops text below.
> 
> The GPU in use is an AMD Radeon R9 270X ("Curacao XT"), from the GCN 1.0
> generation. I _am_ using the following argument on the kernel command line
> to gain some performance:
> 
> amdgpu.ppfeaturemask=0xffffffff
> 
> This kernel oops was not occurring in kernels up to and including 6.17.9
> (Debian-packaged).
> 
> I have not tried to boot 6.17.11 without the amdgpu.ppfeaturemask=0xffffffff
> kernel command-line argument to see if the oops still occurs.
> 
> I did find that there was some work done on amdgpu in November:
> 
> https://lists.freedesktop.org/archives/amd-gfx/2025-November/133356.html
> 
> I don't know enough to know whether that work could cause this kernel oops.
> 
> Downgrading back to 6.17.9 has eliminated the kernel oops.
> 
> Thanks,
> 
> Amit Gurdasani
> 
> Oops text:
> 
> 2025-12-16T10:12:43.249625+00:00 athena kernel: amdgpu 0000:01:00.0: 
> [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* IB test failed on uvd (-110).
> 2025-12-16T10:12:43.249625+00:00 athena kernel: amdgpu 0000:01:00.0: amdgpu: 
> ib ring test failed (-110).
> 2025-12-16T10:12:43.249627+00:00 athena kernel: BUG: kernel NULL pointer 
> dereference, address: 0000000000000000
> 2025-12-16T10:12:43.249627+00:00 athena kernel: #PF: supervisor instruction 
> fetch in kernel mode
> 2025-12-16T10:12:43.249627+00:00 athena kernel: #PF: error_code(0x0010) - 
> not-present page
> 2025-12-16T10:12:43.249627+00:00 athena kernel: PGD 0 P4D 0 
> 2025-12-16T10:12:43.249627+00:00 athena kernel: Oops: Oops: 0010 [#1] SMP 
> NOPTI
> 2025-12-16T10:12:43.249627+00:00 athena kernel: CPU: 2 UID: 0 PID: 564 Comm: 
> kworker/2:2 Tainted: G S   U              6.17.11+deb14-amd64 #1 
> PREEMPT(lazy)  Debian 6.17.11-1 
> 2025-12-16T10:12:43.249628+00:00 athena kernel: Tainted: [S]=CPU_OUT_OF_SPEC, 
> [U]=USER
> 2025-12-16T10:12:43.249629+00:00 athena kernel: Hardware name: Gigabyte 
> Technology Co., Ltd. Z690 UD DDR4/Z690 UD DDR4, BIOS F29 09/27/2024
> 2025-12-16T10:12:43.249629+00:00 athena kernel: Workqueue: events 
> amdgpu_tlb_fence_work [amdgpu]
> 2025-12-16T10:12:43.249629+00:00 athena kernel: RIP: 0010:0x0
> 2025-12-16T10:12:43.249630+00:00 athena kernel: Code: Unable to access opcode 
> bytes at 0xffffffffffffffd6.
> 2025-12-16T10:12:43.249630+00:00 athena kernel: RSP: 0018:ffffca9902dbfde0 
> EFLAGS: 00010246
> 2025-12-16T10:12:43.249630+00:00 athena kernel: RAX: 0000000000000000 RBX: 
> 0000000000008000 RCX: 0000000000000001
> 2025-12-16T10:12:43.249632+00:00 athena kernel: RDX: 0000000000000002 RSI: 
> 0000000000008000 RDI: ffff8a34a8a00000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: RBP: 0000000000000001 R08: 
> 0000000000000000 R09: 0000000000000001
> 2025-12-16T10:12:43.249633+00:00 athena kernel: R10: 0000000000000002 R11: 
> 0000000000000000 R12: 0000000000000000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: R13: 0000000000000002 R14: 
> 0000000000000000 R15: ffff8a34a8a00000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: FS:  0000000000000000(0000) 
> GS:ffff8a3c8a288000(0000) knlGS:0000000000000000
> 2025-12-16T10:12:43.249633+00:00 athena kernel: CS:  0010 DS: 0000 ES: 0000 
> CR0: 0000000080050033
> 2025-12-16T10:12:43.249635+00:00 athena kernel: CR2: ffffffffffffffd6 CR3: 
> 000000018082c000 CR4: 0000000000f50ef0
> 2025-12-16T10:12:43.249635+00:00 athena kernel: PKRU: 55555554
> 2025-12-16T10:12:43.249635+00:00 athena kernel: Call Trace:
> 2025-12-16T10:12:43.249635+00:00 athena kernel:  <TASK>
> 2025-12-16T10:12:43.249635+00:00 athena kernel:  
> amdgpu_gmc_flush_gpu_tlb_pasid+0xd6/0x400 [amdgpu]
> 2025-12-16T10:12:43.249635+00:00 athena kernel:  
> amdgpu_tlb_fence_work+0x6e/0xe0 [amdgpu]
> 2025-12-16T10:12:43.249636+00:00 athena kernel:  process_one_work+0x18f/0x350
> 2025-12-16T10:12:43.249638+00:00 athena kernel:  worker_thread+0x25a/0x3a0
> 2025-12-16T10:12:43.249638+00:00 athena kernel:  ? 
> __pfx_worker_thread+0x10/0x10
> 2025-12-16T10:12:43.249639+00:00 athena kernel:  kthread+0xf9/0x240
> 2025-12-16T10:12:43.249639+00:00 athena kernel:  ? __pfx_kthread+0x10/0x10
> 2025-12-16T10:12:43.249639+00:00 athena kernel:  ? __pfx_kthread+0x10/0x10
> 2025-12-16T10:12:43.249639+00:00 athena kernel:  ret_from_fork+0x194/0x1c0
> 2025-12-16T10:12:43.249640+00:00 athena kernel:  ? __pfx_kthread+0x10/0x10
> 2025-12-16T10:12:43.249641+00:00 athena kernel:  ret_from_fork_asm+0x1a/0x30
> 2025-12-16T10:12:43.249641+00:00 athena kernel:  </TASK>
> 2025-12-16T10:12:43.249641+00:00 athena kernel: Modules linked in: 
> dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio raid1 dm_raid raid456 
> async_raid6_recov async_memcpy async_pq async_xor async_tx md_mod xor 
> hid_generic usbhid hid raid6_pq amdgpu(+) dm_mod xe uas usb_storage 
> intel_vsec sr_mod drm_gpuvm configfs amdxcp cdrom sd_mod 
> drm_panel_backlight_quirks drm_gpusvm_helper gpu_sched crc16 i915 radeon 
> drm_ttm_helper drm_buddy ttm drm_exec drm_suballoc_helper i2c_algo_bit 
> drm_display_helper cec rc_core drm_client_lib ahci drm_kms_helper iTCO_wdt 
> intel_pmc_bxt libahci xhci_pci_renesas iTCO_vendor_support drm xhci_pci 
> libata watchdog mxm_wmi nvme xhci_hcd r8169 nvme_core realtek mdio_devres 
> scsi_mod usbcore libphy video nvme_keyring intel_lpss_pci i2c_i801 mdio_bus 
> intel_lpss nvme_auth wmi fan i2c_smbus scsi_common button pinctrl_alderlake 
> usb_common idma64 efivarfs
> 2025-12-16T10:12:43.249643+00:00 athena kernel: CR2: 0000000000000000
> 2025-12-16T10:12:43.249646+00:00 athena kernel: ---[ end trace 
> 0000000000000000 ]---
> 2025-12-16T10:12:43.249646+00:00 athena kernel: RIP: 0010:0x0
> 2025-12-16T10:12:43.249647+00:00 athena kernel: Code: Unable to access opcode 
> bytes at 0xffffffffffffffd6.
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RSP: 0018:ffffca9902dbfde0 
> EFLAGS: 00010246
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RAX: 0000000000000000 RBX: 
> 0000000000008000 RCX: 0000000000000001
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RDX: 0000000000000002 RSI: 
> 0000000000008000 RDI: ffff8a34a8a00000
> 2025-12-16T10:12:43.249647+00:00 athena kernel: RBP: 0000000000000001 R08: 
> 0000000000000000 R09: 0000000000000001
> 2025-12-16T10:12:43.249649+00:00 athena kernel: R10: 0000000000000002 R11: 
> 0000000000000000 R12: 0000000000000000
> 2025-12-16T10:12:43.249649+00:00 athena kernel: R13: 0000000000000002 R14: 
> 0000000000000000 R15: ffff8a34a8a00000
> 2025-12-16T10:12:43.249649+00:00 athena kernel: FS:  0000000000000000(0000) 
> GS:ffff8a3c8a288000(0000) knlGS:0000000000000000
> 2025-12-16T10:12:43.249649+00:00 athena kernel: CS:  0010 DS: 0000 ES: 0000 
> CR0: 0000000080050033
> 2025-12-16T10:12:43.249649+00:00 athena kernel: CR2: ffffffffffffffd6 CR3: 
> 000000018082c000 CR4: 0000000000f50ef0
> 2025-12-16T10:12:43.249649+00:00 athena kernel: PKRU: 55555554
> 2025-12-16T10:12:43.249651+00:00 athena kernel: note: kworker/2:2[564] exited 
> with irqs disabled

Thanks for the report. As this is a regression between the two
version, can you please bisect the changes to identify which commits
breaks. That would involve compiling and testing a few kernels:

    git clone --single-branch -b linux-6.17.y 
https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable.git
    cd linux-stable
    git checkout v6.17.9
    cp /boot/config-$(uname -r) .config
    yes '' | make localmodconfig
    make savedefconfig
    mv defconfig arch/x86/configs/my_defconfig

    # test 6.17.9 to ensure this is "good"
    make my_defconfig
    make -j $(nproc) bindeb-pkg
    ... install the resulting .deb package and confirm it successfully boots / 
problem does not exist

    # test 6.17.11 to ensure this is "bad"
    git checkout v6.12.35
    make my_defconfig
    make -j $(nproc) bindeb-pkg
    ... install the resulting .deb package and confirm it fails to boot / 
problem exists

With that confirmed, the bisection can start:

    git bisect start
    git bisect good v6.17.9
    git bisect bad v6.17.11

In each bisection step git checks out a state between the oldest
known-bad and the newest known-good commit. In each step test using:

    make my_defconfig
    make -j $(nproc) bindeb-pkg
    ... install, try to boot / verify if problem exists

and if the problem is hit run:

    git bisect bad

and if the problem doesn't trigger run:

    git bisect good

. Please pay attention to always select the just built kernel for
booting, it won't always be the default kernel picked up by grub.

Iterate until git announces to have identified the first bad commit.

Then provide the output of

    git bisect log

In the course of the bisection you might have to uninstall previous
kernels again to not exhaust the disk space in /boot. Also in the end
uninstall all self-built kernels again.

Regards,
Salvatore

Reply via email to