Public bug reported:
Please see upstream report at
https://gitlab.freedesktop.org/drm/nouveau/-/issues/405
On my ubuntu24.04 system, the following crash(?) is observed.
Linux: 6.8.0-50-generic
Ubuntu: 24.04.1 LTS
VGA compatible controller: NVIDIA Corporation AD104GLM [RTX 3500 Ada
Generation Laptop GPU]
[1285887.476968] ------------[ cut here ]------------
[1285887.476970] WARNING: CPU: 4 PID: 712001 at
drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:192
r535_gsp_cmdq_push+0x1d2/0x240 [nouveau]
[1285887.477036] Modules linked in: tcp_diag inet_diag xt_nat cpuid veth tls
vhost_net vhost vhost_iotlb tap rfcomm snd_seq_dummy snd_hrtimer
nf_conntrack_netlink xfrm_user xfrm_algo br_netfilter cmac algif_hash
algif_skcipher af_alg xt_CHECKSUM bridge stp llc overlay ip6t_REJECT
nf_reject_ipv6 nft_chain_nat qrtr xt_MASQUERADE nf_nat xt_hl ip6t_rt ipt_REJECT
nf_reject_ipv4 xt_LOG nf_log_syslog xt_multiport xt_comment nft_limit bnep
snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel
snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci
snd_sof_xtensa_dsp snd_sof xt_limit xt_addrtype snd_sof_utils xt_tcpudp
snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match xt_conntrack
snd_soc_acpi soundwire_generic_allocation nf_conntrack soundwire_bus r8153_ecm
nf_defrag_ipv6 nf_defrag_ipv4 snd_ctl_led cdc_ether snd_soc_core nft_compat
snd_hda_codec_realtek usbnet snd_compress snd_hda_codec_generic
snd_hda_codec_hdmi ac97_bus snd_pcm_dmaengine xe intel_uncore_frequency
snd_hda_intel
[1285887.477060] intel_uncore_frequency_common drm_suballoc_helper
snd_intel_dspcfg snd_intel_sdw_acpi nf_tables x86_pkg_temp_thermal
snd_hda_codec r8152 intel_powerclamp mii libcrc32c binfmt_misc snd_hda_core
nouveau snd_hwdep coretemp btusb nls_iso8859_1 btrtl snd_pcm iwlmvm dell_rbtn
btintel hid_sensor_custom_intel_hinge hid_sensor_gyro_3d hid_sensor_accel_3d
hid_sensor_als btbcm snd_seq_midi kvm_intel mxm_wmi mac80211 snd_seq_midi_event
drm_gpuvm btmtk hid_sensor_trigger i915 cmdlinepart dell_laptop snd_rawmidi
libarc4 kvm processor_thermal_device_pci drm_exec industrialio_triggered_buffer
spi_nor bluetooth processor_thermal_device dell_wmi gpu_sched snd_seq iwlwifi
kfifo_buf mtd irqbypass processor_thermal_wt_hint dell_smbios drm_buddy
drm_ttm_helper snd_seq_device hid_sensor_iio_common ecdh_generic intel_rapl_msr
dcdbas mei_pxp mei_hdcp snd_timer rapl dell_wmi_sysman intel_cstate
dell_wmi_ddv dell_smm_hwmon ledtrig_audio dell_wmi_descriptor
firmware_attributes_class ecc wmi_bmof spi_intel_pci i2c_i801 ttm
[1285887.477087] industrialio processor_thermal_rfim nvidia_wmi_ec_backlight
snd cfg80211 spi_intel i2c_smbus processor_thermal_rapl drm_display_helper
soundcore intel_rapl_common cec processor_thermal_wt_req
processor_thermal_power_floor intel_pmc_core joydev int3403_thermal
processor_thermal_mbox rc_core intel_hid int3400_thermal input_leds
pmt_telemetry i2c_algo_bit int340x_thermal_zone intel_vsec acpi_thermal_rel
pmt_class sparse_keymap acpi_tad mac_hid acpi_pad mei_me serio_raw mei
sch_fq_codel msr parport_pc ppdev lp parport efi_pstore nfnetlink dmi_sysfs
ip_tables x_tables autofs4 typec_displayport dm_crypt usbhid hid_sensor_custom
hid_sensor_hub intel_ishtp_hid nvme nvme_core nvme_auth hid_multitouch ahci
hid_generic libahci crct10dif_pclmul crc32_pclmul polyval_clmulni
polyval_generic ghash_clmulni_intel sha256_ssse3 rtsx_pci_sdmmc intel_lpss_pci
i2c_hid_acpi ucsi_acpi intel_lpss sha1_ssse3 video xhci_pci typec_ucsi
thunderbolt i2c_hid psmouse e1000e intel_ish_ipc rtsx_pci intel_ishtp idma64
vmd xhci_pci_renesas
[1285887.477114] typec hid wmi pinctrl_alderlake aesni_intel crypto_simd cryptd
[1285887.477117] CPU: 4 PID: 712001 Comm: fwupd Tainted: G W
6.8.0-50-generic #51-Ubuntu
[1285887.477119] Hardware name: Dell Inc. Precision 7780/0342YC, BIOS 1.17.0
10/04/2024
[1285887.477120] RIP: 0010:r535_gsp_cmdq_push+0x1d2/0x240 [nouveau]
[1285887.477168] Code: 31 d2 31 c9 31 f6 31 ff 45 31 c9 c3 cc cc cc cc ba 02 00
00 00 be 02 00 00 00 bf 01 00 00 00 e8 84 9a 5d e0 83 6d cc 01 75 2e <0f> 0b 48
8b 7d d0 e8 73 32 78 df b8 92 ff ff ff 48 83 c4 10 5b 41
[1285887.477169] RSP: 0018:ffffb5fc8fe6b808 EFLAGS: 00010246
[1285887.477170] RAX: 0000000000000000 RBX: 0000000000000000 RCX:
0000000000000000
[1285887.477170] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
0000000000000000
[1285887.477171] RBP: ffffb5fc8fe6b840 R08: 0000000000000000 R09:
0000000000000000
[1285887.477171] R10: 0000000000000000 R11: 0000000000000000 R12:
0000000000000030
[1285887.477172] R13: ffff8c80433ac000 R14: 0000000000001000 R15:
0000000000000000
[1285887.477172] FS: 0000766295cbdb80(0000) GS:ffff8c87bec00000(0000)
knlGS:0000000000000000
[1285887.477173] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[1285887.477174] CR2: 00007cd024f69020 CR3: 0000000115c56005 CR4:
0000000000f72ef0
[1285887.477175] DR0: 00007fffffffe1d8 DR1: 0000000000000000 DR2:
0000000000000000
[1285887.477175] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[1285887.477176] PKRU: 55555554
[1285887.477176] Call Trace:
[1285887.477177] <TASK>
[1285887.477179] ? show_regs+0x6d/0x80
[1285887.477182] ? __warn+0x89/0x160
[1285887.477183] ? r535_gsp_cmdq_push+0x1d2/0x240 [nouveau]
[1285887.477229] ? report_bug+0x17e/0x1b0
[1285887.477232] ? handle_bug+0x51/0xa0
[1285887.477234] ? exc_invalid_op+0x18/0x80
[1285887.477236] ? asm_exc_invalid_op+0x1b/0x20
[1285887.477238] ? r535_gsp_cmdq_push+0x1d2/0x240 [nouveau]
[1285887.477282] ? r535_gsp_cmdq_push+0x1cc/0x240 [nouveau]
[1285887.477326] r535_gsp_rpc_send+0x3b/0x140 [nouveau]
[1285887.477370] r535_gsp_rpc_push+0x156/0x160 [nouveau]
[1285887.477415] r535_gsp_rpc_rm_ctrl_push+0x44/0x140 [nouveau]
[1285887.477458] ? r535_gsp_rpc_rm_ctrl_get+0x3f/0xc0 [nouveau]
[1285887.477502] r535_dp_aux_xfer+0x137/0x330 [nouveau]
[1285887.477562] nvkm_uoutp_mthd+0x381/0x500 [nouveau]
[1285887.477621] nvkm_object_mthd+0x17/0x40 [nouveau]
[1285887.477651] nvkm_ioctl_mthd+0x5d/0xc0 [nouveau]
[1285887.477680] nvkm_ioctl+0x132/0x2b0 [nouveau]
[1285887.477709] nvkm_client_ioctl+0xe/0x20 [nouveau]
[1285887.477766] nvif_object_mthd+0xc6/0x260 [nouveau]
[1285887.477792] nvif_outp_dp_aux_xfer+0xac/0x220 [nouveau]
[1285887.477843] nouveau_connector_aux_xfer+0x5c/0xb0 [nouveau]
[1285887.477897] drm_dp_dpcd_access+0xbf/0x160 [drm_display_helper]
[1285887.477905] drm_dp_dpcd_probe+0x41/0x100 [drm_display_helper]
[1285887.477911] drm_dp_dpcd_read+0xe8/0x130 [drm_display_helper]
[1285887.477916] auxdev_read_iter+0x9b/0x1b0 [drm_display_helper]
[1285887.477922] vfs_read+0x25c/0x390
[1285887.477924] ksys_read+0x73/0x100
[1285887.477926] __x64_sys_read+0x19/0x30
[1285887.477927] x64_sys_call+0x1bf0/0x25a0
[1285887.477929] do_syscall_64+0x7f/0x180
[1285887.477930] ? vfs_read+0x2c7/0x390
[1285887.477931] ? vfs_read+0x2c7/0x390
[1285887.477933] ? __f_unlock_pos+0x12/0x20
[1285887.477935] ? ksys_read+0xe6/0x100
[1285887.477936] ? syscall_exit_to_user_mode+0x86/0x260
[1285887.477941] ? do_syscall_64+0x8c/0x180
[1285887.477942] ? do_syscall_64+0x8c/0x180
[1285887.477943] ? do_syscall_64+0x8c/0x180
[1285887.477944] ? do_syscall_64+0x8c/0x180
[1285887.477945] ? exc_page_fault+0x94/0x1b0
[1285887.477946] entry_SYSCALL_64_after_hwframe+0x78/0x80
[1285887.477948] RIP: 0033:0x76629771ba9a
[1285887.477977] Code: 55 48 89 e5 48 83 ec 20 48 89 55 e8 48 89 75 f0 89 7d f8
e8 b8 ca f7 ff 48 8b 55 e8 48 8b 75 f0 41 89 c0 8b 7d f8 31 c0 0f 05 <48> 3d 00
f0 ff ff 77 2e 44 89 c7 48 89 45 f8 e8 12 cb f7 ff 48 8b
[1285887.477978] RSP: 002b:00007fff4c0dc9f0 EFLAGS: 00000246 ORIG_RAX:
0000000000000000
[1285887.477979] RAX: ffffffffffffffda RBX: 00005e57f0408230 RCX:
000076629771ba9a
[1285887.477979] RDX: 000000000000000d RSI: 00005e57f04081f0 RDI:
000000000000000e
[1285887.477980] RBP: 00007fff4c0dca10 R08: 0000000000000000 R09:
0000000000000000
[1285887.477980] R10: 0000000000000000 R11: 0000000000000246 R12:
000000000000000d
[1285887.477981] R13: 00005e57f03b6a00 R14: 00005e57f0408280 R15:
000000000000000a
[1285887.477982] </TASK>
[1285887.477983] ---[ end trace 0000000000000000 ]---
I can also see
[1286296.035270] nouveau 0000:01:00.0: gsp: fini failed, -110
[1286296.035274] nouveau 0000:01:00.0: init failed with -110
[1286296.035275] nouveau: DRM-master:00000000:00000080: init failed with -110
[1286296.035319] nouveau: DRM-master:00000000:00000000: init failed with -110
[1286296.035338] nouveau 0000:01:00.0: DRM: Client resume failed with error:
-110
[1286296.035343] nouveau 0000:01:00.0: DRM: resume failed with: -110
** Affects: linux-signed (Ubuntu)
Importance: Undecided
Status: New
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2092849
Title:
Nouveau driver crashes all the time
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-signed/+bug/2092849/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs