https://bugs.freedesktop.org/show_bug.cgi?id=106670
Bug ID: 106670
Summary: AMD GPU Error, random lockup, Ryzen 2500U Vega 8 GPU
Product: Mesa
Version: 18.0
Hardware: Other
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: Mesa core
Assignee: mesa-dev@lists.freedesktop.org
Reporter: jvdeli...@charter.net
QA Contact: mesa-dev@lists.freedesktop.org
I am monitoring HP Laptop via ssh to try to catch a lockup propblem. I am not
sure which component to select for the bug reports. Here is output from dmesg.
This while running glxgear. It did not lockup yet, but spotted this first. I
will post more as I find. Please advise other info needed. [aside: the PCI Bus
Error I think is unrelated but included so others can discern]
[ 270.207119] pcieport :00:01.7: AER: Multiple Corrected error received:
id=0008
[ 270.207136] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[ 270.207144] pcieport :00:01.7: device [1022:15d3] error
status/mask=1000/6000
[ 270.207149] pcieport :00:01.7:[12] Replay Timer Timeout
[ 397.899405] pcieport :00:01.7: AER: Multiple Corrected error received:
id=0008
[ 397.899426] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[ 397.899434] pcieport :00:01.7: device [1022:15d3] error
status/mask=1000/6000
[ 397.899439] pcieport :00:01.7:[12] Replay Timer Timeout
[ 793.776505] pcieport :00:01.7: AER: Multiple Corrected error received:
id=0008
[ 793.776524] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[ 793.776532] pcieport :00:01.7: device [1022:15d3] error
status/mask=1000/6000
[ 793.776537] pcieport :00:01.7:[12] Replay Timer Timeout
[ 797.012006] nf_conntrack: default automatic helper assignment has been
turned off for security reasons and CT-based firewall rule not found. Use the
iptables CT target to attach helpers instead.
[ 1079.061454] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1079.061469] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[ 1079.061478] pcieport :00:01.7: device [1022:15d3] error
status/mask=1000/6000
[ 1079.061483] pcieport :00:01.7:[12] Replay Timer Timeout
[ 1079.061489] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1079.061503] pcieport :00:01.7: can't find device of ID0008
[ 1145.211182] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1145.211196] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[ 1145.211214] pcieport :00:01.7: device [1022:15d3] error
status/mask=1000/6000
[ 1145.211220] pcieport :00:01.7:[12] Replay Timer Timeout
[ 1145.211229] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1145.211239] pcieport :00:01.7: can't find device of ID0008
[ 1350.594831] [drm:generic_reg_wait [amdgpu]] *ERROR* REG_WAIT timeout 1us *
10 tries - optc1_lock line:553
[ 1350.594955] WARNING: CPU: 3 PID: 1828 at
drivers/gpu/drm/amd/amdgpu/../display/dc/dc_helper.c:195
generic_reg_wait+0xf3/0x170 [amdgpu]
[ 1350.594956] Modules linked in: ccm fuse rfcomm xt_CHECKSUM ipt_MASQUERADE
nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT
ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack devlink ip_set nfnetlink
ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6
nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_raw ip6table_security
iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack
libcrc32c iptable_mangle cmac iptable_raw iptable_security ebtable_filter
ebtables ip6table_filter ip6_tables bnep sunrpc vfat fat arc4 r8822be(C) hp_wmi
sparse_keymap wmi_bmof edac_mce_amd kvm_amd ccp kvm snd_hda_codec_realtek
snd_hda_codec_generic snd_hda_codec_hdmi mac80211 snd_hda_intel btusb irqbypass
btrtl crct10dif_pclmul crc32_pclmul btbcm
[ 1350.594989] btintel snd_hda_codec bluetooth hid_sensor_accel_3d
hid_sensor_incl_3d hid_sensor_gyro_3d ghash_clmulni_intel uvcvideo
hid_sensor_rotation hid_sensor_magn_3d snd_hda_core hid_sensor_trigger
hid_sensor_iio_common industrialio_triggered_buffer videobuf2_vmalloc
videobuf2_memops kfifo_buf videobuf2_v4l2 snd_hwdep videobuf2_common
industrialio snd_seq videodev cfg80211 snd_seq_device ecdh_generic joydev
snd_pcm media rtsx_pci_ms memstick rfkill snd_timer snd sp5100_tco soundcore
shpchp i2c_piix4 k10temp tpm_crb wmi tpm_tis hp_accel tpm_tis_core lis3lv02d
tpm i2c_scmi video hp_wireless input_polldev pinctrl_amd acpi_cpufreq amdkfd
hid_sensor_hub amd_iommu_v2 amdgpu hid_logitech_hidpp chash i2c_algo_bit
gpu_sched