[Mesa-dev] [Bug 106670] AMD GPU Error, random lockup, Ryzen 2500U Vega 8 GPU

2018-05-26 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=106670

Ernst Sjöstrand  changed:

   What|Removed |Added

Product|Mesa|DRI
  Component|Mesa core   |DRM/AMDgpu
   Assignee|mesa-dev@lists.freedesktop. |dri-devel@lists.freedesktop
   |org |.org
 QA Contact|mesa-dev@lists.freedesktop. |
   |org |
Version|18.0|unspecified

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are the QA Contact for the bug.___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/mesa-dev


[Mesa-dev] [Bug 106670] AMD GPU Error, random lockup, Ryzen 2500U Vega 8 GPU

2018-05-26 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=106670

Bug ID: 106670
   Summary: AMD GPU Error, random lockup, Ryzen 2500U Vega 8 GPU
   Product: Mesa
   Version: 18.0
  Hardware: Other
OS: All
Status: NEW
  Severity: normal
  Priority: medium
 Component: Mesa core
  Assignee: mesa-dev@lists.freedesktop.org
  Reporter: jvdeli...@charter.net
QA Contact: mesa-dev@lists.freedesktop.org

I am monitoring HP Laptop via ssh to try to catch a lockup propblem.  I am not
sure which component to select for the bug reports.  Here is output from dmesg.
This while running glxgear. It did not lockup yet, but spotted this first.  I
will post more as I find. Please advise other info needed. [aside: the PCI Bus
Error I think is unrelated but included so others can discern]

[  270.207119] pcieport :00:01.7: AER: Multiple Corrected error received:
id=0008
[  270.207136] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[  270.207144] pcieport :00:01.7:   device [1022:15d3] error
status/mask=1000/6000
[  270.207149] pcieport :00:01.7:[12] Replay Timer Timeout  
[  397.899405] pcieport :00:01.7: AER: Multiple Corrected error received:
id=0008
[  397.899426] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[  397.899434] pcieport :00:01.7:   device [1022:15d3] error
status/mask=1000/6000
[  397.899439] pcieport :00:01.7:[12] Replay Timer Timeout  
[  793.776505] pcieport :00:01.7: AER: Multiple Corrected error received:
id=0008
[  793.776524] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[  793.776532] pcieport :00:01.7:   device [1022:15d3] error
status/mask=1000/6000
[  793.776537] pcieport :00:01.7:[12] Replay Timer Timeout  
[  797.012006] nf_conntrack: default automatic helper assignment has been
turned off for security reasons and CT-based  firewall rule not found. Use the
iptables CT target to attach helpers instead.
[ 1079.061454] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1079.061469] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[ 1079.061478] pcieport :00:01.7:   device [1022:15d3] error
status/mask=1000/6000
[ 1079.061483] pcieport :00:01.7:[12] Replay Timer Timeout  
[ 1079.061489] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1079.061503] pcieport :00:01.7: can't find device of ID0008
[ 1145.211182] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1145.211196] pcieport :00:01.7: PCIe Bus Error: severity=Corrected,
type=Data Link Layer, id=000f(Transmitter ID)
[ 1145.211214] pcieport :00:01.7:   device [1022:15d3] error
status/mask=1000/6000
[ 1145.211220] pcieport :00:01.7:[12] Replay Timer Timeout  
[ 1145.211229] pcieport :00:01.7: AER: Corrected error received: id=0008
[ 1145.211239] pcieport :00:01.7: can't find device of ID0008
[ 1350.594831] [drm:generic_reg_wait [amdgpu]] *ERROR* REG_WAIT timeout 1us *
10 tries - optc1_lock line:553
[ 1350.594955] WARNING: CPU: 3 PID: 1828 at
drivers/gpu/drm/amd/amdgpu/../display/dc/dc_helper.c:195
generic_reg_wait+0xf3/0x170 [amdgpu]
[ 1350.594956] Modules linked in: ccm fuse rfcomm xt_CHECKSUM ipt_MASQUERADE
nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT
ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack devlink ip_set nfnetlink
ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6
nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_raw ip6table_security
iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack
libcrc32c iptable_mangle cmac iptable_raw iptable_security ebtable_filter
ebtables ip6table_filter ip6_tables bnep sunrpc vfat fat arc4 r8822be(C) hp_wmi
sparse_keymap wmi_bmof edac_mce_amd kvm_amd ccp kvm snd_hda_codec_realtek
snd_hda_codec_generic snd_hda_codec_hdmi mac80211 snd_hda_intel btusb irqbypass
btrtl crct10dif_pclmul crc32_pclmul btbcm
[ 1350.594989]  btintel snd_hda_codec bluetooth hid_sensor_accel_3d
hid_sensor_incl_3d hid_sensor_gyro_3d ghash_clmulni_intel uvcvideo
hid_sensor_rotation hid_sensor_magn_3d snd_hda_core hid_sensor_trigger
hid_sensor_iio_common industrialio_triggered_buffer videobuf2_vmalloc
videobuf2_memops kfifo_buf videobuf2_v4l2 snd_hwdep videobuf2_common
industrialio snd_seq videodev cfg80211 snd_seq_device ecdh_generic joydev
snd_pcm media rtsx_pci_ms memstick rfkill snd_timer snd sp5100_tco soundcore
shpchp i2c_piix4 k10temp tpm_crb wmi tpm_tis hp_accel tpm_tis_core lis3lv02d
tpm i2c_scmi video hp_wireless input_polldev pinctrl_amd acpi_cpufreq amdkfd
hid_sensor_hub amd_iommu_v2 amdgpu hid_logitech_hidpp chash i2c_algo_bit
gpu_sched