Public bug reported:
Description: Ubuntu 20.04.2 LTS
Release: 20.04
Been suddenly seeing a number of crashes today on my threadripper 2950x
box today after the system being off over the weekend.
Suspect it may be tied to Ubuntu 5.4.0-80.90-generic 5.4.124 kernel, as
I wasn't seeing it last week or previously.
Aug 2 16:52:14 threadripper kernel: [ 600.168436] watchdog: BUG: soft lockup
- CPU#19 stuck for 22s! [kworker/19:0:11301]
Aug 2 16:52:14 threadripper kernel: [ 600.168490] Modules linked in: veth
xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo iptable_nat nf
_nat br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath
scsi_dh_rdac scsi_dh_emc scsi_dh_alua snd_hda_codec_realtek
snd_hda_codec_generic
ledtrig_audio snd_hda_codec_hdmi eeepc_wmi snd_hda_intel edac_mce_amd
snd_intel_dspcfg asus_wmi ftdi_sio snd_hda_codec kvm_amd usbserial
sparse_keymap snd_
hda_core kvm video wmi_bmof snd_hwdep snd_pcm snd_timer snd ccp soundcore
k10temp mac_hid nf_log_ipv6 ip6t_REJECT nf_reject_ipv6 xt_hl ip6t_rt
nf_log_ipv4
nf_log_common ipt_REJECT nf_reject_ipv4 xt_LOG xt_limit xt_addrtype
sch_fq_codel xt_tcpudp xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4
ip6table
_filter ip6_tables iptable_filter bpfilter ip_tables x_tables autofs4 btrfs
zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor
async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid
hid uas usb_storage amdgpu
Aug 2 16:52:14 threadripper kernel: [ 600.168542] amd_iommu_v2 gpu_sched
crct10dif_pclmul ttm crc32_pclmul ghash_clmulni_intel drm_kms_helper syscopyare
a aesni_intel crypto_simd mxm_wmi sysfillrect cryptd sysimgblt glue_helper
fb_sys_fops igb drm dca i2c_piix4 ahci i2c_algo_bit libahci gpio_amdpt wmi gpio_
generic
Aug 2 16:52:14 threadripper kernel: [ 600.168558] CPU: 19 PID: 11301 Comm:
kworker/19:0 Tainted: G L 5.4.0-80-generic #90-Ubuntu
Aug 2 16:52:14 threadripper kernel: [ 600.168559] Hardware name: System
manufacturer System Product Name/ROG STRIX X399-E GAMING, BIOS 1203 10/09/2019
Aug 2 16:52:14 threadripper kernel: [ 600.168569] Workqueue: events free_work
Aug 2 16:52:14 threadripper kernel: [ 600.168574] RIP:
0010:smp_call_function_many+0x205/0x270
Aug 2 16:52:14 threadripper kernel: [ 600.168576] Code: e8 50 10 92 00 3b 05
ae cf 70 01 89 c7 0f 83 9b fe ff ff 48 63 c7 48 8b 0b 48 03 0c c5 80 99 64 a
1 8b 41 18 a8 01 74 0a f3 90 <8b> 51 18 83 e2 01 75 f6 eb c8 89 cf 48 c7 c2 a0
b8 a4 a1 4c 89 fe
Aug 2 16:52:14 threadripper kernel: [ 600.168577] RSP: 0018:ffffb66b0aa17d00
EFLAGS: 00000202 ORIG_RAX: ffffffffffffff13
Aug 2 16:52:14 threadripper kernel: [ 600.168579] RAX: 0000000000000003 RBX:
ffff8de1fd4ebd40 RCX: ffff8de1fd0b2540
Aug 2 16:52:14 threadripper kernel: [ 600.168580] RDX: 0000000000000001 RSI:
0000000000000000 RDI: 0000000000000002
Aug 2 16:52:14 threadripper kernel: [ 600.168580] RBP: ffffb66b0aa17d40 R08:
ffff8de1f6da7190 R09: 0000000000000003
Aug 2 16:52:14 threadripper kernel: [ 600.168581] R10: ffff8de1f6da7190 R11:
0000000000000002 R12: ffffffffa0281930
Aug 2 16:52:14 threadripper kernel: [ 600.168581] R13: 0000000000000000 R14:
0000000000000001 R15: 0000000000000080
Aug 2 16:52:14 threadripper kernel: [ 600.168583] FS: 0000000000000000(0000)
GS:ffff8de1fd4c0000(0000) knlGS:0000000000000000
Aug 2 16:52:14 threadripper kernel: [ 600.168583] CS: 0010 DS: 0000 ES: 0000
CR0: 0000000080050033
Aug 2 16:52:14 threadripper kernel: [ 600.168584] CR2: 000055ea29edefd0 CR3:
00000009c500a000 CR4: 00000000003406e0
Aug 2 16:52:14 threadripper kernel: [ 600.168585] Call Trace:
Aug 2 16:52:14 threadripper kernel: [ 600.168592] ? load_new_mm_cr3+0xf0/0xf0
Aug 2 16:52:14 threadripper kernel: [ 600.168594] on_each_cpu+0x2d/0x60
Aug 2 16:52:14 threadripper kernel: [ 600.168596]
flush_tlb_kernel_range+0x38/0x90
Aug 2 16:52:14 threadripper kernel: [ 600.168597]
__purge_vmap_area_lazy+0x70/0x6d0
Aug 2 16:52:14 threadripper kernel: [ 600.168598]
free_vmap_area_noflush+0xe1/0xf0
Aug 2 16:52:14 threadripper kernel: [ 600.168600] remove_vm_area+0x9a/0xb0
Aug 2 16:52:14 threadripper kernel: [ 600.168602] __vunmap+0x5f/0x210
Aug 2 16:52:14 threadripper kernel: [ 600.168603] free_work+0x25/0x30
Aug 2 16:52:14 threadripper kernel: [ 600.168607]
process_one_work+0x1eb/0x3b0
Aug 2 16:52:14 threadripper kernel: [ 600.168609] worker_thread+0x4d/0x400
Aug 2 16:52:14 threadripper kernel: [ 600.168611] kthread+0x104/0x140
Aug 2 16:52:14 threadripper kernel: [ 600.168612] ?
process_one_work+0x3b0/0x3b0
Aug 2 16:52:14 threadripper kernel: [ 600.168613] ? kthread_park+0x90/0x90
Aug 2 16:52:14 threadripper kernel: [ 600.168617] ret_from_fork+0x22/0x40
Aug 2 16:52:40 threadripper kernel: [ 606.280524] rcu: INFO: rcu_sched
detected stalls on CPUs/tasks:
Aug 2 16:52:40 threadripper kernel: [ 606.280567] rcu: 2-...0: (1 GPs
behind) idle=ae6/1/0x4000000000000000 softirq=26910/26911 fqs=7179
Aug 2 16:52:40 threadripper kernel: [ 606.280609] rcu: 18-...0: (1 GPs
behind) idle=c8e/1/0x4000000000000000 softirq=28056/28057 fqs=7179
Aug 2 16:52:40 threadripper kernel: [ 606.280659] (detected by 24,
t=15002 jiffies, g=39017, q=5149545)
Aug 2 16:52:40 threadripper kernel: [ 606.280661] Sending NMI from CPU 24 to
CPUs 2:
Aug 2 16:52:40 threadripper kernel: [ 616.204803] Sending NMI from CPU 24 to
CPUs 18:
Aug 2 16:52:40 threadripper kernel: [ 626.131497] rcu: rcu_sched kthread
starved for 4960 jiffies! g39017 f0x2 RCU_GP_DOING_FQS(6) ->state=0x0 ->cpu=7
Aug 2 16:52:40 threadripper kernel: [ 626.131554] rcu: RCU grace-period
kthread stack dump:
Aug 2 16:52:40 threadripper kernel: [ 626.131577] rcu_sched R running
task 0 11 2 0x80004000
Aug 2 16:52:40 threadripper kernel: [ 626.131580] Call Trace:
Aug 2 16:52:40 threadripper kernel: [ 626.131589] __schedule+0x2e3/0x740
Aug 2 16:52:40 threadripper kernel: [ 626.131592]
preempt_schedule_common+0x18/0x30
Aug 2 16:52:40 threadripper kernel: [ 626.131594] _cond_resched+0x22/0x30
Aug 2 16:52:40 threadripper kernel: [ 626.131597] force_qs_rnp+0xa8/0x170
Aug 2 16:52:40 threadripper kernel: [ 626.131598] ?
synchronize_sched_expedited_wait+0x180/0x180
Aug 2 16:52:40 threadripper kernel: [ 626.131600] rcu_gp_kthread+0x5e8/0x990
Aug 2 16:52:40 threadripper kernel: [ 626.131604] kthread+0x104/0x140
Aug 2 16:52:40 threadripper kernel: [ 626.131605] ? kfree_call_rcu+0x20/0x20
Aug 2 16:52:40 threadripper kernel: [ 626.131607] ? kthread_park+0x90/0x90
Aug 2 16:52:40 threadripper kernel: [ 626.131608] ret_from_fork+0x22/0x40
ProblemType: Bug
DistroRelease: Ubuntu 20.04
Package: linux-image-5.4.0-80-generic 5.4.0-80.90
ProcVersionSignature: Ubuntu 5.4.0-80.90-generic 5.4.124
Uname: Linux 5.4.0-80-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version k5.4.0-80-generic.
ApportVersion: 2.20.11-0ubuntu27.18
Architecture: amd64
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/by-path',
'/dev/snd/controlC0', '/dev/snd/hwC0D0', '/dev/snd/pcmC0D2c',
'/dev/snd/pcmC0D1p', '/dev/snd/pcmC0D0c', '/dev/snd/pcmC0D0p',
'/dev/snd/controlC1', '/dev/snd/hwC1D0', '/dev/snd/pcmC1D7p',
'/dev/snd/pcmC1D3p', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
Card0.Amixer.info:
Card hw:0 'Generic'/'HD-Audio Generic at 0xba600000 irq 96'
Mixer name : 'Realtek ALC1220'
Components : 'HDA:10ec1168,10438723,00100003'
Controls : 46
Simple ctrls : 20
Card1.Amixer.info:
Card hw:1 'HDMI'/'HDA ATI HDMI at 0x9f860000 irq 98'
Mixer name : 'ATI R6xx HDMI'
Components : 'HDA:1002aa01,00aa0100,00100700'
Controls : 14
Simple ctrls : 2
CasperMD5CheckResult: skip
Date: Mon Aug 2 19:09:24 2021
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
MachineType: System manufacturer System Product Name
ProcEnviron:
TERM=screen.xterm-256color
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.4.0-80-generic
root=UUID=04417339-7685-11e9-bdb0-049226da3a81 ro pci=nommconf consoleblank=60
RelatedPackageVersions:
linux-restricted-modules-5.4.0-80-generic N/A
linux-backports-modules-5.4.0-80-generic N/A
linux-firmware 1.187.15
RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
SourcePackage: linux
UpgradeStatus: Upgraded to focal on 2021-01-23 (191 days ago)
dmi.bios.date: 10/09/2019
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 1203
dmi.board.asset.tag: Default string
dmi.board.name: ROG STRIX X399-E GAMING
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: Rev 1.xx
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 3
dmi.chassis.vendor: Default string
dmi.chassis.version: Default string
dmi.modalias:
dmi:bvnAmericanMegatrendsInc.:bvr1203:bd10/09/2019:svnSystemmanufacturer:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnROGSTRIXX399-EGAMING:rvrRev1.xx:cvnDefaultstring:ct3:cvrDefaultstring:
dmi.product.family: To be filled by O.E.M.
dmi.product.name: System Product Name
dmi.product.sku: SKU
dmi.product.version: System Version
dmi.sys.vendor: System manufacturer
** Affects: linux (Ubuntu)
Importance: Undecided
Status: New
** Tags: amd64 apport-bug focal third-party-packages uec-images
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1938722
Title:
watchdog: BUG: soft lockup on Threadripper 2950X
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1938722/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs