I can confirm this is happening on my fairly new Ubuntu Server 22.04.3
installation. The server isn't doing much: some Docker containers and a
TP-Link Omada installation. I have nulled out the suspend options.

Is anyone else also running Plex? After the soft lockup goes on for long
enough, I start getting these in my kernel log:

kern.log.1:Oct 18 03:43:24 nas kernel: [79118.116772] watchdog: BUG: soft lockup - CPU#8 stuck for 50271s! [PMS HttpClientC:311710]
kern.log.1:Oct 18 03:43:24 nas kernel: [79118.117632]  ? watchdog_timer_fn+0x1be/0x220
kern.log.1:Oct 18 03:43:28 nas kernel: [79122.128665] watchdog: BUG: soft lockup - CPU#13 stuck for 50272s! [PMS GTP:141334]
kern.log.1:Oct 18 03:43:28 nas kernel: [79122.129516]  ? watchdog_timer_fn+0x1be/0x220
kern.log.1:Oct 18 03:43:44 nas kernel: [79138.136234] watchdog: BUG: soft lockup - CPU#15 stuck for 50343s! [kworker/15:0:41010]
kern.log.1:Oct 18 03:43:44 nas kernel: [79138.137125]  ? watchdog_timer_fn+0x1be/0x220
kern.log.1:Oct 18 03:43:52 nas kernel: [79146.100025] watchdog: BUG: soft lockup - CPU#0 stuck for 50297s! [PMS HttpServer:9967]
kern.log.1:Oct 18 03:43:52 nas kernel: [79146.100894]  ? watchdog_timer_fn+0x1be/0x220
kern.log.1:Oct 18 03:43:52 nas kernel: [79146.108020] watchdog: BUG: soft lockup - CPU#4 stuck for 50297s! [PMS HttpServer:9968]
kern.log.1:Oct 18 03:43:52 nas kernel: [79146.108933]  ? watchdog_timer_fn+0x1be/0x220
kern.log.1:Oct 18 03:43:52 nas kernel: [79146.116019] watchdog: BUG: soft lockup - CPU#8 stuck for 50297s! [PMS HttpClientC:311710]
kern.log.1:Oct 18 03:43:52 nas kernel: [79146.116969]  ? watchdog_timer_fn+0x1be/0x220
kern.log.1:Oct 18 03:43:56 nas kernel: [79150.127912] watchdog: BUG: soft lockup - CPU#13 stuck for 50298s! [PMS GTP:141334]
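
For anyone comparing notes, here is a minimal Python sketch for summarizing
reports like the ones above by CPU and task. It assumes each kernel record
sits on a single line (mail line-wrapping undone). The regular expression and
the function name summarize_lockups are my own, not part of any tool. The
sample lines are copied from the log excerpt above.

```python
import re
from collections import defaultdict

# Matches e.g. "soft lockup - CPU#8 stuck for 50271s! [PMS HttpClientC:311710]".
# The task name may itself contain colons (kworker/15:0), so the greedy ".+"
# deliberately leaves only the final ":<digits>]" for the PID.
LOCKUP_RE = re.compile(
    r"soft lockup - CPU#(?P<cpu>\d+) stuck for (?P<secs>\d+)s! "
    r"\[(?P<comm>.+):(?P<pid>\d+)\]"
)


def summarize_lockups(lines):
    """Return {(cpu, comm, pid): longest reported 'stuck for' value in seconds}."""
    worst = defaultdict(int)
    for line in lines:
        m = LOCKUP_RE.search(line)
        if m:
            key = (int(m["cpu"]), m["comm"], int(m["pid"]))
            worst[key] = max(worst[key], int(m["secs"]))
    return dict(worst)


# Sample records taken from the excerpt above, one record per line.
sample = [
    "kern.log.1:Oct 18 03:43:24 nas kernel: [79118.116772] watchdog: BUG: soft lockup - CPU#8 stuck for 50271s! [PMS HttpClientC:311710]",
    "kern.log.1:Oct 18 03:43:24 nas kernel: [79118.117632]  ? watchdog_timer_fn+0x1be/0x220",
    "kern.log.1:Oct 18 03:43:44 nas kernel: [79138.136234] watchdog: BUG: soft lockup - CPU#15 stuck for 50343s! [kworker/15:0:41010]",
    "kern.log.1:Oct 18 03:43:52 nas kernel: [79146.116019] watchdog: BUG: soft lockup - CPU#8 stuck for 50297s! [PMS HttpClientC:311710]",
]

summary = summarize_lockups(sample)
for (cpu, comm, pid), secs in sorted(summary.items()):
    print(f"CPU#{cpu} {comm}[{pid}]: stuck up to {secs}s (~{secs / 3600:.1f} h)")
```

To run it against a real file, pass an open file object instead of the sample
list, for example summarize_lockups(open("/var/log/kern.log.1")); that path is
only a guess based on the file name shown in the excerpt.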

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1989521

Title:
  Ubuntu 22.04.1 CPU soft lockup occurs repeatedly

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  Hi all,

  Ubuntu Server 22.04.1 is freezing repeatedly with CPU soft lockups.
  The issue seems to have started in the last week; all packages are up
  to date. I've updated to the HWE kernel and rebooted several times,
  and it still happens. Hardware: 32 GB RAM, AMD 3600X CPU, Quadro
  RTX 4000 GPU.

  I caught the following in syslog:

  
  Sep 13 04:17:55 marcus-server kernel: [33687.436241] watchdog: BUG: soft lockup - CPU#2 stuck for 26s! [kworker/u64:17:154214]
  Sep 13 04:17:55 marcus-server kernel: [33687.436243] Modules linked in: tls xt_nat veth nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_counter nf_tables nfnetlink overlay bridge stp llc nvidia_drm(PO) snd_hda_codec_realtek intel_rapl_msr intel_rapl_common nvidia_modeset(PO) snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event zfs(PO) edac_mce_amd nls_iso8859_1 snd_rawmidi kvm_amd zunicode(PO) nvidia(PO) snd_seq kvm zzstd(O) zlua(O) zavl(PO) snd_seq_device icp(PO) rapl wmi_bmof snd_timer zcommon(PO) k10temp ccp ucsi_ccg znvpair(PO) snd typec_ucsi typec spl(O) soundcore apex(OE) gasket(OE) mac_hid sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua nct6775 hwmon_vid ipmi_devintf ipmi_msghandler msr parport_pc ppdev lp
  Sep 13 04:17:55 marcus-server kernel: [33687.436279]  parport ramoops reed_solomon pstore_blk pstore_zone mtd efi_pstore ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear nouveau mxm_wmi drm_ttm_helper ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec rc_core crct10dif_pclmul crc32_pclmul ghash_clmulni_intel drm aesni_intel video crypto_simd igb cryptd xhci_pci ahci dca i2c_piix4 i2c_nvidia_gpu arcmsr libahci xhci_pci_renesas i2c_algo_bit wmi
  Sep 13 04:17:55 marcus-server kernel: [33687.436302] CPU: 2 PID: 154214 Comm: kworker/u64:17 Tainted: P           OE     5.15.0-47-generic #51-Ubuntu
  Sep 13 04:17:55 marcus-server kernel: [33687.436303] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570 Phantom Gaming 4, BIOS P4.20 08/02/2021
  Sep 13 04:17:55 marcus-server kernel: [33687.436305] Workqueue: events_unbound async_run_entry_fn
  Sep 13 04:17:55 marcus-server kernel: [33687.436308] RIP: 0010:arcmsr_wait_firmware_ready+0xc1/0x140 [arcmsr]
  Sep 13 04:17:55 marcus-server kernel: [33687.436312] Code: e3 49 8b 94 24 48 08 00 00 b8 10 00 00 00 89 02 5b 41 5c 5d e9 b0 7b db e8 48 8b 47 50 4c 8d a0 bc 00 00 00 eb 0c 41 8b 04 24 <85> c0 0f 88 64 ff ff ff f6 83 81 00 00 00 01 75 eb bf 14 00 00 00
  Sep 13 04:17:55 marcus-server kernel: [33687.436313] RSP: 0018:ffffade8d136fd10 EFLAGS: 00000202
  Sep 13 04:17:55 marcus-server kernel: [33687.436314] RAX: 0000000000000000 RBX: ffff96720a460870 RCX: ffffade8c12b0034
  Sep 13 04:17:55 marcus-server kernel: [33687.436315] RDX: 000000000000000d RSI: ffff96721b53ef80 RDI: ffff96720a460870
  Sep 13 04:17:55 marcus-server kernel: [33687.436315] RBP: ffffade8d136fd20 R08: ffffffffffffffff R09: 0000000000000000
  Sep 13 04:17:55 marcus-server kernel: [33687.436316] R10: 0000000000000284 R11: ffffffffffffffff R12: ffffade8c12b00bc
  Sep 13 04:17:55 marcus-server kernel: [33687.436317] R13: 000000000000000d R14: ffff96720a460000 R15: ffff96720a460870
  Sep 13 04:17:55 marcus-server kernel: [33687.436318] FS:  0000000000000000(0000) GS:ffff96791ea80000(0000) knlGS:0000000000000000
  Sep 13 04:17:55 marcus-server kernel: [33687.436319] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  Sep 13 04:17:55 marcus-server kernel: [33687.436320] CR2: 0000000000000000 CR3: 00000007c8c10000 CR4: 0000000000350ee0
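
  When reading a trace like the one above, the RIP record shows which
  function the stuck CPU was executing and, in square brackets, the module
  it belongs to. The Python sketch below pulls those fields out of a RIP
  record. It is only an illustration: the regular expression and the name
  parse_rip are my own, and it assumes the record is on a single line. The
  sample string is copied from the trace above.

```python
import re

# Matches e.g. "RIP: 0010:arcmsr_wait_firmware_ready+0xc1/0x140 [arcmsr]".
# The trailing "[module]" part is optional; it is absent for code built
# into the kernel image.
RIP_RE = re.compile(
    r"RIP: [0-9a-f]{4}:(?P<symbol>[\w.]+)"
    r"\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?: \[(?P<module>\w+)\])?"
)


def parse_rip(line):
    """Return symbol, offset, size and module from a RIP record, or None."""
    m = RIP_RE.search(line)
    if m is None:
        return None
    return {
        "symbol": m["symbol"],
        "offset": int(m["offset"], 16),  # byte offset into the function
        "size": int(m["size"], 16),      # total size of the function in bytes
        "module": m["module"],           # None when no module is listed
    }


sample = (
    "Sep 13 04:17:55 marcus-server kernel: [33687.436308] "
    "RIP: 0010:arcmsr_wait_firmware_ready+0xc1/0x140 [arcmsr]"
)
rip = parse_rip(sample)
print(rip)
```

  For this trace the symbol is arcmsr_wait_firmware_ready in the arcmsr
  module, which also appears in the "Modules linked in" list. That only says
  where the CPU was spinning when the watchdog fired, not why.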

  
  It happens pretty often too, but the system isn't overloaded, so I'm not sure 
what is causing it. 

  Message from syslogd@marcus-server at Sep 14 02:16:11 ...
   kernel:[ 1276.914096] watchdog: BUG: soft lockup - CPU#8 stuck for 26s! [kworker/u64:28:252938]

  Message from syslogd@marcus-server at Sep 14 02:16:11 ...
   kernel:[ 1304.913956] watchdog: BUG: soft lockup - CPU#8 stuck for 52s! [kworker/u64:28:252938]

  Message from syslogd@marcus-server at Sep 14 02:37:47 ...
   kernel:[ 2569.382397] watchdog: BUG: soft lockup - CPU#3 stuck for 26s! [kworker/u64:27:743931]

  Message from syslogd@marcus-server at Sep 14 02:37:47 ...
   kernel:[ 2597.382461] watchdog: BUG: soft lockup - CPU#3 stuck for 53s! [kworker/u64:27:743931]

  
  I've also uploaded the apport file to this bug. Please let me know if
  anything else is needed to troubleshoot this issue.
  --- 
  ProblemType: Bug
  ApportVersion: 2.20.11-0ubuntu82.1
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC1:  lightdm    5077 F.... pulseaudio
   /dev/snd/controlC0:  lightdm    5077 F.... pulseaudio
  CasperMD5CheckResult: pass
  DistroRelease: Ubuntu 22.04
  InstallationDate: Installed on 2022-05-14 (127 days ago)
  InstallationMedia: Ubuntu-Server 20.04.4 LTS "Focal Fossa" - Release amd64 (20220223.1)
  MachineType: To Be Filled By O.E.M. To Be Filled By O.E.M.
  NonfreeKernelModules: nvidia_modeset zfs zunicode nvidia zavl icp zcommon znvpair
  Package: linux (not installed)
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 EFI VGA
  ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.15.0-47-generic root=/dev/mapper/ubuntu--vg-ubuntu--lv ro nomodeset
  ProcVersionSignature: Ubuntu 5.15.0-47.51-generic 5.15.46
  PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
  RelatedPackageVersions:
   linux-restricted-modules-5.15.0-47-generic N/A
   linux-backports-modules-5.15.0-47-generic  N/A
   linux-firmware                             20220329.git681281e4-0ubuntu3.5
  RfKill:
   
  Tags:  jammy uec-images
  Uname: Linux 5.15.0-47-generic x86_64
  UpgradeStatus: Upgraded to jammy on 2022-05-15 (126 days ago)
  UserGroups: N/A
  _MarkForUpload: True
  dmi.bios.date: 08/02/2021
  dmi.bios.release: 5.17
  dmi.bios.vendor: American Megatrends Inc.
  dmi.bios.version: P4.20
  dmi.board.name: X570 Phantom Gaming 4
  dmi.board.vendor: ASRock
  dmi.chassis.asset.tag: To Be Filled By O.E.M.
  dmi.chassis.type: 3
  dmi.chassis.vendor: To Be Filled By O.E.M.
  dmi.chassis.version: To Be Filled By O.E.M.
  dmi.modalias: 
dmi:bvnAmericanMegatrendsInc.:bvrP4.20:bd08/02/2021:br5.17:svnToBeFilledByO.E.M.:pnToBeFilledByO.E.M.:pvrToBeFilledByO.E.M.:rvnASRock:rnX570PhantomGaming4:rvr:cvnToBeFilledByO.E.M.:ct3:cvrToBeFilledByO.E.M.:skuToBeFilledByO.E.M.:
  dmi.product.family: To Be Filled By O.E.M.
  dmi.product.name: To Be Filled By O.E.M.
  dmi.product.sku: To Be Filled By O.E.M.
  dmi.product.version: To Be Filled By O.E.M.
  dmi.sys.vendor: To Be Filled By O.E.M.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1989521/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
