Source: linux Version: Linux 6.1.0-9 Severity: important X-Debbugs-Cc: list-christ...@web.de
Dear Maintainer, Building a new system, I encounter a problem -reproducible- when the system comes under load. As soon as it is more than idle (e.g. a compile), I get the following exception, network goes down and SATA fails, rendering the system unusable. Trace from my logs: 2023-05-28T17:51:04.834075+02:00 diskstation kernel: [ 2220.049106] ------------[ cut here ]------------ 2023-05-28T17:51:04.846514+02:00 diskstation kernel: [ 2220.049110] NETDEV WATCHDOG: enp3s0 (r8169): transmit queue 0 timed out 2023-05-28T17:51:04.846519+02:00 diskstation kernel: [ 2220.049118] WARNING: CPU: 5 PID: 0 at net/sched/sch_generic.c:525 dev_watchdog+0x207/0x210 2023-05-28T17:51:04.846520+02:00 diskstation kernel: [ 2220.049123] Modules linked in: xt_nat veth eq3_char_loop(OE) rpi_rf_mod_led(OE) ledtrig_timer ledtrig_default_on xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter bridge stp llc overlay ip6t_rt nft_chain_nat nf_nat xt_set qrtr xt_tcpmss xt_tcpudp xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables ip_set_hash_ip ip_set nfnetlink binfmt_misc nls_ascii nls_cp437 vfat fat amdgpu iwlmvm btusb btrtl btbcm btintel btmtk bluetooth snd_hda_codec_realtek mac80211 snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi intel_rapl_msr intel_rapl_common snd_hda_intel gpu_sched edac_mce_amd libarc4 snd_intel_dspcfg drm_buddy snd_intel_sdw_acpi snd_hda_codec drm_display_helper iwlwifi snd_hda_core kvm_amd jitterentropy_rng cec snd_hwdep hb_rf_usb_2(OE) kvm rc_core cfg80211 generic_raw_uart(OE) snd_pcm drbg drm_ttm_helper irqbypass ttm ansi_cprng snd_timer rapl wmi_bmof ecdh_generic drm_kms_helper snd ccp pcspkr sp5100_tco rfkill i2c_algo_bit 2023-05-28T17:51:04.846522+02:00 diskstation kernel: [ 2220.049164] soundcore k10temp ecc rng_core watchdog joydev evdev acpi_cpufreq button sg nct6775 nct6775_core hwmon_vid drm msr fuse loop efi_pstore configfs efivarfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 btrfs blake2b_generic xor raid6_pq zstd_compress libcrc32c crc32c_generic dm_crypt dm_mod hid_generic usbhid hid sd_mod crc32_pclmul crc32c_intel ghash_clmulni_intel nvme sha512_ssse3 sha512_generic ahci nvme_core libahci xhci_pci t10_pi libata crc64_rocksoft_generic xhci_hcd aesni_intel r8169 crc64_rocksoft crc_t10dif realtek crct10dif_generic crypto_simd mdio_devres crct10dif_pclmul usbcore scsi_mod cryptd libphy crc64 i2c_piix4 crct10dif_common usb_common scsi_common video wmi gpio_amdpt gpio_generic 2023-05-28T17:51:04.846523+02:00 diskstation kernel: [ 2220.049196] CPU: 5 PID: 0 Comm: swapper/5 Tainted: G OE 6.1.0-9-amd64 #1 Debian 6.1.27-1 2023-05-28T17:51:04.846523+02:00 diskstation kernel: [ 2220.049199] Hardware name: To Be Filled By O.E.M. B550M-ITX/ac/B550M-ITX/ac, BIOS L2.62 01/31/2023 2023-05-28T17:51:04.846525+02:00 diskstation kernel: [ 2220.049200] RIP: 0010:dev_watchdog+0x207/0x210 2023-05-28T17:51:04.846525+02:00 diskstation kernel: [ 2220.049202] Code: 00 e9 40 ff ff ff 48 89 df c6 05 ff 5f 3d 01 01 e8 be 79 f9 ff 44 89 e9 48 89 de 48 c7 c7 c8 16 7b 87 48 89 c2 e8 09 d2 86 ff <0f> 0b e9 22 ff ff ff 66 90 0f 1f 44 00 00 55 53 48 89 fb 48 8b 6f 2023-05-28T17:51:04.846526+02:00 diskstation kernel: [ 2220.049203] RSP: 0018:ffffb1e2802b8e80 EFLAGS: 00010286 2023-05-28T17:51:04.846526+02:00 diskstation kernel: [ 2220.049204] RAX: 0000000000000000 RBX: ffff9ad241704000 RCX: 0000000000000000 2023-05-28T17:51:04.846527+02:00 diskstation kernel: [ 2220.049205] RDX: 0000000000000104 RSI: ffffffff8773fa66 RDI: 00000000ffffffff 2023-05-28T17:51:04.846527+02:00 diskstation kernel: [ 2220.049206] RBP: ffff9ad241704488 R08: 0000000000000000 R09: ffffb1e2802b8cf0 2023-05-28T17:51:04.846528+02:00 diskstation kernel: [ 2220.049207] R10: 0000000000000003 R11: ffff9ad97e27afe8 R12: ffff9ad2417043dc 2023-05-28T17:51:04.846529+02:00 diskstation kernel: [ 2220.049208] R13: 0000000000000000 R14: ffffffff86c2e7a0 R15: ffff9ad241704488 2023-05-28T17:51:04.846529+02:00 diskstation kernel: [ 2220.049209] FS: 0000000000000000(0000) GS:ffff9ad95e340000(0000) knlGS:0000000000000000 2023-05-28T17:51:04.846530+02:00 diskstation kernel: [ 2220.049210] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 2023-05-28T17:51:04.846530+02:00 diskstation kernel: [ 2220.049211] CR2: 00007f73a5a0b440 CR3: 00000001089fa000 CR4: 0000000000750ee0 2023-05-28T17:51:04.846531+02:00 diskstation kernel: [ 2220.049212] PKRU: 55555554 2023-05-28T17:51:04.846532+02:00 diskstation kernel: [ 2220.049213] Call Trace: 2023-05-28T17:51:04.846532+02:00 diskstation kernel: [ 2220.049214] <IRQ> 2023-05-28T17:51:04.846533+02:00 diskstation kernel: [ 2220.049217] ? pfifo_fast_reset+0x140/0x140 2023-05-28T17:51:04.846533+02:00 diskstation kernel: [ 2220.049219] call_timer_fn+0x27/0x130 2023-05-28T17:51:04.846534+02:00 diskstation kernel: [ 2220.049222] __run_timers+0x21c/0x2a0 2023-05-28T17:51:04.846534+02:00 diskstation kernel: [ 2220.049225] run_timer_softirq+0x2b/0x50 2023-05-28T17:51:04.846535+02:00 diskstation kernel: [ 2220.049226] __do_softirq+0xf0/0x2fe 2023-05-28T17:51:04.846535+02:00 diskstation kernel: [ 2220.049229] __irq_exit_rcu+0xc7/0x130 2023-05-28T17:51:04.846536+02:00 diskstation kernel: [ 2220.049232] sysvec_apic_timer_interrupt+0x9e/0xc0 2023-05-28T17:51:04.846536+02:00 diskstation kernel: [ 2220.049235] </IRQ> 2023-05-28T17:51:04.846537+02:00 diskstation kernel: [ 2220.049235] <TASK> 2023-05-28T17:51:04.846537+02:00 diskstation kernel: [ 2220.049236] asm_sysvec_apic_timer_interrupt+0x16/0x20 2023-05-28T17:51:04.846538+02:00 diskstation kernel: [ 2220.049237] RIP: 0010:mwait_idle+0x51/0x80 2023-05-28T17:51:04.846538+02:00 diskstation kernel: [ 2220.049240] Code: 31 d2 48 89 d1 65 48 8b 04 25 c0 fb 01 00 0f 01 c8 48 8b 00 a8 08 75 14 eb 07 0f 00 2d 88 22 5d 00 31 c0 48 89 c1 fb 0f 01 c9 <eb> 06 fb 0f 1f 44 00 00 65 48 8b 04 25 c0 fb 01 00 f0 80 60 02 df 2023-05-28T17:51:04.846539+02:00 diskstation kernel: [ 2220.049241] RSP: 0018:ffffb1e280103ee0 EFLAGS: 00000246 2023-05-28T17:51:04.846539+02:00 diskstation kernel: [ 2220.049242] RAX: 0000000000000000 RBX: ffff9ad2403d9980 RCX: 0000000000000000 2023-05-28T17:51:04.846540+02:00 diskstation kernel: [ 2220.049242] RDX: 0000000000000000 RSI: ffffffff8773fa66 RDI: ffffffff87718f95 2023-05-28T17:51:04.846540+02:00 diskstation kernel: [ 2220.049243] RBP: 0000000000000005 R08: 0000000000000002 R09: 0000000020e1db80 2023-05-28T17:51:04.846541+02:00 diskstation kernel: [ 2220.049244] R10: 0000000000000005 R11: 0000000000000001 R12: 0000000000000000 2023-05-28T17:51:04.846541+02:00 diskstation kernel: [ 2220.049245] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 2023-05-28T17:51:04.846542+02:00 diskstation kernel: [ 2220.049249] default_idle_call+0x36/0xf0 2023-05-28T17:51:04.846543+02:00 diskstation kernel: [ 2220.049252] do_idle+0x225/0x2b0 2023-05-28T17:51:04.846543+02:00 diskstation kernel: [ 2220.049255] cpu_startup_entry+0x19/0x20 2023-05-28T17:51:04.846544+02:00 diskstation kernel: [ 2220.049258] start_secondary+0x11a/0x140 2023-05-28T17:51:04.846544+02:00 diskstation kernel: [ 2220.049261] secondary_startup_64_no_verify+0xe5/0xeb 2023-05-28T17:51:04.846545+02:00 diskstation kernel: [ 2220.049266] </TASK> 2023-05-28T17:51:04.846545+02:00 diskstation kernel: [ 2220.049267] ---[ end trace 0000000000000000 ]--- Found similar errors in the net, recommending to deactivate tcp offloading. Tried it, with no change. -- System Information: Debian Release: 12.0 APT prefers testing-security APT policy: (500, 'testing-security'), (500, 'testing') Architecture: amd64 (x86_64) Kernel: Linux 6.1.0-9-amd64 (SMP w/12 CPU threads; PREEMPT) Kernel taint flags: TAINT_WARN, TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8), LANGUAGE not set Shell: /bin/sh linked to /usr/bin/dash Init: systemd (via /run/systemd/system) LSM: AppArmor: enabled