Related to reducing the likelihood of a backlog of transmits at 1G, try:

Disable TSO - ethtool -K eth0 tso off
Reduce TX descriptors - ethtool -G eth0 tx 64

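If it helps to script the staged test, here is a minimal sketch. It only wraps the two ethtool invocations above; the helper names (staged_commands, apply_stage) and the dry-run default are illustrative assumptions, and the interface name is a parameter because the logs in this thread show the port as enp13s0 rather than eth0.

```python
import shlex
import subprocess


def staged_commands(iface, tx_ring=64):
    """Return the two test stages as lists of ethtool argument vectors."""
    tso_off = ["ethtool", "-K", iface, "tso", "off"]
    small_ring = ["ethtool", "-G", iface, "tx", str(tx_ring)]
    return {
        # Stage 1: disable TSO only.
        "stage1_tso_only": [tso_off],
        # Stage 2: disable TSO and also shrink the TX descriptor ring.
        "stage2_tso_and_ring": [tso_off, small_ring],
    }


def apply_stage(commands, dry_run=True):
    """Print each command; run it only when dry_run is False (needs root)."""
    for argv in commands:
        print(shlex.join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    stages = staged_commands("eth0")
    # Dry run: shows what stage 1 would execute without touching the NIC.
    apply_stage(stages["stage1_tso_only"])
```

Running one stage at a time and waiting for the hang to recur (or not) keeps the effect of each setting separable.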
I would try the first by itself, then try the two combined.

-- Jesse Brandeburg

> On Feb 4, 2021, at 7:55 AM, Fujinaka, Todd <todd.fujin...@intel.com> wrote:
>
> I checked with our performance team and I think the only thing we can see is
> the possible bottleneck with your link partner. We don't do any
> interoperability testing with Netgear, and are unaware if they have any
> equipment that isn't consumer-grade.
>
> I would suggest that you follow up with Ubuntu. They will contact us if they
> need any further help with the issue.
>
> Todd Fujinaka
> Software Application Engineer
> Data Center Group
> Intel Corporation
> todd.fujin...@intel.com
>
> -----Original Message-----
> From: Pekka Pietikäinen <pe...@pietikainen.org>
> Sent: Thursday, February 4, 2021 2:15 AM
> To: e1000-devel@lists.sourceforge.net
> Subject: [E1000-devel] Fake Tx hangs with ixgbe 5.10.2
>
> Hi,
>
> We were seeing a lot of Tx hangs (once a day in lab, much more in
> production) with Ubuntu 18.04.5 built-in driver (5.0.0-k and 5.1.0-k),
> causing link resets etc.. Trying to isolate the problem a bit more we're now
> trying the latest out-of-tree driver. Now it's
>
> Feb 03 22:46:19 x kernel: NETDEV WATCHDOG: enp13s0 (ixgbe): transmit queue 3 timed out
> Feb 03 22:46:19 x kernel: ixgbe 0000:0d:00.0 enp13s0: Fake Tx hang detected with timeout of 5 seconds
> Feb 04 01:56:01 x kernel: ixgbe 0000:0d:00.0 enp13s0: Fake Tx hang detected with timeout of 10 seconds
> Feb 04 15:13:21 x kernel: ixgbe 0000:0d:00.0 enp13s0: Fake Tx hang detected with timeout of 20 seconds
>
> "Fake Tx Hang", but traffic still stops for quite a while.
> Setup is
>
> ixgbe 0000:0d:00.0: enabling device (0000 -> 0002)
> ixgbe 0000:0d:00.0 0000:0d:00.0 (uninitialized): ixgbe_check_options: FCoE Offload feature enabled
> ixgbe 0000:0d:00.0: Multiqueue Enabled: Rx Queue count = 8, Tx Queue count = 8 XDP Queue count = 0
> ixgbe 0000:0d:00.0: 32.000 Gb/s available PCIe bandwidth, limited by 5 GT/s x8 link at 0000:00:01.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
> ixgbe 0000:0d:00.0 eth0: MAC: 2, PHY: 20, SFP+: 5, PBA No: E68787-011
>
> ixgbe 0000:0d:00.0 eth0: Enabled Features: RxQ: 8 TxQ: 8 FdirHash
> ixgbe 0000:0d:00.0 eth0: Intel(R) 10 Gigabit Network Connection
>
> talking via 10Gbps fibre to Netgear switch to 20 clients (1GbE over copper)
>
> Disabling irqbalance and using set_irq_affinity (-x local) results in
>
> /proc/interrupts
>
> 132:     6932    1128    1977    2921    7857    7960    4392    3754  IR-PCI-MSI 520192-edge   enp0s31f6
> 133:   264128  280513  208862  114758  608511     839   90376  187222  IR-PCI-MSI 6815744-edge  enp13s0-TxRx-0
> 134:  2844471  365014  105711 1009702  423599  688773       0  589578  IR-PCI-MSI 6815745-edge  enp13s0-TxRx-1
> 135:    95567  140332   43052   46623   86704   57976  135573  167195  IR-PCI-MSI 6815746-edge  enp13s0-TxRx-2
> 136:   318887  197761  625025  193550  101415  102114   55755  291272  IR-PCI-MSI 6815747-edge  enp13s0-TxRx-3
> 137:   491537  214000  158900  244269  266167   97220   25460  289919  IR-PCI-MSI 6815748-edge  enp13s0-TxRx-4
> 138:    65767  248812  115232  238502   80103   46189   50782   81833  IR-PCI-MSI 6815749-edge  enp13s0-TxRx-5
> 139:   207157  260936   48883   47421  216735   97253   92818   89079  IR-PCI-MSI 6815750-edge  enp13s0-TxRx-6
> 140:    21512  424646    5111  390019  441436    4728  291370  278387  IR-PCI-MSI 6815751-edge  enp13s0-TxRx-7
> 141:        0       0       0       3       0       0       0       0  IR-PCI-MSI 6815752-edge  enp13s0
>
> TxRx-1 on CPU0 seems a bit unbalanced, but otherwise fine?
>
> ethtool would suggest it's not flow control related (which would explain
> things?), now experimenting with tuning interrupt moderation / ring size /
> avoiding CPU0.
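
To put a number on the imbalance, the per-CPU totals for the enp13s0-TxRx vectors can be summed from /proc/interrupts. A rough sketch, assuming the row layout in the quoted output (IRQ label, one count per CPU, then controller, trigger type and name). The function name per_cpu_totals is illustrative, and the three sample rows are copied from the table above.

```python
SAMPLE = """\
134: 2844471 365014 105711 1009702 423599 688773 0 589578 IR-PCI-MSI 6815745-edge enp13s0-TxRx-1
135: 95567 140332 43052 46623 86704 57976 135573 167195 IR-PCI-MSI 6815746-edge enp13s0-TxRx-2
141: 0 0 0 3 0 0 0 0 IR-PCI-MSI 6815752-edge enp13s0
"""


def per_cpu_totals(text, match):
    """Sum interrupt counts per CPU column for rows whose name contains match."""
    totals = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines, the CPU header row, and rows for other devices.
        if not fields or not fields[0].endswith(":") or match not in fields[-1]:
            continue
        counts = []
        for field in fields[1:]:
            if not field.isdigit():
                break  # reached the controller/name columns
            counts.append(int(field))
        if len(totals) < len(counts):
            totals.extend([0] * (len(counts) - len(totals)))
        for cpu, count in enumerate(counts):
            totals[cpu] += count
    return totals


if __name__ == "__main__":
    totals = per_cpu_totals(SAMPLE, "enp13s0-TxRx")
    grand = sum(totals)
    for cpu, count in enumerate(totals):
        print(f"CPU{cpu}: {count} ({count / grand:.1%})")
```

On the live system the same function can be fed the contents of /proc/interrupts instead of SAMPLE. Taking two snapshots some seconds apart and subtracting them would show the current rate rather than the totals since boot.
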
> Is there anything else to try?
>
> NIC statistics:
> rx_packets: 188092169
> tx_packets: 59581798
> rx_bytes: 263463989424
> tx_bytes: 25727566814
> rx_errors: 0
> tx_errors: 0
> rx_dropped: 0
> tx_dropped: 0
> multicast: 1237
> collisions: 0
> rx_over_errors: 0
> rx_crc_errors: 0
> rx_frame_errors: 0
> rx_fifo_errors: 0
> rx_missed_errors: 0
> tx_aborted_errors: 0
> tx_carrier_errors: 0
> tx_fifo_errors: 0
> tx_heartbeat_errors: 0
> rx_pkts_nic: 188092169
> tx_pkts_nic: 59581798
> rx_bytes_nic: 264216358100
> tx_bytes_nic: 25965914840
> lsc_int: 3
> tx_busy: 0
> non_eop_descs: 0
> broadcast: 259
> rx_no_buffer_count: 0
> tx_timeout_count: 0
> tx_restart_queue: 4
> rx_length_errors: 0
> rx_long_length_errors: 0
> rx_short_length_errors: 0
> tx_flow_control_xon: 0
> rx_flow_control_xon: 0
> tx_flow_control_xoff: 0
> rx_flow_control_xoff: 0
> rx_csum_offload_errors: 0
> alloc_rx_page: 40526166
> alloc_rx_page_failed: 0
> alloc_rx_buff_failed: 0
> rx_no_dma_resources: 0
> hw_rsc_aggregated: 0
> hw_rsc_flushed: 0
> fdir_match: 188027415
> fdir_miss: 178920
> fdir_overflow: 0
> fcoe_bad_fccrc: 0
> fcoe_last_errors: 0
> rx_fcoe_dropped: 0
> rx_fcoe_packets: 0
> rx_fcoe_dwords: 0
> fcoe_noddp: 0
> fcoe_noddp_ext_buff: 0
> tx_fcoe_packets: 0
> tx_fcoe_dwords: 0
> os2bmc_rx_by_bmc: 0
> os2bmc_tx_by_bmc: 0
> os2bmc_tx_by_host: 0
> os2bmc_rx_by_host: 0
> tx_hwtstamp_timeouts: 0
> tx_hwtstamp_skipped: 0
> rx_hwtstamp_cleared: 0
> tx_queue_0_packets: 2551919
> tx_queue_0_bytes: 303310258
> tx_queue_1_packets: 34799436
> tx_queue_1_bytes: 2365301599
> tx_queue_2_packets: 726053
> tx_queue_2_bytes: 101796295
> tx_queue_3_packets: 5109959
> tx_queue_3_bytes: 4885506128
> tx_queue_4_packets: 4655587
> tx_queue_4_bytes: 4755028677
> tx_queue_5_packets: 2533314
> tx_queue_5_bytes: 2516349075
> tx_queue_6_packets: 4109721
> tx_queue_6_bytes: 5511688758
> tx_queue_7_packets: 5095809
> tx_queue_7_bytes: 5288586024
> tx_queue_8_packets: 0
> tx_queue_8_bytes: 0
> tx_queue_9_packets: 0
> tx_queue_9_bytes: 0
> tx_queue_10_packets: 0
> tx_queue_10_bytes: 0
> tx_queue_11_packets: 0
> tx_queue_11_bytes: 0
> tx_queue_12_packets: 0
> tx_queue_12_bytes: 0
> tx_queue_13_packets: 0
> tx_queue_13_bytes: 0
> tx_queue_14_packets: 0
> tx_queue_14_bytes: 0
> tx_queue_15_packets: 0
> tx_queue_15_bytes: 0
> tx_queue_16_packets: 0
> tx_queue_16_bytes: 0
> tx_queue_17_packets: 0
> tx_queue_17_bytes: 0
> tx_queue_18_packets: 0
> tx_queue_18_bytes: 0
> tx_queue_19_packets: 0
> tx_queue_19_bytes: 0
> tx_queue_20_packets: 0
> tx_queue_20_bytes: 0
> tx_queue_21_packets: 0
> tx_queue_21_bytes: 0
> tx_queue_22_packets: 0
> tx_queue_22_bytes: 0
> tx_queue_23_packets: 0
> tx_queue_23_bytes: 0
> tx_queue_24_packets: 0
> tx_queue_24_bytes: 0
> tx_queue_25_packets: 0
> tx_queue_25_bytes: 0
> tx_queue_26_packets: 0
> tx_queue_26_bytes: 0
> tx_queue_27_packets: 0
> tx_queue_27_bytes: 0
> tx_queue_28_packets: 0
> tx_queue_28_bytes: 0
> tx_queue_29_packets: 0
> tx_queue_29_bytes: 0
> tx_queue_30_packets: 0
> tx_queue_30_bytes: 0
> tx_queue_31_packets: 0
> tx_queue_31_bytes: 0
> tx_queue_32_packets: 0
> tx_queue_32_bytes: 0
> tx_queue_33_packets: 0
> tx_queue_33_bytes: 0
> tx_queue_34_packets: 0
> tx_queue_34_bytes: 0
> tx_queue_35_packets: 0
> tx_queue_35_bytes: 0
> tx_queue_36_packets: 0
> tx_queue_36_bytes: 0
> tx_queue_37_packets: 0
> tx_queue_37_bytes: 0
> tx_queue_38_packets: 0
> tx_queue_38_bytes: 0
> tx_queue_39_packets: 0
> tx_queue_39_bytes: 0
> tx_queue_40_packets: 0
> tx_queue_40_bytes: 0
> tx_queue_41_packets: 0
> tx_queue_41_bytes: 0
> tx_queue_42_packets: 0
> tx_queue_42_bytes: 0
> tx_queue_43_packets: 0
> tx_queue_43_bytes: 0
> tx_queue_44_packets: 0
> tx_queue_44_bytes: 0
> tx_queue_45_packets: 0
> tx_queue_45_bytes: 0
> tx_queue_46_packets: 0
> tx_queue_46_bytes: 0
> tx_queue_47_packets: 0
> tx_queue_47_bytes: 0
> tx_queue_48_packets: 0
> tx_queue_48_bytes: 0
> tx_queue_49_packets: 0
> tx_queue_49_bytes: 0
> tx_queue_50_packets: 0
> tx_queue_50_bytes: 0
> tx_queue_51_packets: 0
> tx_queue_51_bytes: 0
> tx_queue_52_packets: 0
> tx_queue_52_bytes: 0
> tx_queue_53_packets: 0
> tx_queue_53_bytes: 0
> tx_queue_54_packets: 0
> tx_queue_54_bytes: 0
> tx_queue_55_packets: 0
> tx_queue_55_bytes: 0
> tx_queue_56_packets: 0
> tx_queue_56_bytes: 0
> tx_queue_57_packets: 0
> tx_queue_57_bytes: 0
> tx_queue_58_packets: 0
> tx_queue_58_bytes: 0
> tx_queue_59_packets: 0
> tx_queue_59_bytes: 0
> tx_queue_60_packets: 0
> tx_queue_60_bytes: 0
> tx_queue_61_packets: 0
> tx_queue_61_bytes: 0
> tx_queue_62_packets: 0
> tx_queue_62_bytes: 0
> tx_queue_63_packets: 0
> tx_queue_63_bytes: 0
> tx_queue_64_packets: 0
> tx_queue_64_bytes: 0
> tx_queue_65_packets: 0
> tx_queue_65_bytes: 0
> tx_queue_66_packets: 0
> tx_queue_66_bytes: 0
> tx_queue_67_packets: 0
> tx_queue_67_bytes: 0
> tx_queue_68_packets: 0
> tx_queue_68_bytes: 0
> tx_queue_69_packets: 0
> tx_queue_69_bytes: 0
> tx_queue_70_packets: 0
> tx_queue_70_bytes: 0
> rx_queue_0_packets: 11217199
> rx_queue_0_bytes: 16124868046
> rx_queue_1_packets: 157176240
> rx_queue_1_bytes: 227080033991
> rx_queue_2_packets: 1501661
> rx_queue_2_bytes: 1972184904
> rx_queue_3_packets: 5075867
> rx_queue_3_bytes: 5607058971
> rx_queue_4_packets: 5326935
> rx_queue_4_bytes: 6020504920
> rx_queue_5_packets: 1606940
> rx_queue_5_bytes: 1282193306
> rx_queue_6_packets: 2148621
> rx_queue_6_bytes: 1388353208
> rx_queue_7_packets: 4038706
> rx_queue_7_bytes: 3988792078
> rx_queue_8_packets: 0
> rx_queue_8_bytes: 0
> rx_queue_9_packets: 0
> rx_queue_9_bytes: 0
> rx_queue_10_packets: 0
> rx_queue_10_bytes: 0
> rx_queue_11_packets: 0
> rx_queue_11_bytes: 0
> rx_queue_12_packets: 0
> rx_queue_12_bytes: 0
> rx_queue_13_packets: 0
> rx_queue_13_bytes: 0
> rx_queue_14_packets: 0
> rx_queue_14_bytes: 0
> rx_queue_15_packets: 0
> rx_queue_15_bytes: 0
> rx_queue_16_packets: 0
> rx_queue_16_bytes: 0
> rx_queue_17_packets: 0
> rx_queue_17_bytes: 0
> rx_queue_18_packets: 0
> rx_queue_18_bytes: 0
> rx_queue_19_packets: 0
> rx_queue_19_bytes: 0
> rx_queue_20_packets: 0
> rx_queue_20_bytes: 0
> rx_queue_21_packets: 0
> rx_queue_21_bytes: 0
> rx_queue_22_packets: 0
> rx_queue_22_bytes: 0
> rx_queue_23_packets: 0
> rx_queue_23_bytes: 0
> rx_queue_24_packets: 0
> rx_queue_24_bytes: 0
> rx_queue_25_packets: 0
> rx_queue_25_bytes: 0
> rx_queue_26_packets: 0
> rx_queue_26_bytes: 0
> rx_queue_27_packets: 0
> rx_queue_27_bytes: 0
> rx_queue_28_packets: 0
> rx_queue_28_bytes: 0
> rx_queue_29_packets: 0
> rx_queue_29_bytes: 0
> rx_queue_30_packets: 0
> rx_queue_30_bytes: 0
> rx_queue_31_packets: 0
> rx_queue_31_bytes: 0
> rx_queue_32_packets: 0
> rx_queue_32_bytes: 0
> rx_queue_33_packets: 0
> rx_queue_33_bytes: 0
> rx_queue_34_packets: 0
> rx_queue_34_bytes: 0
> rx_queue_35_packets: 0
> rx_queue_35_bytes: 0
> rx_queue_36_packets: 0
> rx_queue_36_bytes: 0
> rx_queue_37_packets: 0
> rx_queue_37_bytes: 0
> rx_queue_38_packets: 0
> rx_queue_38_bytes: 0
> rx_queue_39_packets: 0
> rx_queue_39_bytes: 0
> rx_queue_40_packets: 0
> rx_queue_40_bytes: 0
> rx_queue_41_packets: 0
> rx_queue_41_bytes: 0
> rx_queue_42_packets: 0
> rx_queue_42_bytes: 0
> rx_queue_43_packets: 0
> rx_queue_43_bytes: 0
> rx_queue_44_packets: 0
> rx_queue_44_bytes: 0
> rx_queue_45_packets: 0
> rx_queue_45_bytes: 0
> rx_queue_46_packets: 0
> rx_queue_46_bytes: 0
> rx_queue_47_packets: 0
> rx_queue_47_bytes: 0
> rx_queue_48_packets: 0
> rx_queue_48_bytes: 0
> rx_queue_49_packets: 0
> rx_queue_49_bytes: 0
> rx_queue_50_packets: 0
> rx_queue_50_bytes: 0
> rx_queue_51_packets: 0
> rx_queue_51_bytes: 0
> rx_queue_52_packets: 0
> rx_queue_52_bytes: 0
> rx_queue_53_packets: 0
> rx_queue_53_bytes: 0
> rx_queue_54_packets: 0
> rx_queue_54_bytes: 0
> rx_queue_55_packets: 0
> rx_queue_55_bytes: 0
> rx_queue_56_packets: 0
> rx_queue_56_bytes: 0
> rx_queue_57_packets: 0
> rx_queue_57_bytes: 0
> rx_queue_58_packets: 0
> rx_queue_58_bytes: 0
> rx_queue_59_packets: 0
> rx_queue_59_bytes: 0
> rx_queue_60_packets: 0
> rx_queue_60_bytes: 0
> rx_queue_61_packets: 0
> rx_queue_61_bytes: 0
> rx_queue_62_packets: 0
> rx_queue_62_bytes: 0
> rx_queue_63_packets: 0
> rx_queue_63_bytes: 0
> rx_queue_64_packets: 0
> rx_queue_64_bytes: 0
> rx_queue_65_packets: 0
> rx_queue_65_bytes: 0
> rx_queue_66_packets: 0
> rx_queue_66_bytes: 0
> rx_queue_67_packets: 0
> rx_queue_67_bytes: 0
> rx_queue_68_packets: 0
> rx_queue_68_bytes: 0
> rx_queue_69_packets: 0
> rx_queue_69_bytes: 0
> rx_queue_70_packets: 0
> rx_queue_70_bytes: 0
> tx_pb_0_pxon: 0
> tx_pb_0_pxoff: 0
> tx_pb_1_pxon: 0
> tx_pb_1_pxoff: 0
> tx_pb_2_pxon: 0
> tx_pb_2_pxoff: 0
> tx_pb_3_pxon: 0
> tx_pb_3_pxoff: 0
> tx_pb_4_pxon: 0
> tx_pb_4_pxoff: 0
> tx_pb_5_pxon: 0
> tx_pb_5_pxoff: 0
> tx_pb_6_pxon: 0
> tx_pb_6_pxoff: 0
> tx_pb_7_pxon: 0
> tx_pb_7_pxoff: 0
> rx_pb_0_pxon: 0
> rx_pb_0_pxoff: 0
> rx_pb_1_pxon: 0
> rx_pb_1_pxoff: 0
> rx_pb_2_pxon: 0
> rx_pb_2_pxoff: 0
> rx_pb_3_pxon: 0
> rx_pb_3_pxoff: 0
> rx_pb_4_pxon: 0
> rx_pb_4_pxoff: 0
> rx_pb_5_pxon: 0
> rx_pb_5_pxoff: 0
> rx_pb_6_pxon: 0
> rx_pb_6_pxoff: 0
> rx_pb_7_pxon: 0
> rx_pb_7_pxoff: 0
>
> _______________________________________________
> E1000-devel mailing list
> E1000-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/e1000-devel
> To learn more about Intel Ethernet, visit
> https://forums.intel.com/s/topic/0TO0P00000018NbWAI/intel-ethernet
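
P.S. The long counter dump quoted above is easier to read once reduced to per-queue shares and a flow-control check. A minimal sketch, assuming `ethtool -S` prints one `name: value` pair per line as in the quoted output. The helper names are illustrative, the sample values are copied from that output, and treating any counter whose name contains "xon" or "xoff" as a flow-control counter is my assumption based on the names shown there.

```python
import re

SAMPLE = """\
NIC statistics:
> tx_flow_control_xoff: 0
> rx_flow_control_xoff: 0
> tx_queue_0_packets: 2551919
> tx_queue_1_packets: 34799436
> tx_queue_2_packets: 726053
> tx_queue_8_packets: 0
"""


def parse_stats(text):
    """Parse 'name: value' lines (optionally '>'-quoted) into a dict of ints."""
    stats = {}
    for line in text.splitlines():
        m = re.match(r"[>\s]*([A-Za-z0-9_]+):\s*(\d+)\s*$", line)
        if m:
            stats[m.group(1)] = int(m.group(2))
    return stats


def queue_shares(stats, direction):
    """Return {queue index: fraction of packets} for queues with traffic."""
    pattern = re.compile(rf"{direction}_queue_(\d+)_packets$")
    counts = {}
    for name, value in stats.items():
        m = pattern.match(name)
        if m and value:
            counts[int(m.group(1))] = value
    total = sum(counts.values())
    return {q: n / total for q, n in counts.items()} if total else {}


def flow_control_seen(stats):
    """True if any counter with xon/xoff in its name is non-zero."""
    return any(v for k, v in stats.items() if "xon" in k or "xoff" in k)


if __name__ == "__main__":
    stats = parse_stats(SAMPLE)
    for queue, share in sorted(queue_shares(stats, "tx").items()):
        print(f"tx queue {queue}: {share:.1%}")
    print("flow control frames seen:", flow_control_seen(stats))
```

Fed the full output, this makes it quick to see how much of the traffic sits on a single queue and to confirm that the pause counters stay at zero between hangs.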