Greg,

On 2015-07-27 18:20, Christian Ruppert wrote:
> On 2015-07-27 18:02, Rose, Gregory V wrote:
>> So to make sure I've got your report right... Since updating the FW
>> the link flap issue occurs much less frequently and VRRP is not
>> causing a switch to a different host but your log is getting spammed
>> with ATR messages. Is that correct?
>
> Exactly!
>
>>
>> Have you tried turning off ntuple-filters?
>
> I'll try that.

No more:
[341397.489477] i40e 0000:02:00.0: FD filter space full, new ntuple rules will not be added
[341398.041918] i40e 0000:02:00.0: FD Filter table flushed and FD-SB replayed.
[341398.049403] i40e 0000:02:00.0: FD Sideband/ntuple is being enabled since we have space in the table now
[341398.059858] i40e 0000:02:00.0: ATR is being enabled since we have space in the table now
[341850.732707] i40e 0000:02:00.0: ATR re-enabled.

since I disabled ntuple. So that is gone (not fixed though) for now.
The flaps are still persistent:
[354195.790717] i40e 0000:02:00.0: TX driver issue detected, PF reset issued
[354196.551366] i40e 0000:02:00.0: FCoE capability is disabled
[354196.593458] i40e 0000:02:00.0: PHC enabled
[354196.653574] i40e 0000:02:00.0: enabling bridge mode: VEPA
[354196.672095] i40e 0000:02:00.0 eth0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None

>
>>
>> As for the link flap I don't think we've seen a lot of customers
>> running the XL710 at 1Gbps speeds, most are using the 10Gbps or 40Gbps
>> speeds. I'll do some research and see if there are any other reports
>> of issues with the XL710 running at 1Gbps speed and get back to you.
>
> Thanks!
>
>>
>> - Greg
>>
>>> -----Original Message-----
>>> From: Christian Ruppert [mailto:id...@qasl.de]
>>> Sent: Monday, July 27, 2015 8:47 AM
>>> To: e1000-de...@lists.sf.net
>>> Subject: [E1000-devel] X710 / i40e interface resets
>>>
>>> Hi List,
>>>
>>> after having some serious issues with the X520 ones I wanted to give the
>>> X710 a try. At first I had some trouble again. Started to flap sometimes
>>> and later flapping until VRRP switched to a different host. The Kernel log
>>> was spammed with:
>>> Jul 24 12:02:10 somehost kernel: [ 1446.325468] i40e 0000:02:00.0: TX driver issue detected, PF reset issued
>>> Jul 24 12:02:11 somehost kernel: [ 1447.154720] i40e 0000:02:00.0: FCoE capability is disabled
>>> Jul 24 12:02:11 somehost kernel: [ 1447.196823] i40e 0000:02:00.0: PHC enabled
>>> Jul 24 12:02:11 somehost kernel: [ 1447.255115] i40e 0000:02:00.0: enabling bridge mode: VEPA
>>> Jul 24 12:02:11 somehost kernel: [ 1447.267922] i40e 0000:02:00.0 eth0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
>>>
>>> and:
>>> Jul 24 12:16:08 somehost kernel: [ 2285.524925] i40e 0000:02:00.0: FD filter space full, new ntuple rules will not be added
>>> Jul 24 12:16:09 somehost kernel: [ 2286.348784] i40e 0000:02:00.0: FD Filter table flushed and FD-SB replayed.
>>> Jul 24 12:16:09 somehost kernel: [ 2286.356063] i40e 0000:02:00.0: FD Sideband/ntuple is being enabled since we have space in the table now
>>> Jul 24 12:16:09 somehost kernel: [ 2286.365777] i40e 0000:02:00.0: ATR is being enabled since we have space in the table now
>>>
>>> Then I read something about updating the NVM/preboot (here on the ML)
>>> which I did then. After having some issues with flashing (the update
>>> utility doesn't like CONFIG_STRICT_DEVMEM=y, there's also no warning about
>>> it. disabling it did help but not entirely) I decided to flash from a
>>> Windows host (I had to setup one...) as it seemed somewhat easier and
>>> safer, for now.
>>> The preboot util still complained about having an older NVM than expected
>>> but somehow it worked and it's a more recent version now than it was
>>> before.
>>> So here is the initialization of the 20.2 version/revision and
>>> also using the most recent 1.2.48 driver on a 3.18.17 Kernel:
>>>
>>> Jul 24 11:38:13 somehost kernel: [ 2.251951] i40e: Intel(R) Ethernet Connection XL710 Network Driver - version 1.2.48
>>> Jul 24 11:38:13 somehost kernel: [ 2.259620] i40e: Copyright (c) 2013 - 2015 Intel Corporation.
>>> Jul 24 11:38:13 somehost kernel: [ 2.283072] i40e 0000:02:00.0: f4.33.31377 a1.2 n4.42 e191b
>>> Jul 24 11:38:13 somehost kernel: [ 2.528098] i40e 0000:02:00.0: FCoE capability is disabled
>>> Jul 24 11:38:13 somehost kernel: [ 2.541226] i40e 0000:02:00.0: MAC address: 68:05:ca:33:17:50
>>> Jul 24 11:38:13 somehost kernel: [ 2.545296] i40e 0000:02:00.0: SAN MAC: 68:05:ca:33:17:52
>>> Jul 24 11:38:13 somehost kernel: [ 2.559708] i40e 0000:02:00.0: fcoe queues = 0
>>> Jul 24 11:38:13 somehost kernel: [ 2.588853] i40e 0000:02:00.0: enabling bridge mode: VEPA
>>> Jul 24 11:38:13 somehost kernel: [ 2.623476] i40e 0000:02:00.0: PHC enabled
>>> Jul 24 11:38:13 somehost kernel: [ 2.639801] i40e 0000:02:00.0: PCI-Express: Speed 8.0GT/s Width x8
>>> Jul 24 11:38:13 somehost kernel: [ 2.653156] i40e 0000:02:00.0: Features: PF-id[0] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB NTUPLE DCB PTP
>>> Jul 24 11:38:13 somehost kernel: [ 2.672182] i40e 0000:02:00.1: f4.33.31377 a1.2 n4.42 e191b
>>> Jul 24 11:38:13 somehost kernel: [ 2.921075] i40e 0000:02:00.1: FCoE capability is disabled
>>> Jul 24 11:38:13 somehost kernel: [ 2.935064] i40e 0000:02:00.1: MAC address: 68:05:ca:33:17:51
>>> Jul 24 11:38:13 somehost kernel: [ 2.944833] i40e 0000:02:00.1: SAN MAC: 68:05:ca:33:17:53
>>> Jul 24 11:38:13 somehost kernel: [ 2.971041] i40e 0000:02:00.1: fcoe queues = 0
>>> Jul 24 11:38:13 somehost kernel: [ 3.004728] i40e 0000:02:00.1: enabling bridge mode: VEPA
>>> Jul 24 11:38:13 somehost kernel: [ 3.290178] i40e 0000:02:00.1: PHC enabled
>>> Jul 24 11:38:13 somehost kernel: [ 3.310646] i40e 0000:02:00.1: PCI-Express: Speed 8.0GT/s Width x8
>>> Jul 24 11:38:13 somehost kernel: [ 3.331079] i40e 0000:02:00.1: Features: PF-id[1] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB NTUPLE DCB PTP
>>> Jul 24 11:38:13 somehost kernel: [ 4.279754] i40e 0000:02:00.1 rename5: renamed from eth3
>>> Jul 24 11:38:13 somehost kernel: [ 4.315533] i40e 0000:02:00.0 eth0: renamed from eth1
>>> Jul 24 11:38:13 somehost kernel: [ 4.387610] i40e 0000:02:00.1 eth1: renamed from rename5
>>> Jul 24 11:38:13 somehost kernel: [ 6.276102] i40e 0000:02:00.0 eth0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
>>> Jul 24 11:38:14 somehost kernel: [ 8.396154] i40e 0000:02:00.1 eth1: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
>>>
>>> The above mentioned issues are still persistent even though less frequent
>>> and it's running about 3 days now without flapping that much that the VRRP
>>> switches to a different host. I'm not happy yet as it's still flapping and
>>> complaining about ntuple/ATR and such. Can I somehow help (by providing
>>> the necessary details) to get those issues fixed?
>>>
>>> # ethtool -i eth0
>>> driver: i40e
>>> version: 1.2.48
>>> firmware-version: f4.33.31377 a1.2 n4.42 e191b
>>> bus-info: 0000:02:00.0
>>> supports-statistics: yes
>>> supports-test: yes
>>> supports-eeprom-access: yes
>>> supports-register-dump: yes
>>> supports-priv-flags: yes
>>>
>>> # ethtool -k eth0
>>> Features for eth0:
>>> rx-checksumming: on
>>> tx-checksumming: on
>>>     tx-checksum-ipv4: on
>>>     tx-checksum-ip-generic: off [fixed]
>>>     tx-checksum-ipv6: on
>>>     tx-checksum-fcoe-crc: off [fixed]
>>>     tx-checksum-sctp: on
>>> scatter-gather: on
>>>     tx-scatter-gather: on
>>>     tx-scatter-gather-fraglist: off [fixed]
>>> tcp-segmentation-offload: on
>>>     tx-tcp-segmentation: on
>>>     tx-tcp-ecn-segmentation: on
>>>     tx-tcp6-segmentation: on
>>> udp-fragmentation-offload: off [fixed]
>>> generic-segmentation-offload: on
>>> generic-receive-offload: on
>>> large-receive-offload: off [fixed]
>>> rx-vlan-offload: on
>>> tx-vlan-offload: on
>>> ntuple-filters: on
>>> receive-hashing: on
>>> highdma: on
>>> rx-vlan-filter: on
>>> vlan-challenged: off [fixed]
>>> tx-lockless: off [fixed]
>>> netns-local: off [fixed]
>>> tx-gso-robust: off [fixed]
>>> tx-fcoe-segmentation: off [fixed]
>>> tx-gre-segmentation: off [fixed]
>>> tx-ipip-segmentation: off [fixed]
>>> tx-sit-segmentation: off [fixed]
>>> tx-udp_tnl-segmentation: on
>>> tx-mpls-segmentation: off [fixed]
>>> fcoe-mtu: off [fixed]
>>> tx-nocache-copy: off
>>> loopback: off [fixed]
>>> rx-fcs: off [fixed]
>>> rx-all: off [fixed]
>>> tx-vlan-stag-hw-insert: off [fixed]
>>> rx-vlan-stag-hw-parse: off [fixed]
>>> rx-vlan-stag-filter: off [fixed]
>>> l2-fwd-offload: off [fixed]
>>> busy-poll: off [fixed]
>>>
>>> # lspci -nvv
>>> 02:00.1 0200: 8086:1572 (rev 01)
>>>     Subsystem: 8086:0000
>>>     Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B- DisINTx+
>>>     Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
>>>     Latency: 0, Cache Line Size: 64 bytes
>>>     Interrupt: pin A routed to IRQ 17
>>>     Region 0: Memory at c2e000000 (64-bit, prefetchable) [size=8M]
>>>     Region 3: Memory at c2f800000 (64-bit, prefetchable) [size=32K]
>>>     Expansion ROM at dfa00000 [disabled] [size=512K]
>>>     Capabilities: [40] Power Management version 3
>>>         Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
>>>         Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
>>>     Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
>>>         Address: 0000000000000000 Data: 0000
>>>         Masking: 00000000 Pending: 00000000
>>>     Capabilities: [70] MSI-X: Enable+ Count=129 Masked-
>>>         Vector table: BAR=3 offset=00000000
>>>         PBA: BAR=3 offset=00001000
>>>     Capabilities: [a0] Express (v2) Endpoint, MSI 00
>>>         DevCap: MaxPayload 2048 bytes, PhantFunc 0, Latency L0s <512ns, L1 <64us
>>>             ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset+
>>>         DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+
>>>             RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- FLReset-
>>>             MaxPayload 128 bytes, MaxReadReq 512 bytes
>>>         DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend+
>>>         LnkCap: Port #0, Speed 8GT/s, Width x8, ASPM L1, Latency L0 <2us, L1 <16us
>>>             ClockPM- Surprise- LLActRep- BwNot-
>>>         LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
>>>             ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>>>         LnkSta: Speed 8GT/s, Width x8, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
>>>         DevCap2: Completion Timeout: Range ABCD, TimeoutDis+
>>>         DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-
>>>         LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-, Selectable De-emphasis: -6dB
>>>             Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
>>>             Compliance De-emphasis: -6dB
>>>         LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
>>>             EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
>>>     Capabilities: [e0] Vital Product Data
>>>         Product Name: XL710 40GbE Controller
>>>         Read-only fields:
>>>             [PN] Part number:
>>>             [EC] Engineering changes:
>>>             [FG] Unknown:
>>>             [LC] Unknown:
>>>             [MN] Manufacture ID:
>>>             [PG] Unknown:
>>>             [SN] Serial number:
>>>             [V0] Vendor specific:
>>>             [RV] Reserved: checksum good, 0 byte(s) reserved
>>>         Read/write fields:
>>>             [V1] Vendor specific:
>>>         End
>>>     Capabilities: [100 v2] Advanced Error Reporting
>>>         UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>>>         UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>>>         UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
>>>         CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
>>>         CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
>>>         AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ ChkEn-
>>>     Capabilities: [140 v1] Device Serial Number 50-17-33-ff-ff-ca-05-68
>>>     Capabilities: [150 v1] Alternative Routing-ID Interpretation (ARI)
>>>         ARICap: MFVC- ACS-, Next Function: 0
>>>         ARICtl: MFVC- ACS-, Function Group: 0
>>>     Capabilities: [160 v1] Single Root I/O Virtualization (SR-IOV)
>>>         IOVCap: Migration-, Interrupt Message Number: 000
>>>         IOVCtl: Enable- Migration- Interrupt- MSE- ARIHierarchy-
>>>         IOVSta: Migration-
>>>         Initial VFs: 64, Total VFs: 64, Number of VFs: 0, Function Dependency Link: 01
>>>         VF offset: 335, stride: 1, Device ID: 154c
>>>         Supported Page Size: 00000553, System Page Size: 00000001
>>>         Region 0: Memory at 0000000c2f000000 (64-bit, prefetchable)
>>>         Region 3: Memory at 0000000c2f810000 (64-bit, prefetchable)
>>>         VF Migration: offset: 00000000, BIR: 0
>>>     Capabilities: [1a0 v1] Transaction Processing Hints
>>>         Device specific mode supported
>>>         No steering table available
>>>     Capabilities: [1b0 v1] Access Control Services
>>>         ACSCap: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
>>>         ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
>>>     Kernel driver in use: i40e
>>>
>>> --
>>> Regards,
>>> Christian Ruppert
>>>
>>> ------------------------------------------------------------------------------
>>> _______________________________________________
>>> E1000-devel mailing list
>>> E1000-devel@lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/e1000-devel
>>> To learn more about Intel® Ethernet, visit
>>> http://communities.intel.com/community/wired

--
Regards,
Christian Ruppert