On 2015-07-27 18:02, Rose, Gregory V wrote: > So to make sure I've got your report right... Since updating the FW > the link flap issue occurs much less frequently and VRRP is not > causing a switch to a different host but your log is getting spammed > with ATR messages. Is that correct?
Exactly! > > Have you tried turning off ntuple-filters? I'll try that. > > As for the link flap I don't think we've seen a lot of customers > running the XL710 at 1Gbps speeds, most are using the 10Gbps or 40Gbps > speeds. I'll do some research and see if there are any other reports > of issues with the XL710 running at 1Gbps speed and get back to you. Thanks! > > - Greg > >> -----Original Message----- >> From: Christian Ruppert [mailto:id...@qasl.de] >> Sent: Monday, July 27, 2015 8:47 AM >> To: e1000-de...@lists.sf.net >> Subject: [E1000-devel] X710 / i40e interface resets >> >> Hi List, >> >> after having some serious issues with the X520 ones I wanted to give >> the >> X710 a try. At first I had some trouble again. Started to flap >> sometimes >> and later flapping until VRRP switched to a different host. The Kernel >> log >> was spammed with: >> Jul 24 12:02:10 somehost kernel: [ 1446.325468] i40e 0000:02:00.0: TX >> driver issue detected, PF reset issued Jul 24 12:02:11 somehost >> kernel: [ >> 1447.154720] i40e 0000:02:00.0: FCoE capability is disabled Jul 24 >> 12:02:11 somehost kernel: [ 1447.196823] i40e 0000:02:00.0: PHC >> enabled >> Jul 24 12:02:11 somehost kernel: [ 1447.255115] i40e 0000:02:00.0: >> enabling bridge mode: VEPA >> Jul 24 12:02:11 somehost kernel: [ 1447.267922] i40e 0000:02:00.0 >> eth0: >> NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None >> >> and: >> Jul 24 12:16:08 somehost kernel: [ 2285.524925] i40e 0000:02:00.0: FD >> filter space full, new ntuple rules will not be added Jul 24 12:16:09 >> somehost kernel: [ 2286.348784] i40e 0000:02:00.0: FD Filter table >> flushed >> and FD-SB replayed. >> Jul 24 12:16:09 somehost kernel: [ 2286.356063] i40e 0000:02:00.0: FD >> Sideband/ntuple is being enabled since we have space in the table now >> Jul >> 24 12:16:09 somehost kernel: [ 2286.365777] i40e 0000:02:00.0: ATR is >> being enabled since we have space in the table now >> >> Then I read something about updating the NVM/preboot (here on the ML) >> which I did then. After having some issues with flashing (the update >> utility doesn't like CONFIG_STRICT_DEVMEM=y, there's also no warning >> about >> it. disabling it did help but not entirely) I decided to flash from a >> Windows host (I had to setup one...) as it seemed somewhat easier and >> safer, for now. >> The preboot util still complained about having an older NVM than >> expected >> but somehow it worked and it's a more recent version now than it was >> before. So here is the initialization of the 20.2 version/revision and >> also using the most recent 1.2.48 driver on a >> 3.18.17 Kernel: >> >> Jul 24 11:38:13 somehost kernel: [ 2.251951] i40e: Intel(R) >> Ethernet >> Connection XL710 Network Driver - version 1.2.48 >> Jul 24 11:38:13 somehost kernel: [ 2.259620] i40e: Copyright (c) >> 2013 >> - 2015 Intel Corporation. >> Jul 24 11:38:13 somehost kernel: [ 2.283072] i40e 0000:02:00.0: >> f4.33.31377 a1.2 n4.42 e191b >> Jul 24 11:38:13 somehost kernel: [ 2.528098] i40e 0000:02:00.0: >> FCoE >> capability is disabled >> Jul 24 11:38:13 somehost kernel: [ 2.541226] i40e 0000:02:00.0: MAC >> address: 68:05:ca:33:17:50 >> Jul 24 11:38:13 somehost kernel: [ 2.545296] i40e 0000:02:00.0: SAN >> MAC: 68:05:ca:33:17:52 >> Jul 24 11:38:13 somehost kernel: [ 2.559708] i40e 0000:02:00.0: >> fcoe >> queues = 0 >> Jul 24 11:38:13 somehost kernel: [ 2.588853] i40e 0000:02:00.0: >> enabling bridge mode: VEPA >> Jul 24 11:38:13 somehost kernel: [ 2.623476] i40e 0000:02:00.0: PHC >> enabled >> Jul 24 11:38:13 somehost kernel: [ 2.639801] i40e 0000:02:00.0: >> PCI-Express: Speed 8.0GT/s Width x8 >> Jul 24 11:38:13 somehost kernel: [ 2.653156] i40e 0000:02:00.0: >> Features: PF-id[0] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB >> NTUPLE >> DCB PTP >> Jul 24 11:38:13 somehost kernel: [ 2.672182] i40e 0000:02:00.1: >> f4.33.31377 a1.2 n4.42 e191b >> Jul 24 11:38:13 somehost kernel: [ 2.921075] i40e 0000:02:00.1: >> FCoE >> capability is disabled >> Jul 24 11:38:13 somehost kernel: [ 2.935064] i40e 0000:02:00.1: MAC >> address: 68:05:ca:33:17:51 >> Jul 24 11:38:13 somehost kernel: [ 2.944833] i40e 0000:02:00.1: SAN >> MAC: 68:05:ca:33:17:53 >> Jul 24 11:38:13 somehost kernel: [ 2.971041] i40e 0000:02:00.1: >> fcoe >> queues = 0 >> Jul 24 11:38:13 somehost kernel: [ 3.004728] i40e 0000:02:00.1: >> enabling bridge mode: VEPA >> Jul 24 11:38:13 somehost kernel: [ 3.290178] i40e 0000:02:00.1: PHC >> enabled >> Jul 24 11:38:13 somehost kernel: [ 3.310646] i40e 0000:02:00.1: >> PCI-Express: Speed 8.0GT/s Width x8 >> Jul 24 11:38:13 somehost kernel: [ 3.331079] i40e 0000:02:00.1: >> Features: PF-id[1] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB >> NTUPLE >> DCB PTP >> Jul 24 11:38:13 somehost kernel: [ 4.279754] i40e 0000:02:00.1 >> rename5: renamed from eth3 >> Jul 24 11:38:13 somehost kernel: [ 4.315533] i40e 0000:02:00.0 >> eth0: >> renamed from eth1 >> Jul 24 11:38:13 somehost kernel: [ 4.387610] i40e 0000:02:00.1 >> eth1: >> renamed from rename5 >> Jul 24 11:38:13 somehost kernel: [ 6.276102] i40e 0000:02:00.0 >> eth0: >> NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None >> Jul 24 11:38:14 somehost kernel: [ 8.396154] i40e 0000:02:00.1 >> eth1: >> NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None >> >> The above mentioned issues are still persistent even though less >> frequent >> and it's running about 3 days now without flapping that much that the >> VRRP >> switches to a different host. I'm not happy yet as it's still flapping >> and >> complaining about ntuple/ATR and such. Can I somehow help (by >> providing >> the necessary details) to get those issues fixed? >> >> # ethtool -i eth0 >> driver: i40e >> version: 1.2.48 >> firmware-version: f4.33.31377 a1.2 n4.42 e191b >> bus-info: 0000:02:00.0 >> supports-statistics: yes >> supports-test: yes >> supports-eeprom-access: yes >> supports-register-dump: yes >> supports-priv-flags: yes >> >> # ethtool -k eth0 >> Features for eth0: >> rx-checksumming: on >> tx-checksumming: on >> tx-checksum-ipv4: on >> tx-checksum-ip-generic: off [fixed] >> tx-checksum-ipv6: on >> tx-checksum-fcoe-crc: off [fixed] >> tx-checksum-sctp: on >> scatter-gather: on >> tx-scatter-gather: on >> tx-scatter-gather-fraglist: off [fixed] >> tcp-segmentation-offload: on >> tx-tcp-segmentation: on >> tx-tcp-ecn-segmentation: on >> tx-tcp6-segmentation: on >> udp-fragmentation-offload: off [fixed] >> generic-segmentation-offload: on >> generic-receive-offload: on >> large-receive-offload: off [fixed] >> rx-vlan-offload: on >> tx-vlan-offload: on >> ntuple-filters: on >> receive-hashing: on >> highdma: on >> rx-vlan-filter: on >> vlan-challenged: off [fixed] >> tx-lockless: off [fixed] >> netns-local: off [fixed] >> tx-gso-robust: off [fixed] >> tx-fcoe-segmentation: off [fixed] >> tx-gre-segmentation: off [fixed] >> tx-ipip-segmentation: off [fixed] >> tx-sit-segmentation: off [fixed] >> tx-udp_tnl-segmentation: on >> tx-mpls-segmentation: off [fixed] >> fcoe-mtu: off [fixed] >> tx-nocache-copy: off >> loopback: off [fixed] >> rx-fcs: off [fixed] >> rx-all: off [fixed] >> tx-vlan-stag-hw-insert: off [fixed] >> rx-vlan-stag-hw-parse: off [fixed] >> rx-vlan-stag-filter: off [fixed] >> l2-fwd-offload: off [fixed] >> busy-poll: off [fixed] >> >> # lspci -nvv >> 02:00.1 0200: 8086:1572 (rev 01) >> Subsystem: 8086:0000 >> Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ >> Stepping- SERR+ FastB2B- DisINTx+ >> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- >> <TAbort- >> <MAbort- >SERR- <PERR- INTx- >> Latency: 0, Cache Line Size: 64 bytes >> Interrupt: pin A routed to IRQ 17 >> Region 0: Memory at c2e000000 (64-bit, prefetchable) [size=8M] >> Region 3: Memory at c2f800000 (64-bit, prefetchable) [size=32K] >> Expansion ROM at dfa00000 [disabled] [size=512K] >> Capabilities: [40] Power Management version 3 >> Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA >> PME(D0+,D1-,D2-,D3hot+,D3cold+) >> Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME- >> Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+ >> Address: 0000000000000000 Data: 0000 >> Masking: 00000000 Pending: 00000000 >> Capabilities: [70] MSI-X: Enable+ Count=129 Masked- >> Vector table: BAR=3 offset=00000000 >> PBA: BAR=3 offset=00001000 >> Capabilities: [a0] Express (v2) Endpoint, MSI 00 >> DevCap: MaxPayload 2048 bytes, PhantFunc 0, Latency L0s >> <512ns, L1 >> <64us >> ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ >> DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ >> Unsupported+ >> RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- FLReset- >> MaxPayload 128 bytes, MaxReadReq 512 bytes >> DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- >> TransPend+ >> LnkCap: Port #0, Speed 8GT/s, Width x8, ASPM L1, Latency >> L0 <2us, L1 >> <16us >> ClockPM- Surprise- LLActRep- BwNot- >> LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- >> CommClk+ >> ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- >> LnkSta: Speed 8GT/s, Width x8, TrErr- Train- SlotClk+ >> DLActive- >> BWMgmt- ABWMgmt- >> DevCap2: Completion Timeout: Range ABCD, TimeoutDis+ >> DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis- >> LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- >> SpeedDis-, Selectable De-emphasis: -6dB >> Transmit Margin: Normal Operating Range, >> EnterModifiedCompliance- >> ComplianceSOS- >> Compliance De-emphasis: -6dB >> LnkSta2: Current De-emphasis Level: -6dB, >> EqualizationComplete-, >> EqualizationPhase1- >> EqualizationPhase2-, EqualizationPhase3-, >> LinkEqualizationRequest- >> Capabilities: [e0] Vital Product Data >> Product Name: XL710 40GbE Controller >> Read-only fields: >> [PN] Part number: >> [EC] Engineering changes: >> [FG] Unknown: >> [LC] Unknown: >> [MN] Manufacture ID: >> [PG] Unknown: >> [SN] Serial number: >> [V0] Vendor specific: >> [RV] Reserved: checksum good, 0 byte(s) reserved >> Read/write fields: >> [V1] Vendor specific: >> End >> Capabilities: [100 v2] Advanced Error Reporting >> UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- >> RxOF- >> MalfTLP- ECRC- UnsupReq- ACSViol- >> UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- >> RxOF- >> MalfTLP- ECRC- UnsupReq- ACSViol- >> UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- >> RxOF+ >> MalfTLP+ ECRC- UnsupReq- ACSViol- >> CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- >> NonFatalErr+ >> CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- >> NonFatalErr+ >> AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ >> ChkEn- >> Capabilities: [140 v1] Device Serial Number 50-17-33-ff-ff-ca-05-68 >> Capabilities: [150 v1] Alternative Routing-ID Interpretation (ARI) >> ARICap: MFVC- ACS-, Next Function: 0 >> ARICtl: MFVC- ACS-, Function Group: 0 >> Capabilities: [160 v1] Single Root I/O Virtualization (SR-IOV) >> IOVCap: Migration-, Interrupt Message Number: 000 >> IOVCtl: Enable- Migration- Interrupt- MSE- ARIHierarchy- >> IOVSta: Migration- >> Initial VFs: 64, Total VFs: 64, Number of VFs: 0, Function >> Dependency >> Link: 01 >> VF offset: 335, stride: 1, Device ID: 154c >> Supported Page Size: 00000553, System Page Size: 00000001 >> Region 0: Memory at 0000000c2f000000 (64-bit, prefetchable) >> Region 3: Memory at 0000000c2f810000 (64-bit, prefetchable) >> VF Migration: offset: 00000000, BIR: 0 >> Capabilities: [1a0 v1] Transaction Processing Hints >> Device specific mode supported >> No steering table available >> Capabilities: [1b0 v1] Access Control Services >> ACSCap: SrcValid- TransBlk- ReqRedir- CmpltRedir- >> UpstreamFwd- >> EgressCtrl- DirectTrans- >> ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- >> UpstreamFwd- >> EgressCtrl- DirectTrans- >> Kernel driver in use: i40e >> >> -- >> Regards, >> Christian Ruppert >> >> -------------------------------------------------------------------------- >> ---- >> _______________________________________________ >> E1000-devel mailing list >> E1000-devel@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/e1000-devel >> To learn more about Intel® Ethernet, visit >> http://communities.intel.com/community/wired -- Regards, Christian Ruppert ------------------------------------------------------------------------------ _______________________________________________ E1000-devel mailing list E1000-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/e1000-devel To learn more about Intel® Ethernet, visit http://communities.intel.com/community/wired