So, to make sure I've got your report right: since updating the firmware, the link flap issue occurs much less frequently and VRRP is no longer switching to a different host, but your log is still getting spammed with ATR messages. Is that correct?
Have you tried turning off ntuple-filters?

As for the link flap, I don't think we've seen a lot of customers running the XL710 at 1Gbps; most are using 10Gbps or 40Gbps. I'll do some research to see if there are any other reports of issues with the XL710 running at 1Gbps and get back to you.

- Greg

> -----Original Message-----
> From: Christian Ruppert [mailto:id...@qasl.de]
> Sent: Monday, July 27, 2015 8:47 AM
> To: e1000-de...@lists.sf.net
> Subject: [E1000-devel] X710 / i40e interface resets
>
> Hi List,
>
> after having some serious issues with the X520 ones I wanted to give the
> X710 a try. At first I had some trouble again. Started to flap sometimes
> and later flapping until VRRP switched to a different host. The Kernel log
> was spammed with:
> Jul 24 12:02:10 somehost kernel: [ 1446.325468] i40e 0000:02:00.0: TX driver issue detected, PF reset issued
> Jul 24 12:02:11 somehost kernel: [ 1447.154720] i40e 0000:02:00.0: FCoE capability is disabled
> Jul 24 12:02:11 somehost kernel: [ 1447.196823] i40e 0000:02:00.0: PHC enabled
> Jul 24 12:02:11 somehost kernel: [ 1447.255115] i40e 0000:02:00.0: enabling bridge mode: VEPA
> Jul 24 12:02:11 somehost kernel: [ 1447.267922] i40e 0000:02:00.0 eth0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
>
> and:
> Jul 24 12:16:08 somehost kernel: [ 2285.524925] i40e 0000:02:00.0: FD filter space full, new ntuple rules will not be added
> Jul 24 12:16:09 somehost kernel: [ 2286.348784] i40e 0000:02:00.0: FD Filter table flushed and FD-SB replayed.
> Jul 24 12:16:09 somehost kernel: [ 2286.356063] i40e 0000:02:00.0: FD Sideband/ntuple is being enabled since we have space in the table now
> Jul 24 12:16:09 somehost kernel: [ 2286.365777] i40e 0000:02:00.0: ATR is being enabled since we have space in the table now
>
> Then I read something about updating the NVM/preboot (here on the ML)
> which I did then. After having some issues with flashing (the update
> utility doesn't like CONFIG_STRICT_DEVMEM=y, there's also no warning about
> it. disabling it did help but not entirely) I decided to flash from a
> Windows host (I had to setup one...) as it seemed somewhat easier and
> safer, for now.
> The preboot util still complained about having an older NVM than expected
> but somehow it worked and it's a more recent version now than it was
> before. So here is the initialization of the 20.2 version/revision and
> also using the most recent 1.2.48 driver on a 3.18.17 Kernel:
>
> Jul 24 11:38:13 somehost kernel: [ 2.251951] i40e: Intel(R) Ethernet Connection XL710 Network Driver - version 1.2.48
> Jul 24 11:38:13 somehost kernel: [ 2.259620] i40e: Copyright (c) 2013 - 2015 Intel Corporation.
> Jul 24 11:38:13 somehost kernel: [ 2.283072] i40e 0000:02:00.0: f4.33.31377 a1.2 n4.42 e191b
> Jul 24 11:38:13 somehost kernel: [ 2.528098] i40e 0000:02:00.0: FCoE capability is disabled
> Jul 24 11:38:13 somehost kernel: [ 2.541226] i40e 0000:02:00.0: MAC address: 68:05:ca:33:17:50
> Jul 24 11:38:13 somehost kernel: [ 2.545296] i40e 0000:02:00.0: SAN MAC: 68:05:ca:33:17:52
> Jul 24 11:38:13 somehost kernel: [ 2.559708] i40e 0000:02:00.0: fcoe queues = 0
> Jul 24 11:38:13 somehost kernel: [ 2.588853] i40e 0000:02:00.0: enabling bridge mode: VEPA
> Jul 24 11:38:13 somehost kernel: [ 2.623476] i40e 0000:02:00.0: PHC enabled
> Jul 24 11:38:13 somehost kernel: [ 2.639801] i40e 0000:02:00.0: PCI-Express: Speed 8.0GT/s Width x8
> Jul 24 11:38:13 somehost kernel: [ 2.653156] i40e 0000:02:00.0: Features: PF-id[0] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB NTUPLE DCB PTP
> Jul 24 11:38:13 somehost kernel: [ 2.672182] i40e 0000:02:00.1: f4.33.31377 a1.2 n4.42 e191b
> Jul 24 11:38:13 somehost kernel: [ 2.921075] i40e 0000:02:00.1: FCoE capability is disabled
> Jul 24 11:38:13 somehost kernel: [ 2.935064] i40e 0000:02:00.1: MAC address: 68:05:ca:33:17:51
> Jul 24 11:38:13 somehost kernel: [ 2.944833] i40e 0000:02:00.1: SAN MAC: 68:05:ca:33:17:53
> Jul 24 11:38:13 somehost kernel: [ 2.971041] i40e 0000:02:00.1: fcoe queues = 0
> Jul 24 11:38:13 somehost kernel: [ 3.004728] i40e 0000:02:00.1: enabling bridge mode: VEPA
> Jul 24 11:38:13 somehost kernel: [ 3.290178] i40e 0000:02:00.1: PHC enabled
> Jul 24 11:38:13 somehost kernel: [ 3.310646] i40e 0000:02:00.1: PCI-Express: Speed 8.0GT/s Width x8
> Jul 24 11:38:13 somehost kernel: [ 3.331079] i40e 0000:02:00.1: Features: PF-id[1] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB NTUPLE DCB PTP
> Jul 24 11:38:13 somehost kernel: [ 4.279754] i40e 0000:02:00.1 rename5: renamed from eth3
> Jul 24 11:38:13 somehost kernel: [ 4.315533] i40e 0000:02:00.0 eth0: renamed from eth1
> Jul 24 11:38:13 somehost kernel: [ 4.387610] i40e 0000:02:00.1 eth1: renamed from rename5
> Jul 24 11:38:13 somehost kernel: [ 6.276102] i40e 0000:02:00.0 eth0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
> Jul 24 11:38:14 somehost kernel: [ 8.396154] i40e 0000:02:00.1 eth1: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
>
> The above mentioned issues are still persistent even though less frequent
> and it's running about 3 days now without flapping that much that the VRRP
> switches to a different host. I'm not happy yet as it's still flapping and
> complaining about ntuple/ATR and such. Can I somehow help (by providing
> the necessary details) to get those issues fixed?
>
> # ethtool -i eth0
> driver: i40e
> version: 1.2.48
> firmware-version: f4.33.31377 a1.2 n4.42 e191b
> bus-info: 0000:02:00.0
> supports-statistics: yes
> supports-test: yes
> supports-eeprom-access: yes
> supports-register-dump: yes
> supports-priv-flags: yes
>
> # ethtool -k eth0
> Features for eth0:
> rx-checksumming: on
> tx-checksumming: on
>     tx-checksum-ipv4: on
>     tx-checksum-ip-generic: off [fixed]
>     tx-checksum-ipv6: on
>     tx-checksum-fcoe-crc: off [fixed]
>     tx-checksum-sctp: on
> scatter-gather: on
>     tx-scatter-gather: on
>     tx-scatter-gather-fraglist: off [fixed]
> tcp-segmentation-offload: on
>     tx-tcp-segmentation: on
>     tx-tcp-ecn-segmentation: on
>     tx-tcp6-segmentation: on
> udp-fragmentation-offload: off [fixed]
> generic-segmentation-offload: on
> generic-receive-offload: on
> large-receive-offload: off [fixed]
> rx-vlan-offload: on
> tx-vlan-offload: on
> ntuple-filters: on
> receive-hashing: on
> highdma: on
> rx-vlan-filter: on
> vlan-challenged: off [fixed]
> tx-lockless: off [fixed]
> netns-local: off [fixed]
> tx-gso-robust: off [fixed]
> tx-fcoe-segmentation: off [fixed]
> tx-gre-segmentation: off [fixed]
> tx-ipip-segmentation: off [fixed]
> tx-sit-segmentation: off [fixed]
> tx-udp_tnl-segmentation: on
> tx-mpls-segmentation: off [fixed]
> fcoe-mtu: off [fixed]
> tx-nocache-copy: off
> loopback: off [fixed]
> rx-fcs: off [fixed]
> rx-all: off [fixed]
> tx-vlan-stag-hw-insert: off [fixed]
> rx-vlan-stag-hw-parse: off [fixed]
> rx-vlan-stag-filter: off [fixed]
> l2-fwd-offload: off [fixed]
> busy-poll: off [fixed]
>
> # lspci -nvv
> 02:00.1 0200: 8086:1572 (rev 01)
>     Subsystem: 8086:0000
>     Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B- DisINTx+
>     Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
>     Latency: 0, Cache Line Size: 64 bytes
>     Interrupt: pin A routed to IRQ 17
>     Region 0: Memory at c2e000000 (64-bit, prefetchable) [size=8M]
>     Region 3: Memory at c2f800000 (64-bit, prefetchable) [size=32K]
>     Expansion ROM at dfa00000 [disabled] [size=512K]
>     Capabilities: [40] Power Management version 3
>         Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
>         Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
>     Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
>         Address: 0000000000000000 Data: 0000
>         Masking: 00000000 Pending: 00000000
>     Capabilities: [70] MSI-X: Enable+ Count=129 Masked-
>         Vector table: BAR=3 offset=00000000
>         PBA: BAR=3 offset=00001000
>     Capabilities: [a0] Express (v2) Endpoint, MSI 00
>         DevCap: MaxPayload 2048 bytes, PhantFunc 0, Latency L0s <512ns, L1 <64us
>             ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset+
>         DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+
>             RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- FLReset-
>             MaxPayload 128 bytes, MaxReadReq 512 bytes
>         DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend+
>         LnkCap: Port #0, Speed 8GT/s, Width x8, ASPM L1, Latency L0 <2us, L1 <16us
>             ClockPM- Surprise- LLActRep- BwNot-
>         LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
>             ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>         LnkSta: Speed 8GT/s, Width x8, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
>         DevCap2: Completion Timeout: Range ABCD, TimeoutDis+
>         DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-
>         LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-, Selectable De-emphasis: -6dB
>             Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
>             Compliance De-emphasis: -6dB
>         LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
>             EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
>     Capabilities: [e0] Vital Product Data
>         Product Name: XL710 40GbE Controller
>         Read-only fields:
>             [PN] Part number:
>             [EC] Engineering changes:
>             [FG] Unknown:
>             [LC] Unknown:
>             [MN] Manufacture ID:
>             [PG] Unknown:
>             [SN] Serial number:
>             [V0] Vendor specific:
>             [RV] Reserved: checksum good, 0 byte(s) reserved
>         Read/write fields:
>             [V1] Vendor specific:
>         End
>     Capabilities: [100 v2] Advanced Error Reporting
>         UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>         UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>         UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
>         CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
>         CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
>         AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ ChkEn-
>     Capabilities: [140 v1] Device Serial Number 50-17-33-ff-ff-ca-05-68
>     Capabilities: [150 v1] Alternative Routing-ID Interpretation (ARI)
>         ARICap: MFVC- ACS-, Next Function: 0
>         ARICtl: MFVC- ACS-, Function Group: 0
>     Capabilities: [160 v1] Single Root I/O Virtualization (SR-IOV)
>         IOVCap: Migration-, Interrupt Message Number: 000
>         IOVCtl: Enable- Migration- Interrupt- MSE- ARIHierarchy-
>         IOVSta: Migration-
>         Initial VFs: 64, Total VFs: 64, Number of VFs: 0, Function Dependency Link: 01
>         VF offset: 335, stride: 1, Device ID: 154c
>         Supported Page Size: 00000553, System Page Size: 00000001
>         Region 0: Memory at 0000000c2f000000 (64-bit, prefetchable)
>         Region 3: Memory at 0000000c2f810000 (64-bit, prefetchable)
>         VF Migration: offset: 00000000, BIR: 0
>     Capabilities: [1a0 v1] Transaction Processing Hints
>         Device specific mode supported
>         No steering table available
>     Capabilities: [1b0 v1] Access Control Services
>         ACSCap: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
>         ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
>     Kernel driver in use: i40e
>
> --
> Regards,
> Christian Ruppert
>
> ------------------------------------------------------------------------------
> _______________________________________________
> E1000-devel mailing list
> E1000-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/e1000-devel
> To learn more about Intel® Ethernet, visit
> http://communities.intel.com/community/wired