Hi List,

After some serious issues with the X520 adapters I wanted to give the
X710 a try. At first I had trouble again: the link started to flap
occasionally, and later it flapped so often that VRRP failed over to a
different host. The kernel log was spammed with:
Jul 24 12:02:10 somehost kernel: [ 1446.325468] i40e 0000:02:00.0: TX driver issue detected, PF reset issued
Jul 24 12:02:11 somehost kernel: [ 1447.154720] i40e 0000:02:00.0: FCoE capability is disabled
Jul 24 12:02:11 somehost kernel: [ 1447.196823] i40e 0000:02:00.0: PHC enabled
Jul 24 12:02:11 somehost kernel: [ 1447.255115] i40e 0000:02:00.0: enabling bridge mode: VEPA
Jul 24 12:02:11 somehost kernel: [ 1447.267922] i40e 0000:02:00.0 eth0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None

and:
Jul 24 12:16:08 somehost kernel: [ 2285.524925] i40e 0000:02:00.0: FD filter space full, new ntuple rules will not be added
Jul 24 12:16:09 somehost kernel: [ 2286.348784] i40e 0000:02:00.0: FD Filter table flushed and FD-SB replayed.
Jul 24 12:16:09 somehost kernel: [ 2286.356063] i40e 0000:02:00.0: FD Sideband/ntuple is being enabled since we have space in the table now
Jul 24 12:16:09 somehost kernel: [ 2286.365777] i40e 0000:02:00.0: ATR is being enabled since we have space in the table now
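
To put numbers on how often these messages show up, something along
these lines could tally them from the kernel log. This is only a rough
sketch: the message fragments are copied from the excerpts above, while
the labels, the function name and the SAMPLE list are made up for
illustration. It assumes the real log has one message per line (not
wrapped as in this mail).

```python
import re
from collections import Counter

# Message fragments copied from the log excerpts above; the labels are made up.
PATTERNS = {
    "pf_reset": re.compile(r"TX driver issue detected, PF reset issued"),
    "fd_full": re.compile(r"FD filter space full"),
    "fd_flushed": re.compile(r"FD Filter table flushed"),
    "link_up": re.compile(r"NIC Link is Up"),
}


def count_i40e_events(lines):
    """Count the i40e messages of interest in an iterable of log lines."""
    counts = Counter()
    for line in lines:
        if "i40e" not in line:
            continue  # ignore messages from other drivers
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
    return counts


# Illustrative input: four of the lines quoted above, unwrapped.
SAMPLE = [
    "Jul 24 12:02:10 somehost kernel: [ 1446.325468] i40e 0000:02:00.0: "
    "TX driver issue detected, PF reset issued",
    "Jul 24 12:02:11 somehost kernel: [ 1447.267922] i40e 0000:02:00.0 eth0: "
    "NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None",
    "Jul 24 12:16:08 somehost kernel: [ 2285.524925] i40e 0000:02:00.0: "
    "FD filter space full, new ntuple rules will not be added",
    "Jul 24 12:16:09 somehost kernel: [ 2286.348784] i40e 0000:02:00.0: "
    "FD Filter table flushed and FD-SB replayed.",
]

for label, n in sorted(count_i40e_events(SAMPLE).items()):
    print(f"{label}: {n}")
```

For a real log one would pass an open file object instead of SAMPLE,
since the function accepts any iterable of lines.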

Then I read (here on the ML) about updating the NVM/preboot image,
which I did. Flashing under Linux was problematic: the update utility
doesn't like CONFIG_STRICT_DEVMEM=y and gives no warning about it;
disabling that option helped, but not entirely. So I decided to flash
from a Windows host (which I had to set up first...), as that seemed
somewhat easier and safer for now.
The preboot utility still complained that the NVM was older than
expected, but somehow it worked and the card now runs a more recent
version than before. Here is the initialization with the 20.2
version/revision, using the most recent 1.2.48 driver on a 3.18.17
kernel:

Jul 24 11:38:13 somehost kernel: [    2.251951] i40e: Intel(R) Ethernet Connection XL710 Network Driver - version 1.2.48
Jul 24 11:38:13 somehost kernel: [    2.259620] i40e: Copyright (c) 2013 - 2015 Intel Corporation.
Jul 24 11:38:13 somehost kernel: [    2.283072] i40e 0000:02:00.0: f4.33.31377 a1.2 n4.42 e191b
Jul 24 11:38:13 somehost kernel: [    2.528098] i40e 0000:02:00.0: FCoE capability is disabled
Jul 24 11:38:13 somehost kernel: [    2.541226] i40e 0000:02:00.0: MAC address: 68:05:ca:33:17:50
Jul 24 11:38:13 somehost kernel: [    2.545296] i40e 0000:02:00.0: SAN MAC: 68:05:ca:33:17:52
Jul 24 11:38:13 somehost kernel: [    2.559708] i40e 0000:02:00.0: fcoe queues = 0
Jul 24 11:38:13 somehost kernel: [    2.588853] i40e 0000:02:00.0: enabling bridge mode: VEPA
Jul 24 11:38:13 somehost kernel: [    2.623476] i40e 0000:02:00.0: PHC enabled
Jul 24 11:38:13 somehost kernel: [    2.639801] i40e 0000:02:00.0: PCI-Express: Speed 8.0GT/s Width x8
Jul 24 11:38:13 somehost kernel: [    2.653156] i40e 0000:02:00.0: Features: PF-id[0] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB NTUPLE DCB PTP
Jul 24 11:38:13 somehost kernel: [    2.672182] i40e 0000:02:00.1: f4.33.31377 a1.2 n4.42 e191b
Jul 24 11:38:13 somehost kernel: [    2.921075] i40e 0000:02:00.1: FCoE capability is disabled
Jul 24 11:38:13 somehost kernel: [    2.935064] i40e 0000:02:00.1: MAC address: 68:05:ca:33:17:51
Jul 24 11:38:13 somehost kernel: [    2.944833] i40e 0000:02:00.1: SAN MAC: 68:05:ca:33:17:53
Jul 24 11:38:13 somehost kernel: [    2.971041] i40e 0000:02:00.1: fcoe queues = 0
Jul 24 11:38:13 somehost kernel: [    3.004728] i40e 0000:02:00.1: enabling bridge mode: VEPA
Jul 24 11:38:13 somehost kernel: [    3.290178] i40e 0000:02:00.1: PHC enabled
Jul 24 11:38:13 somehost kernel: [    3.310646] i40e 0000:02:00.1: PCI-Express: Speed 8.0GT/s Width x8
Jul 24 11:38:13 somehost kernel: [    3.331079] i40e 0000:02:00.1: Features: PF-id[1] VFs: 64 VSIs: 66 QP: 8 RX: 1BUF RSS FD_ATR FD_SB NTUPLE DCB PTP
Jul 24 11:38:13 somehost kernel: [    4.279754] i40e 0000:02:00.1 rename5: renamed from eth3
Jul 24 11:38:13 somehost kernel: [    4.315533] i40e 0000:02:00.0 eth0: renamed from eth1
Jul 24 11:38:13 somehost kernel: [    4.387610] i40e 0000:02:00.1 eth1: renamed from rename5
Jul 24 11:38:13 somehost kernel: [    6.276102] i40e 0000:02:00.0 eth0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
Jul 24 11:38:14 somehost kernel: [    8.396154] i40e 0000:02:00.1 eth1: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None

The issues mentioned above still persist, though less frequently: the
host has now been running for about 3 days without flapping badly
enough for VRRP to switch to a different host. I'm not happy yet, as
the link is still flapping and the driver still complains about
ntuple/ATR and such. Can I help somehow (by providing the necessary
details) to get these issues fixed?

# ethtool -i eth0
driver: i40e
version: 1.2.48
firmware-version: f4.33.31377 a1.2 n4.42 e191b
bus-info: 0000:02:00.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: yes

# ethtool -k eth0
Features for eth0:
rx-checksumming: on
tx-checksumming: on
        tx-checksum-ipv4: on
        tx-checksum-ip-generic: off [fixed]
        tx-checksum-ipv6: on
        tx-checksum-fcoe-crc: off [fixed]
        tx-checksum-sctp: on
scatter-gather: on
        tx-scatter-gather: on
        tx-scatter-gather-fraglist: off [fixed]
tcp-segmentation-offload: on
        tx-tcp-segmentation: on
        tx-tcp-ecn-segmentation: on
        tx-tcp6-segmentation: on
udp-fragmentation-offload: off [fixed]
generic-segmentation-offload: on
generic-receive-offload: on
large-receive-offload: off [fixed]
rx-vlan-offload: on
tx-vlan-offload: on
ntuple-filters: on
receive-hashing: on
highdma: on
rx-vlan-filter: on
vlan-challenged: off [fixed]
tx-lockless: off [fixed]
netns-local: off [fixed]
tx-gso-robust: off [fixed]
tx-fcoe-segmentation: off [fixed]
tx-gre-segmentation: off [fixed]
tx-ipip-segmentation: off [fixed]
tx-sit-segmentation: off [fixed]
tx-udp_tnl-segmentation: on
tx-mpls-segmentation: off [fixed]
fcoe-mtu: off [fixed]
tx-nocache-copy: off
loopback: off [fixed]
rx-fcs: off [fixed]
rx-all: off [fixed]
tx-vlan-stag-hw-insert: off [fixed]
rx-vlan-stag-hw-parse: off [fixed]
rx-vlan-stag-filter: off [fixed]
l2-fwd-offload: off [fixed]
busy-poll: off [fixed]
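
Since the complaints involve ntuple/ATR, it may help to see at a glance
which of the features above can be toggled at all, i.e. everything not
marked [fixed]. A minimal sketch that parses `ethtool -k` output in the
format shown above; the function name and the SAMPLE excerpt are
hypothetical, picked only for illustration.

```python
def changeable_features(ethtool_k_output):
    """Return {feature: enabled} for every feature not marked [fixed]."""
    features = {}
    for line in ethtool_k_output.splitlines():
        name, sep, value = line.strip().partition(":")
        if not sep or not value.strip():
            continue  # blank lines and the "Features for eth0:" header
        state = value.split()
        if "[fixed]" in state:
            continue  # cannot be changed through ethtool
        features[name] = state[0] == "on"
    return features


# Illustrative excerpt of the output above.
SAMPLE = """\
Features for eth0:
rx-checksumming: on
large-receive-offload: off [fixed]
ntuple-filters: on
receive-hashing: on
tx-nocache-copy: off
"""

for name, enabled in changeable_features(SAMPLE).items():
    print(f"{name}: {'on' if enabled else 'off'}")
```

With the excerpt above this lists rx-checksumming, ntuple-filters,
receive-hashing and tx-nocache-copy, and skips large-receive-offload
because it is fixed.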

# lspci -nvv
02:00.1 0200: 8086:1572 (rev 01)
        Subsystem: 8086:0000
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 17
        Region 0: Memory at c2e000000 (64-bit, prefetchable) [size=8M]
        Region 3: Memory at c2f800000 (64-bit, prefetchable) [size=32K]
        Expansion ROM at dfa00000 [disabled] [size=512K]
        Capabilities: [40] Power Management version 3
                Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
        Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
                Address: 0000000000000000  Data: 0000
                Masking: 00000000  Pending: 00000000
        Capabilities: [70] MSI-X: Enable+ Count=129 Masked-
                Vector table: BAR=3 offset=00000000
                PBA: BAR=3 offset=00001000
        Capabilities: [a0] Express (v2) Endpoint, MSI 00
                DevCap: MaxPayload 2048 bytes, PhantFunc 0, Latency L0s <512ns, L1 <64us
                        ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset+
                DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+
                        RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- FLReset-
                        MaxPayload 128 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend+
                LnkCap: Port #0, Speed 8GT/s, Width x8, ASPM L1, Latency L0 <2us, L1 <16us
                        ClockPM- Surprise- LLActRep- BwNot-
                LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 8GT/s, Width x8, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
                DevCap2: Completion Timeout: Range ABCD, TimeoutDis+
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-
                LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-, Selectable De-emphasis: -6dB
                         Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
                         Compliance De-emphasis: -6dB
                LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
                         EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
        Capabilities: [e0] Vital Product Data
                Product Name: XL710 40GbE Controller
                Read-only fields:
                        [PN] Part number:
                        [EC] Engineering changes:
                        [FG] Unknown:
                        [LC] Unknown:
                        [MN] Manufacture ID:
                        [PG] Unknown:
                        [SN] Serial number:
                        [V0] Vendor specific:
                        [RV] Reserved: checksum good, 0 byte(s) reserved
                Read/write fields:
                        [V1] Vendor specific:
                End
        Capabilities: [100 v2] Advanced Error Reporting
                UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
                UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
                UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
                CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
                CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
                AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ ChkEn-
        Capabilities: [140 v1] Device Serial Number 50-17-33-ff-ff-ca-05-68
        Capabilities: [150 v1] Alternative Routing-ID Interpretation (ARI)
                ARICap: MFVC- ACS-, Next Function: 0
                ARICtl: MFVC- ACS-, Function Group: 0
        Capabilities: [160 v1] Single Root I/O Virtualization (SR-IOV)
                IOVCap: Migration-, Interrupt Message Number: 000
                IOVCtl: Enable- Migration- Interrupt- MSE- ARIHierarchy-
                IOVSta: Migration-
                Initial VFs: 64, Total VFs: 64, Number of VFs: 0, Function Dependency Link: 01
                VF offset: 335, stride: 1, Device ID: 154c
                Supported Page Size: 00000553, System Page Size: 00000001
                Region 0: Memory at 0000000c2f000000 (64-bit, prefetchable)
                Region 3: Memory at 0000000c2f810000 (64-bit, prefetchable)
                VF Migration: offset: 00000000, BIR: 0
        Capabilities: [1a0 v1] Transaction Processing Hints
                Device specific mode supported
                No steering table available
        Capabilities: [1b0 v1] Access Control Services
                ACSCap: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
                ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
        Kernel driver in use: i40e
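
To rule out a PCIe link that trained below its capability, the LnkCap
and LnkSta lines of the dump above can be compared mechanically. A
rough sketch, assuming single-device `lspci -vv` output with unwrapped
lines; the function names and sample strings are hypothetical.

```python
import re

# Matches "LnkCap:" / "LnkSta:" but not "LnkCap2:" / "LnkSta2:" / "LnkCtl2:".
LINK_RE = re.compile(r"(LnkCap|LnkSta):.*?Speed ([\d.]+)GT/s, Width x(\d+)")


def link_state(lspci_output):
    """Return {"LnkCap": (speed, width), "LnkSta": (speed, width)}."""
    found = {}
    for match in LINK_RE.finditer(lspci_output):
        # Keep only the first occurrence of each field (single device).
        found.setdefault(
            match.group(1), (float(match.group(2)), int(match.group(3)))
        )
    return found


def link_degraded(lspci_output):
    """True if the negotiated speed or width is below the capability."""
    state = link_state(lspci_output)
    cap_speed, cap_width = state["LnkCap"]
    sta_speed, sta_width = state["LnkSta"]
    return sta_speed < cap_speed or sta_width < cap_width


# Illustrative input: the two relevant lines from the dump above.
SAMPLE = """\
LnkCap: Port #0, Speed 8GT/s, Width x8, ASPM L1, Latency L0 <2us, L1 <16us
LnkSta: Speed 8GT/s, Width x8, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-
"""

print(link_state(SAMPLE))
print("degraded:", link_degraded(SAMPLE))
```

For the values in this dump both fields read 8GT/s at x8, so the check
reports no degradation.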

-- 
Regards,
Christian Ruppert

------------------------------------------------------------------------------
_______________________________________________
E1000-devel mailing list
E1000-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit
http://communities.intel.com/community/wired
