Officially, that means you need to talk to Dell first, and they need to
escalate to us.
I would also try the latest out-of-tree driver on e1000.sourceforge.com as well
(igb-5.2.5.tar.gz).
Todd Fujinaka
Software Application Engineer
Networking Division (ND)
Intel Corporation
[email protected]
(503) 712-4565
From: Dilip Uppugandla [mailto:[email protected]]
Sent: Friday, June 13, 2014 12:34 PM
To: Fujinaka, Todd
Cc: [email protected]
Subject: Re: [E1000-devel] igb rx lockup with high rx_fifo_errors
lspci | grep Ethernet output:
01:00.0 Ethernet controller: Intel Corporation 82599ES 10-Gigabit SFI/SFP+
Network Connection (rev 01)
Subsystem: Dell Ethernet 10G 4P X520/I350 rNDC
01:00.1 Ethernet controller: Intel Corporation 82599ES 10-Gigabit SFI/SFP+
Network Connection (rev 01)
Subsystem: Dell Ethernet 10G 4P X520/I350 rNDC
08:00.0 Ethernet controller: Intel Corporation I350 Gigabit Network Connection
(rev 01)
08:00.1 Ethernet controller: Intel Corporation I350 Gigabit Network Connection
(rev 01)
lspci -vvv output for the ethernet interface that exhibited this hung state:
08:00.0 Ethernet controller: Intel Corporation I350 Gigabit Network Connection
(rev 01)
Subsystem: Dell Device 1f73
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort-
<MAbort- >SERR- <PERR- INTx-
Latency: 0, Cache Line Size: 64 bytes
Interrupt: pin D routed to IRQ 19
Region 0: Memory at da680000 (32-bit, non-prefetchable) [size=512K]
Region 3: Memory at da7f8000 (32-bit, non-prefetchable) [size=16K]
Expansion ROM at d9800000 [disabled] [size=512K]
Capabilities: [40] Power Management version 3
Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA
PME(D0+,D1-,D2-,D3hot+,D3cold+)
Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
Address: 0000000000000000 Data: 0000
Masking: 00000000 Pending: 00000000
Capabilities: [70] MSI-X: Enable+ Count=10 Masked-
Vector table: BAR=3 offset=00000000
PBA: BAR=3 offset=00002000
Capabilities: [a0] Express (v2) Endpoint, MSI 00
DevCap: MaxPayload 512 bytes, PhantFunc 0, Latency L0s <512ns,
L1 <64us
ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+
DevCtl: Report errors: Correctable- Non-Fatal+ Fatal+
Unsupported+
RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ FLReset-
MaxPayload 128 bytes, MaxReadReq 512 bytes
DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr+
TransPend-
LnkCap: Port #0, Speed 5GT/s, Width x4, ASPM L0s L1, Latency L0
<4us, L1 <32us
ClockPM- Surprise- LLActRep- BwNot-
LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
LnkSta: Speed 5GT/s, Width x2, TrErr- Train- SlotClk+ DLActive-
BWMgmt- ABWMgmt-
DevCap2: Completion Timeout: Range ABCD, TimeoutDis+
DevCtl2: Completion Timeout: 65ms to 210ms, TimeoutDis-
LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-,
Selectable De-emphasis: -6dB
Transmit Margin: Normal Operating Range,
EnterModifiedCompliance- ComplianceSOS-
Compliance De-emphasis: -6dB
LnkSta2: Current De-emphasis Level: -6dB,
EqualizationComplete-, EqualizationPhase1-
EqualizationPhase2-, EqualizationPhase3-,
LinkEqualizationRequest-
Capabilities: [e0] Vital Product Data
Product Name: I350 GbE Controller
Read-only fields:
[V0] Vendor specific: FFV15.0.28
[PN] Part number: G61346
[MN] Manufacture ID: 31 30 32 38
[V1] Vendor specific: DSV1028VPDR.VER1.0
[V3] Vendor specific: DTINIC
[V4] Vendor specific:
DCM1001008FC52101008FC53201008FC54301008FC5
[V5] Vendor specific: NPY4
[V6] Vendor specific: PMT13
[V7] Vendor specific: NMVIntel Corp
[RV] Reserved: checksum good, 1 byte(s) reserved
End
Capabilities: [100 v2] Advanced Error Reporting
UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt-
RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt+ UnxCmplt+
RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
UESvrt: DLP+ SDES+ TLP+ FCP+ CmpltTO+ CmpltAbrt- UnxCmplt-
RxOF+ MalfTLP+ ECRC+ UnsupReq- ACSViol-
CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
CEMsk: RxErr+ BadTLP+ BadDLLP+ Rollover+ Timeout+ NonFatalErr+
AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ ChkEn-
Capabilities: [140 v1] Device Serial Number b8-ca-3a-ff-ff-6c-f6-f7
Capabilities: [150 v1] Alternative Routing-ID Interpretation (ARI)
ARICap: MFVC- ACS-, Next Function: 1
ARICtl: MFVC- ACS-, Function Group: 0
Capabilities: [1a0 v1] Transaction Processing Hints
Device specific mode supported
Steering table in TPH capability structure
Capabilities: [1c0 v1] Latency Tolerance Reporting
Max snoop latency: 0ns
Max no snoop latency: 0ns
Capabilities: [1d0 v1] Access Control Services
ACSCap: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd-
EgressCtrl- DirectTrans-
ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd-
EgressCtrl- DirectTrans-
Kernel driver in use: igb
Thanks
~Dilip
On Fri, Jun 13, 2014 at 7:54 AM, Fujinaka, Todd
<[email protected]<mailto:[email protected]>> wrote:
It's hard to tell what hardware you're running. Can you send the output of
"lspci | grep Ethernet"?
Todd Fujinaka
Software Application Engineer
Networking Division (ND)
Intel Corporation
[email protected]<mailto:[email protected]>
(503) 712-4565<tel:%28503%29%20712-4565>
-----Original Message-----
From: Dilip Kumar Uppugandla
[mailto:[email protected]<mailto:[email protected]>]
Sent: Thursday, June 12, 2014 9:02 PM
To: [email protected]<mailto:[email protected]>
Subject: [E1000-devel] igb rx lockup with high rx_fifo_errors
Hi,
We are noticing a problem with the igb driver in Linux 3.6.11 kernel. The
version of the driver is 4.0.1-k with firmware version: 1.67, 0x0b858000,
15.0.28.
eth0 seems to get into a hung state where packets can go out of the interface
(tx counters increase), but none are received (rx_packets increase at the same
rate as rx_fifo_errors counter). Upping and downing the interface seemed to
clear the hung state. Collected the following information from the device while
it is rx hung state. The register dump anytime during the hung state shows the
Receive desc head and tail pointers pointing to 0x0.
ethtool -i eth0:
driver: igb
version: 4.0.1-k
firmware-version: 1.67, 0x0b858000, 15.0.28
bus-info: 0000:08:00.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: no
ethtool -S eth0:
eth0 statistics:
rx_packets: 21324
tx_packets: 30864
rx_bytes: 1463656
tx_bytes: 7949422
rx_broadcast: 1767
tx_broadcast: 16953
rx_multicast: 0
tx_multicast: 12
.......
tx_tcp_seg_good: 1286
.........
rx_fifo_errors: 18867
ethtool -d eth0:
......
0x00100: RCTL (Receive control register) 0x04048022
Receiver: enabled
Store bad packets: disabled
Unicast promiscuous: disabled
Multicast promiscuous: disabled
Long packet: enabled
Descriptor minimum threshold size: 1/2
Broadcast accept mode: accept
VLAN filter: enabled
Cononical form indicator: disabled
Discard pause frames: filtered
Pass MAC control frames: don't pass
Loopback mode: normal
Receive buffer size: 2048
0x02808: RDLEN (Receive desc length) 0x00001000
0x02810: RDH (Receive desc head) 0x00000000
0x02818: RDT (Receive desc tail) 0x00000000
0x00400: TCTL (Transmit ctrl register) 0xA503F0FA
......
Is this a known issue? The issue of rx being hung was observed only once in our
testing.
What are the next steps to address this issue?
Thanks
~Dilip
------------------------------------------------------------------------------
HPCC Systems Open Source Big Data Platform from LexisNexis Risk Solutions Find
What Matters Most in Your Big Data with HPCC Systems Open Source. Fast.
Scalable. Simple. Ideal for Dirty Data.
Leverages Graph Analysis for Fast Processing & Easy Data Exploration
http://p.sf.net/sfu/hpccsystems _______________________________________________
E1000-devel mailing list
[email protected]<mailto:[email protected]>
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit
http://communities.intel.com/community/wired
------------------------------------------------------------------------------
HPCC Systems Open Source Big Data Platform from LexisNexis Risk Solutions
Find What Matters Most in Your Big Data with HPCC Systems
Open Source. Fast. Scalable. Simple. Ideal for Dirty Data.
Leverages Graph Analysis for Fast Processing & Easy Data Exploration
http://p.sf.net/sfu/hpccsystems
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit
http://communities.intel.com/community/wired