Hello, Todd Fujinaka & List

Am Montag, 16. März 2015 17:56:08 schrieb Fujinaka, Todd:
> These cards don't have firmware. They're just configuration settings. You
> can download the OpenSDM on the files section to decode the differences,
> but mostly it's identification information.

Great.
So we can safely rule the "firmware" item out of the list of suspects :-)

I kept moving things around, and now, on top of my list of possible 
bottlenecks is the mainboard.
When I change the position of the NICs on the board, the bottleneck stays with 
the PCIe socket, not with the NIC or the external network path.

============

my results in more detail:


I figured out that I can not only put one single iperf on my aggregated link, 
but still also access the individual links, since teql is a layer 3 
aggregation scheme, and all patways still have their own IP and MAC.
This way I also can rule out possible problems caused by the teql layer.

So I did
for i in 62 94 126 158 190 222; do ( iperf -c 192.168.130.$i -t1 & ) ; done
to call a number of iperf instances in parallel, each over a different GBit 
link.

On a "good link" I get > 980 MBit / s out of 1000 MBit possible.
On a "bad link", I get < 800 MBit / s

I can have all 4 Links on the "good NIC" and 1 link on the "bad NIC" at close 
to 1000 MBIt. 
A teql aggregation over those five links gets iperf  > 4.7 GBit/s out of 5 
Gbit, which is close to 95 % - I'd say: "perfect".

However, when I run two paralle iperf over the "bad NIC", bandwith gets down 
to 740 Mbit/s for each. Even worse, teql seems to be irritated by the 
reschedule / reordering, so aggregated bandwith goes down even more. But this 
is a knwon issue in link aggregation, so I really were desperate to work on 
top of "clean" physical links.

Next, I interchanged external connections, but the problem stuck with the 
NICs, so I can rule out external causes, like cabling, switches, VLAN setup, 
client system.

Then I interchanged the PCIe slot position of the NICs in the mainboard, and 
now I see that the "bad NIC" is still the one in PCI bus position 07:00 / 
08:00. So it is not a question of NIC Branding or a hardware failure there.

I can establish the following combined  iperf connection rates on "bad" NIC / 
slot:
1000 Mbit/s
730 + 730 Mbit/s
730 + 400 + 400 MBit/s
400 + 400 + 400 + 400 MBit/s

Those rates are reproducible at +- 20 MBit and independent on the traffic 
generated on the other NIC (which may be from 0 to 4x 1Gbit/s).

Up to now, I tried debug level 4 and 7 on loading module e1000e, but I still 
get no trace of packet loss in my syslog. 
Or do I still have to enable somthing?
Like some secret sysfs paramter?



Where to go from here?

update of Possible causes on my list

### - firmware problem (NICs)
!!!!     firmware problem ( Mainboard)
###- hardware problem (NICs, 
!!!     hardware Mainboard)
!!!!   conceptual limitation of hardware design
????  driver problem
????  kernel / scheduling issue / IRQ / race...whatever?
???? some realy weird hidden tweak paramater
####  still the nasty VC blade switch?
( which led me already to migrate from layer 2 bonding to layer 3 teql)
#### other network / client side issues
????  any more????

### ruled out
??? still under consideration
!!! top suspect


So Ill gooogle for sabertooth, but there are still some ?? on driver related 
issues.

Wolfgang Rosner

==============================================


more details on Mainboard 


(from hwinfo ...) 

  type 0x02 [0x0002]: 02 0f 02 00 01 02 03 04 05 09 06 03 00 0a 00
       str1: "ASUSTeK COMPUTER INC."
       str2: "SABERTOOTH 990FX R2.0"
       str3: "Rev 1.xx"
       str4: "141033355400133"


root@cruncher:/cluster/etc/scripts/available# lspci | grep -i ether
07:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
07:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
08:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
08:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
0c:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0c:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0d:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0d:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0e:05.0 Ethernet controller: Realtek Semiconductor Co., Ltd. 
RTL-8100/8101L/8139 PCI Fast Ethernet Adapter (rev 10)
10:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 
PCI Express Gigabit Ethernet Controller (rev 09)

root@cruncher:/cluster/etc/scripts/available# lspci -t
-[0000:00]-+-00.0
           +-02.0-[01]--+-00.0
           |            \-00.1
           +-04.0-[02]----00.0
           +-05.0-[03]----00.0
           +-09.0-[04]----00.0
           +-0a.0-[05-08]----00.0-[06-08]--+-00.0-[07]--+-00.0
           |                               |            \-00.1
           |                               \-01.0-[08]--+-00.0
           |                                            \-00.1
           +-0b.0-[09]--+-00.0
           |            \-00.1
           +-0d.0-[0a-0d]----00.0-[0b-0d]--+-02.0-[0c]--+-00.0
           |                               |            \-00.1
           |                               \-04.0-[0d]--+-00.0
           |                                            \-00.1
           +-11.0
           +-12.0
           +-12.2
           +-13.0
           +-13.2
           +-14.0
           +-14.2
           +-14.3
           +-14.4-[0e]----05.0
           +-14.5
           +-15.0-[0f]--
           +-15.1-[10]----00.0
           +-15.2-[11]----00.0
           +-15.3-[12]----00.0
           +-16.0
           +-16.2
           +-18.0
           +-18.1
           +-18.2
           +-18.3
           +-18.4
           \-18.5


root@cruncher:/cluster/etc/scripts/available# lspci -vvs 00:0a
00:0a.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (external gfx1 port A) (prog-if 00 [Normal decode])
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Bus: primary=00, secondary=05, subordinate=08, sec-latency=0
        I/O behind bridge: 00007000-00008fff
        Memory behind bridge: fc500000-fc6fffff
        Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort+ <SERR- <PERR-
        BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B-
                PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
        Capabilities: [50] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA 
PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [58] Express (v2) Root Port (Slot+), MSI 00
                DevCap: MaxPayload 128 bytes, PhantFunc 0
                        ExtTag+ RBE+
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- 
Unsupported-
                        RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 128 bytes, MaxReadReq 128 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- 
TransPend-
                LnkCap: Port #5, Speed 2.5GT/s, Width x1, ASPM L0s L1, Exit 
Latency L0s <1us, L1 <8us
                        ClockPM- Surprise- LLActRep+ BwNot+
                LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+ 
DLActive+ BWMgmt+ ABWMgmt-
                SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- 
Surprise-
                        Slot #10, PowerLimit 75.000W; Interlock- NoCompl+
                SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- 
LinkChg-
                        Control: AttnInd Unknown, PwrInd Unknown, Power- 
Interlock-
                SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet+ 
Interlock-
                        Changed: MRL- PresDet- LinkState-
                RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna- 
CRSVisible+
                RootCap: CRSVisible-
                RootSta: PME ReqID 0000, PMEStatus- PMEPending-
                DevCap2: Completion Timeout: Range ABCD, TimeoutDis+, LTR-, 
OBFF Not Supported ARIFwd+
                DevCtl2: Completion Timeout: 65ms to 210ms, TimeoutDis-, LTR-, 
OBFF Disabled ARIFwd-
                LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- 
SpeedDis-
                         Transmit Margin: Normal Operating Range, 
EnterModifiedCompliance- ComplianceSOS-
                         Compliance De-emphasis: -6dB
                LnkSta2: Current De-emphasis Level: -6dB, 
EqualizationComplete-, EqualizationPhase1-
                         EqualizationPhase2-, EqualizationPhase3-, 
LinkEqualizationRequest-
        Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit-
                Address: 00000000  Data: 0000
        Capabilities: [b0] Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] 
Device 5a14
        Capabilities: [b8] HyperTransport: MSI Mapping Enable+ Fixed+
        Capabilities: [100 v1] Vendor Specific Information: ID=0001 Rev=1 
Len=010 <?>
        Capabilities: [190 v1] Access Control Services
                ACSCap: SrcValid+ TransBlk+ ReqRedir+ CmpltRedir+ UpstreamFwd+ 
EgressCtrl- DirectTrans+
                ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- 
EgressCtrl- DirectTrans-
        Kernel driver in use: pcieport




root@cruncher:/cluster/etc/scripts/available# lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (external gfx0 port B) (rev 02)
00:02.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (PCI express gpp port B)
00:04.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (PCI express gpp port D)
00:05.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (PCI express gpp port E)
00:09.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (PCI express gpp port H)
00:0a.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (external gfx1 port A)
00:0b.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (NB-SB link)
00:0d.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI 
bridge (external gfx1 port B)
00:11.0 SATA controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode] (rev 40)
00:12.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:12.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:13.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:13.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 SMBus Controller 
(rev 42)
00:14.2 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 Azalia 
(Intel HDA) (rev 40)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 
LPC host controller (rev 40)
00:14.4 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 PCI to PCI 
Bridge (rev 40)
00:14.5 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 USB OHCI2 Controller
00:15.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 
PCI to PCI bridge (PCIE port 0)
00:15.1 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 
PCI to PCI bridge (PCIE port 1)
00:15.2 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB900 PCI to PCI 
bridge (PCIE port 2)
00:15.3 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB900 PCI to PCI 
bridge (PCIE port 3)
00:16.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:16.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] 
SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor 
Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor 
Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor 
Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor 
Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor 
Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor 
Function 5
01:00.0 3D controller: NVIDIA Corporation GF100GL [Tesla M2070] (rev a3)
01:00.1 Audio device: NVIDIA Corporation GF100 High Definition Audio 
Controller (rev a1)
02:00.0 SATA controller: ASMedia Technology Inc. ASM1062 Serial ATA Controller 
(rev 01)
03:00.0 SATA controller: ASMedia Technology Inc. ASM1062 Serial ATA Controller 
(rev 01)
04:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host 
Controller
05:00.0 PCI bridge: Integrated Device Technology, Inc. [IDT] PES12N3A PCI 
Express Switch (rev 04)
06:00.0 PCI bridge: Integrated Device Technology, Inc. [IDT] PES12N3A PCI 
Express Switch (rev 04)
06:01.0 PCI bridge: Integrated Device Technology, Inc. [IDT] PES12N3A PCI 
Express Switch (rev 04)
07:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
07:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
08:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
08:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (rev 06)
09:00.0 VGA compatible controller: NVIDIA Corporation GF106GL [Quadro 2000] 
(rev a1)
09:00.1 Audio device: NVIDIA Corporation GF106 High Definition Audio 
Controller (rev a1)
0a:00.0 PCI bridge: Integrated Device Technology, Inc. [IDT] PES12N3A PCI 
Express Switch (rev 0e)
0b:02.0 PCI bridge: Integrated Device Technology, Inc. [IDT] PES12N3A PCI 
Express Switch (rev 0e)
0b:04.0 PCI bridge: Integrated Device Technology, Inc. [IDT] PES12N3A PCI 
Express Switch (rev 0e)
0c:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0c:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0d:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0d:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet 
Controller (Copper) (rev 06)
0e:05.0 Ethernet controller: Realtek Semiconductor Co., Ltd. 
RTL-8100/8101L/8139 PCI Fast Ethernet Adapter (rev 10)
10:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 
PCI Express Gigabit Ethernet Controller (rev 09)
11:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host 
Controller
12:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host 
Controller


>
> Todd Fujinaka
> Software Application Engineer
> Networking Division (ND)
> Intel Corporation
> todd.fujin...@intel.com
> (503) 712-4565
>
> -----Original Message-----
> From: Wolfgang Rosner [mailto:wros...@tirnet.de]
> Sent: Monday, March 16, 2015 9:43 AM
> To: Scott Silverman
> Cc: e1000-devel@lists.sourceforge.net
> Subject: Re: [E1000-devel] rx_missed_errors with e1000e on 82571EB chip
> only on HP, not on Intel card
>
> Am Montag, 16. März 2015 14:14:06 schrieben Sie:
> > Probably unrelated,
>
> surely not!
>
> > but I remember that this was once a thing:
> > http://h20564.www2.hp.com/hpsc/doc/public/display?docId=emr_na-c015015
> > 74
>......

------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
E1000-devel mailing list
E1000-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel&#174; Ethernet, visit 
http://communities.intel.com/community/wired

Reply via email to