Hi Denis,

The fundamental issue right now is that in certain VLAN configurations
the system is not taking advantage of hardware polling as it should be.
This means that the system falls back to high watermarks for packet
rates that are a bit low and end up netting the performance around
roughly what you're seeing for a single thread.

As you might imagine, we're acutely aware of this problem and are in the
process of implementing RFD 97
(https://github.com/joyent/rfd/tree/master/rfd/0097) to address it,
which is already seeing promising results from our current experiments.

Robert

On 11/9/17 19:35 , Denis Cheong wrote:
> I am adding 10GbE to my existing SmartOS server but am experiencing unusual 
> and severe performance issues that I’m at a loss to explain.
> 
> Over the default untagged 10GbE link, I can get >9Gbit/sec consistently under 
> all configurations.
> As soon as I test over a VLAN, transfer rates plummet to a very inconsistent 
> 3-4Gbit/sec RX, and <1Gbit/sec TX.
> 
> Does anybody have any ideas what might be going on here?
> 
> Performance over default VLAN ID (SmartOS is running iperf3 -s; nb with 
> SmartOS as client and other host as server, performance is identical):
> 
> Connecting to host 192.168.245.14, port 5201
>       local 192.168.245.21 port 56809 connected to 192.168.245.14 port 5201
>       Interval           Transfer     Bandwidth
>                           0.00-1.00   sec  1.12 GBytes  9.58 Gbits/sec
>                           1.00-2.00   sec  1.12 GBytes  9.62 Gbits/sec
>                           2.00-3.00   sec  1.12 GBytes  9.62 Gbits/sec
>                           3.00-4.00   sec  1.12 GBytes  9.60 Gbits/sec
>                           4.00-5.00   sec  1.12 GBytes  9.59 Gbits/sec
>                           5.00-6.00   sec  1.12 GBytes  9.61 Gbits/sec
>                           6.00-7.00   sec  1.12 GBytes  9.59 Gbits/sec
>                           7.00-8.00   sec  1.10 GBytes  9.47 Gbits/sec
>                           8.00-9.00   sec  1.12 GBytes  9.60 Gbits/sec
>                           9.00-10.00  sec  1.12 GBytes  9.63 Gbits/sec
>   - - - - - - - - - - - - - - - - - - - - - - - -
>       Interval           Transfer     Bandwidth
>         0.00-10.00  sec  11.2 GBytes  9.59 Gbits/sec                  sender
>         0.00-10.00  sec  11.2 GBytes  9.59 Gbits/sec                  receive
> 
> Performance over the same link, but over VLAN 300 (SmartOS is running iperf3 
> -s; note wild variation from 2 - 5Gbit/sec):
> 
> Connecting to host 192.168.245.134, port 5201
>       local 192.168.245.133 port 56786 connected to 192.168.245.134 port 5201
>       Interval           Transfer     Bandwidth
>                           0.00-1.00   sec   523 MBytes  4.39 Gbits/sec
>                           1.00-2.00   sec   481 MBytes  4.04 Gbits/sec
>                           2.00-3.00   sec   608 MBytes  5.10 Gbits/sec
>                           3.00-4.00   sec   560 MBytes  4.70 Gbits/sec
>                           4.00-5.00   sec   242 MBytes  2.03 Gbits/sec
>                           5.00-6.00   sec   592 MBytes  4.96 Gbits/sec
>                           6.00-7.00   sec   553 MBytes  4.64 Gbits/sec
>                           7.00-8.00   sec   253 MBytes  2.12 Gbits/sec
>                           8.00-9.00   sec   569 MBytes  4.77 Gbits/sec
>                           9.00-10.00  sec   507 MBytes  4.25 Gbits/sec
>   - - - - - - - - - - - - - - - - - - - - - - - -
>       Interval           Transfer     Bandwidth
>         0.00-10.00  sec  4.77 GBytes  4.10 Gbits/sec                  sender
>         0.00-10.00  sec  4.77 GBytes  4.10 Gbits/sec                  receiver
> 
> Performance over the same link, VLAN 300, SmartOS as client, server on other 
> host (note significantly worse performance on transmit):
> 
> Connecting to host 192.168.245.133, port 5201
>       local 192.168.245.134 port 35851 connected to 192.168.245.133 port 5201
>       Interval           Transfer     Bandwidth
>                           0.00-1.00   sec   104 MBytes   875 Mbits/sec
>                           1.00-2.00   sec  46.3 MBytes   389 Mbits/sec
>                           2.00-3.00   sec   130 MBytes  1.09 Gbits/sec
>                           3.00-4.00   sec  76.0 MBytes   638 Mbits/sec
>                           4.00-5.00   sec  97.0 MBytes   814 Mbits/sec
>                           5.00-6.00   sec  17.4 MBytes   146 Mbits/sec
>                           6.00-7.00   sec  67.6 MBytes   567 Mbits/sec
>                           7.00-8.00   sec  92.4 MBytes   775 Mbits/sec
>                           8.00-9.00   sec  79.7 MBytes   669 Mbits/sec
>                           9.00-10.00  sec  73.3 MBytes   615 Mbits/sec
>   - - - - - - - - - - - - - - - - - - - - - - - -
>       Interval           Transfer     Bandwidth
>         0.00-10.00  sec   785 MBytes   658 Mbits/sec                  sender
>         0.00-10.00  sec   784 MBytes   658 Mbits/sec                  receiver
> 
> Other observations:
> Multiple parallel TCP transfers (-P 6) make no difference, aggregate transfer 
> rate is identical
> Initial real-world testing copying 6GB file to SmartOS confirmed transfer 
> rate at about 1Gbit/s (also observed through switch traffic monitoring at no 
> more than 1Gbit/s)
> UDP tests with iperf3 seem highly problematic (VLAN or not) - but not sure if 
> this is iperf3 issue or not (I suspect iperf3 given that TCP bandwidth tests 
> seem fine):
> Packets received out of order on SmartOS
> Extreme packet loss at anything >5M (yes 5mbit) bandwidth, can go up to 99%
> iperf3 in a zone vs gz makes no difference
> Eliminating the switch (Mikrotik CRS326-24G-2S+) and running direct fiber 
> between the two machines makes no difference.
> MTU is 9000 (switch max MTU is 9500).  Other endpoint is 9000.
> 
> Relevant system configuration:
> Intel S2600CO motherboard, 2 x Xeon E5-2670 CPUs
> Intel 10GbE dual-SFP+ NIC (I have another HP 10GbE card that I was originally 
> using that exhibited the same initial performance issues - but its max MTU in 
> SmartOS is 1500 - hence switched to the Intel card)
> Original PI 20170928T144204Z.  Upgraded to PI 20171109T032417 during testing 
> with no improvement
> 


-------------------------------------------
smartos-discuss
Archives: https://www.listbox.com/member/archive/184463/=now
RSS Feed: https://www.listbox.com/member/archive/rss/184463/25769125-55cfbc00
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=25769125&id_secret=25769125-7688e9fb
Powered by Listbox: http://www.listbox.com

Reply via email to