[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2020-01-14 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Kubilay Kocak  changed:

   What|Removed |Added

        URL||https://reviews.freebsd.org/D14985
 Resolution|--- |FIXED
 Status|New |Closed
  Flags||mfc-stable11+

--- Comment #26 from Kubilay Kocak  ---
^Triage: 

 - Close (appears resolved)
 - Track MFC
   - head was 12.x in base r333149
   - MFC'd to stable/11 in base r43

-- 
You are receiving this mail because:
You are on the CC list for the bug.
___
freebsd-net@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-12-18 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #25 from Peter Eriksson  ---
> Do you have any scripts that reconfigure the interface (ifconfig ixl0 down 
> up/mtu/tso) while it receives/transmits traffic?

Hmm... Aha! Bingo!

When testing on the production servers (that always receive SMB/NFS/SSH
traffic) I just did an "ifconfig lagg0 tso" to enable it and then started my
iperf3 testing (and some off and on to get test data).

I can now reliably reproduce this if I start an "iperf3" test session between
two servers and while it is running disable / enable tso "on the fly".

I can now provoke the "hang" on:
FreeBSD 11.2-RELEASE-p6 with ixl driver 1.9.9-k and firmware 5.60
FreeBSD 11.2-RELEASE-p6 with ixl driver 1.10.4 and firmware 6.80

I've not (so far) been able to provoke it to occur on:
FreeBSD 12.0-RELEASE-p0 with ixl driver 2.1.0-k and firmware 6.80

Sometimes it self-heals after a while, but most often I have to do an "ifconfig
lagg0 down ; ifconfig lagg0 up" to get it to recover.
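
A rough sketch of the reproduction described above: toggle TSO on the interface while a traffic test is already running, then cycle the interface to recover. The function names, the `DRY_RUN` switch, and the count and pause values are illustrative assumptions, not part of the report. With `DRY_RUN=1` (the default here) the commands are only printed.

```sh
# Sketch only. DRY_RUN=1 prints the commands instead of running them.
: "${DRY_RUN:=1}"

run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# toggle_tso IFACE COUNT PAUSE: switch TSO off and on COUNT times,
# waiting PAUSE seconds after each change.
toggle_tso() {
    iface=$1
    count=$2
    pause=$3
    i=0
    while [ "$i" -lt "$count" ]; do
        run ifconfig "$iface" -tso
        run sleep "$pause"
        run ifconfig "$iface" tso
        run sleep "$pause"
        i=$((i + 1))
    done
}

# recover IFACE: the down/up cycle used to get the interface going again.
recover() {
    run ifconfig "$1" down
    run ifconfig "$1" up
}
```

One way to use it would be to start a long iperf3 client run in the background (for example `iperf3 -c <peer> -t 60 &`), call `toggle_tso lagg0 10 2` as root with `DRY_RUN=0`, and call `recover lagg0` if traffic stops.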



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-12-17 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Aleksandr Ivanov  changed:

   What|Removed |Added

 CC||haro...@gmail.com

--- Comment #24 from Aleksandr Ivanov  ---
(In reply to Peter Eriksson from comment #22)

Do you have any scripts that reconfigure the interface (ifconfig ixl0 down
up/mtu/tso) while it receives/transmits traffic?



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-12-17 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #23 from Eric Joyner  ---
(In reply to Peter Eriksson from comment #22)

Your issue doesn't look like the original bug's because your logs don't mention
MDD events.

Maybe there's some interaction with the driver and lagg?



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-12-16 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #22 from Peter Eriksson  ---
I'm running 1.10.4 on one of our test servers (running 11.2-RELEASE-p5), and
also 2.1.0-k on FreeBSD 12.0 on another. Both work fine. But then again, so
does the same version (1.9.9-k) that fails on the production servers... I
hesitate a bit to experiment on the production systems.

I've been trying to provoke the problem (without having to set up Samba servers
and a couple of hundred Windows clients to connect with and simulate the users)
to happen on the test servers a bit today but so far no real luck... Sigh.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-12-16 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

ix...@riseup.net changed:

   What|Removed |Added

 CC||ix...@riseup.net

--- Comment #21 from ix...@riseup.net ---
Hi,
have you tried the recently released version 1.10.4?
(https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=233531)

and FreeBSD 12.0 comes with ixl version 2.1.0-k



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-12-15 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #20 from Peter Eriksson  ---
Just a quick note that we're still seeing the same problem on our production
servers if we enable "tso" on the 10G interfaces. FreeBSD 11.2-RELEASE-p6.
Haven't been able to reproduce it on the test servers (identical hardware)
running 11.2-RELEASE-p5 (and 12.0-RELEASE) so far though (but they don't see
any traffic)...

Driver version:
> dev.ixl.0.%desc: Intel(R) Ethernet Connection 700 Series PF Driver, Version - 1.9.9-k

Firmware:
> dev.ixl.0.fw_version: fw 6.80.48812 api 1.7 nvm 6.00 etid 80003751 oem 18.4608.17

Watchdog events in the output from sysctl -a:
> dev.ixl.0.watchdog_events: 4
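
For collecting the same values on several hosts, here is a small sketch that pulls the driver version out of a `%desc` line and sums the `watchdog_events` counters. The helper names are made up for illustration, and the parsing assumes the sysctl output format quoted above.

```sh
# driver_version: read a dev.ixl.N.%desc line (or value) on stdin and
# print the version string that follows "Version - ".
driver_version() {
    sed -n 's/.*Version - *\([^ ]*\) *$/\1/p'
}

# watchdog_total: read "name: value" sysctl lines on stdin and print the
# sum of all *.watchdog_events counters (0 if none are present).
watchdog_total() {
    awk -F': ' '/\.watchdog_events:/ { sum += $2 } END { print sum + 0 }'
}
```

Possible usage: `sysctl -n 'dev.ixl.0.%desc' | driver_version` and `sysctl dev.ixl | watchdog_total`.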

Dmesg errors:
> ixl0: WARNING: queue 3 appears to be hung!
> ixl0: WARNING: queue 2 appears to be hung!
> ixl2: WARNING: queue 2 appears to be hung!
> ixl2: WARNING: queue 4 appears to be hung!
> ixl2: WARNING: queue 7 appears to be hung!
> ixl2: WARNING: queue 3 appears to be hung!
> ixl0: WARNING: queue 7 appears to be hung!
> ixl2: WARNING: queue 3 appears to be hung!
> ixl0: WARNING: queue 4 appears to be hung!
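
To see which queues are affected most often, warnings like the ones above can be tallied per interface and queue. This is a sketch that assumes the message wording shown here; `hung_summary` is an illustrative name.

```sh
# hung_summary: read kernel messages on stdin and print one line per
# (interface, queue) pair with the number of "appears to be hung" warnings.
hung_summary() {
    sed -n 's/.*\(ixl[0-9][0-9]*\): WARNING: queue \([0-9][0-9]*\) appears to be hung!.*/\1 \2/p' |
        sort |
        uniq -c |
        awk '{ print $2, "queue", $3, "count", $1 }'
}
```

Possible usage: `dmesg | hung_summary`.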

(Output from ifconfig with TSO disabled)
> # ifconfig lagg0
> lagg0: flags=8843 metric 0 mtu 1500
>   options=6404bb
>   ether 3c:fd:fe:25:47:a0
>   inet6 fe80::3efd:feff:fe25:47a0%lagg0 prefixlen 64 scopeid 0xa
>   inet6 2001:6b0:17:2400::8:43 prefixlen 64
>   inet 130.236.8.43 netmask 0xffe0 broadcast 130.236.8.63
>   nd6 options=21
>   media: Ethernet autoselect
>   status: active
>   groups: lagg
>   laggproto lacp lagghash l2,l3,l4
>   laggport: ixl0 flags=1c
>   laggport: ixl2 flags=1c

iperf3 output with TSO disabled:
> # iperf3 -c filur00 -t4
> Connecting to host filur00, port 5201
> [  5] local 2001:6b0:17:2400::8:43 port 51226 connected to 
> 2001:6b0:17:2400::8:40 port 5201
> [ ID] Interval           Transfer     Bitrate         Retr  Cwnd
> [  5]   0.00-1.00   sec   318 MBytes  2.66 Gbits/sec    0    561 KBytes
> [  5]   1.00-2.00   sec   350 MBytes  2.94 Gbits/sec    0   1.11 MBytes
> [  5]   2.00-3.00   sec   392 MBytes  3.28 Gbits/sec    0   1.67 MBytes
> [  5]   3.00-4.00   sec   351 MBytes  2.94 Gbits/sec    0   1.77 MBytes
> - - - - - - - - - - - - - - - - - - - - - - - - -
> [ ID] Interval           Transfer     Bitrate         Retr
> [  5]   0.00-4.00   sec  1.38 GBytes  2.95 Gbits/sec    0             sender
> [  5]   0.00-4.00   sec  1.38 GBytes  2.95 Gbits/sec                  receiver
> 
> iperf Done.


With TSO enabled (when things work):

> # ifconfig lagg0 tso ; iperf3 -c filur00 -t4
> Connecting to host filur00, port 5201
> [  5] local 2001:6b0:17:2400::8:43 port 51237 connected to 
> 2001:6b0:17:2400::8:40 port 5201
> [ ID] Interval           Transfer     Bitrate         Retr  Cwnd
> [  5]   0.00-1.00   sec   976 MBytes  8.19 Gbits/sec    0    492 KBytes
> [  5]   1.00-2.00   sec  1.08 GBytes  9.29 Gbits/sec    0   1021 KBytes
> [  5]   2.00-3.00   sec  1.08 GBytes  9.29 Gbits/sec    0   1.50 MBytes
> [  5]   3.00-4.00   sec  1.08 GBytes  9.28 Gbits/sec    0   1.75 MBytes
> - - - - - - - - - - - - - - - - - - - - - - - - -
> [ ID] Interval           Transfer     Bitrate         Retr
> [  5]   0.00-4.00   sec  4.20 GBytes  9.01 Gbits/sec    0             sender
> [  5]   0.00-4.00   sec  4.19 GBytes  9.01 Gbits/sec                  receiver
> 
> iperf Done.

But often the queues get stuck and freeze. Hmm.. I just noticed that it was IPv6
that stopped working when I tried to enable TSO on a production server and ran
iperf3 on it - IPv4 traffic was still passing through.

Can it be that there still are IPv6 (TSO6)-related bugs and that the IPv4 ones
are solved? Too bad I can't find a way to force it to happen on the test
servers...



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-05-01 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #19 from commit-h...@freebsd.org ---
A commit references this bug:

Author: erj
Date: Tue May  1 18:50:13 UTC 2018
New revision: 333149
URL: https://svnweb.freebsd.org/changeset/base/333149

Log:
  ixl(4): Update to 1.9.9-k

  Refresh upstream driver before impending conversion to iflib.

  Major changes:

  - Support for descriptor writeback mode (required by ixlv(4) for AVF support)
  - Ability to disable firmware LLDP agent by user (PR 221530)
  - Fix for TX queue hang when using TSO (PR 221919)
  - Separate descriptor ring sizes for TX and RX rings

  PR:           221530, 221919
  Submitted by: Krzysztof Galazka
  Reviewed by:  #IntelNetworking
  MFC after:    1 day
  Relnotes:     Yes
  Sponsored by: Intel Corporation
  Differential Revision: https://reviews.freebsd.org/D14985

Changes:
  head/sys/conf/files.amd64
  head/sys/dev/ixl/i40e_adminq.c
  head/sys/dev/ixl/i40e_adminq.h
  head/sys/dev/ixl/i40e_adminq_cmd.h
  head/sys/dev/ixl/i40e_alloc.h
  head/sys/dev/ixl/i40e_common.c
  head/sys/dev/ixl/i40e_dcb.c
  head/sys/dev/ixl/i40e_dcb.h
  head/sys/dev/ixl/i40e_devids.h
  head/sys/dev/ixl/i40e_hmc.c
  head/sys/dev/ixl/i40e_hmc.h
  head/sys/dev/ixl/i40e_lan_hmc.c
  head/sys/dev/ixl/i40e_lan_hmc.h
  head/sys/dev/ixl/i40e_nvm.c
  head/sys/dev/ixl/i40e_osdep.c
  head/sys/dev/ixl/i40e_osdep.h
  head/sys/dev/ixl/i40e_prototype.h
  head/sys/dev/ixl/i40e_register.h
  head/sys/dev/ixl/i40e_status.h
  head/sys/dev/ixl/i40e_type.h
  head/sys/dev/ixl/i40e_virtchnl.h
  head/sys/dev/ixl/if_ixl.c
  head/sys/dev/ixl/if_ixlv.c
  head/sys/dev/ixl/ixl.h
  head/sys/dev/ixl/ixl_iw.c
  head/sys/dev/ixl/ixl_iw.h
  head/sys/dev/ixl/ixl_iw_int.h
  head/sys/dev/ixl/ixl_pf.h
  head/sys/dev/ixl/ixl_pf_i2c.c
  head/sys/dev/ixl/ixl_pf_iov.c
  head/sys/dev/ixl/ixl_pf_iov.h
  head/sys/dev/ixl/ixl_pf_main.c
  head/sys/dev/ixl/ixl_pf_qmgr.c
  head/sys/dev/ixl/ixl_pf_qmgr.h
  head/sys/dev/ixl/ixl_txrx.c
  head/sys/dev/ixl/ixlv.h
  head/sys/dev/ixl/ixlv_vc_mgr.h
  head/sys/dev/ixl/ixlvc.c
  head/sys/dev/ixl/virtchnl.h
  head/sys/modules/ixl/Makefile



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-04-17 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Steven Hartland  changed:

   What|Removed |Added

 CC||s...@freebsd.org

--- Comment #18 from Steven Hartland  ---
That review is still pending.

We've just had what appears to be an RX hang with TSO disabled. Related?

There were no messages in /var/log/messages; tcpdump still showed outbound
traffic but no inbound, and we had to reboot to recover.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-04-06 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Krzysztof Galazka  changed:

   What|Removed |Added

 CC||krzysztof.gala...@intel.com

--- Comment #17 from Krzysztof Galazka  ---
The Phabricator review which should fix this issue:
https://reviews.freebsd.org/D14985



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-03-29 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #16 from Jeff Pieper  ---
We will have a patch ready for Phabricator soon. It should be committed before
code freeze.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-03-29 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Jeff Pieper  changed:

   What|Removed |Added

 CC||jeffrey.e.pie...@intel.com

--- Comment #15 from Jeff Pieper  ---
We will have a patch ready for Phabricator soon. It should be committed before
code freeze.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-03-28 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #14 from Jason Tubnor  ---
Has the Intel driver been upstreamed yet, in time to make 11.2-RELEASE? re@ has
just sent a reminder of the release schedule.  If the updated vendor driver
works for others here who run their own build service, can it be merged in
time for 11.2 for those who follow supported updates?

Thanks.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-02-14 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Jason Tubnor  changed:

   What|Removed |Added

 CC||ja...@tubnor.net

--- Comment #13 from Jason Tubnor  ---
I am also seeing this on our Lenovo SR650 7x06 servers.  We too are using 10GbE
XL710 cards:

Intel(R) Ethernet Controller X710 for 10GbE SFP+

# pciconf -l | grep ixl
ixl0@pci0:10:0:0:   class=0x02 card=0x402117aa chip=0x37d18086 rev=0x09 hdr=0x00
ixl1@pci0:10:0:1:   class=0x02 card=0x402117aa chip=0x37d18086 rev=0x09 hdr=0x00
ixl2@pci0:10:0:2:   class=0x02 card=0x402117aa chip=0x37d18086 rev=0x09 hdr=0x00
ixl3@pci0:10:0:3:   class=0x02 card=0x402117aa chip=0x37d18086 rev=0x09 hdr=0x00
ixl4@pci0:174:0:0:  class=0x02 card=0x000a8086 chip=0x15728086 rev=0x01 hdr=0x00
ixl5@pci0:174:0:1:  class=0x02 card=0x8086 chip=0x15728086 rev=0x01 hdr=0x00

snip from /var/log/messages:

Feb 15 09:50:53 server01 kernel: ixl5: Malicious Driver Detection event 2 on TX queue 769, pf number 1
Feb 15 09:50:53 server01 kernel: ixl5: MDD TX event is for this function!
Feb 15 09:50:54 server01 kernel: ixl5: WARNING: queue 0 appears to be hung!
Feb 15 09:50:54 server01 kernel: ixl5: WARNING: Resetting!
Feb 15 09:50:57 server01 kernel: WARNING: 192.168.1.14 (iqn.1998-01.com.vmware:HOST-): no ping reply (NOP-Out) after 5 seconds; dropping connection
Feb 15 09:51:25 server01 kernel: ixl5: Malicious Driver Detection event 2 on TX queue 775, pf number 1
Feb 15 09:51:25 server01 kernel: ixl5: MDD TX event is for this function!
Feb 15 09:51:29 server01 kernel: WARNING: 192.168.1.14 (iqn.1998-01.com.vmware:HOST-): no ping reply (NOP-Out) after 5 seconds; dropping connection
Feb 15 09:51:53 server01 kernel: ixl5: WARNING: queue 7 appears to be hung!
Feb 15 09:51:53 server01 kernel: ixl5: WARNING: Resetting!
Feb 15 09:51:55 server01 kernel: ixl5: Malicious Driver Detection event 2 on TX queue 768, pf number 1
Feb 15 09:51:55 server01 kernel: ixl5: MDD TX event is for this function!
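
For anyone comparing logs across machines, the MDD lines can be reduced to their key fields. This is a sketch that assumes the message wording shown above and expects each event on a single line, as syslog writes it; `mdd_events` is an illustrative name.

```sh
# mdd_events: read syslog/dmesg lines on stdin and print, for each
# Malicious Driver Detection message, the interface, event number,
# TX queue and PF number.
mdd_events() {
    sed -n 's/.*\(ixl[0-9][0-9]*\): Malicious Driver Detection event \([0-9][0-9]*\) on TX queue \([0-9][0-9]*\), pf number \([0-9][0-9]*\).*/\1 event=\2 txq=\3 pf=\4/p'
}
```

Possible usage: `mdd_events < /var/log/messages`.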

This is easily reproduced when connecting 10GbE VMware ESXi hosts to
these storage servers via iSCSI.  We could trigger it by performing a vMotion
move from one datastore to another.

I do not have a server on which I can test any patches: the three of these that
exist are in production running 11.1-RELEASE, and we cannot afford to have them
off-line or to deviate from the standard supported freebsd-update mechanism.

I hope something can be worked out pretty soon and rolled into an update, as
this issue can't wait for 11.2 or 12 in our case.

I will be trying out -tso, but was trying to avoid that for performance
reasons.

Thanks!



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-02-12 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Ed Schouten  changed:

   What|Removed |Added

 CC||e...@freebsd.org

--- Comment #12 from Ed Schouten  ---
Hi Eric,

We're also experiencing these issues on a SuperMicro system having these NICs:

# pciconf -l | grep ixl
ixl0@pci0:2:0:0:class=0x02 card=0x089e15d9 chip=0x15728086 rev=0x02 hdr=0x00
ixl1@pci0:2:0:1:class=0x02 card=0x15d9 chip=0x15728086 rev=0x02 hdr=0x00

I'll integrate the ixl-1.9.5 driver into our own codebase to work around this
issue, but it would be nice if this driver got upstreamed instead. Are there
any concrete plans for doing this?

Thanks,
Ed



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2018-01-04 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #11 from Eric Joyner  ---
This should be fixed in ixl-1.9.5.

We're working on getting that upstreamed, but in the meantime, you can download
it from the Intel download center.

https://downloadcenter.intel.com/download/25160/Intel-Network-Adapter-Driver-for-PCIe-40-Gigabit-Ethernet-Network-Connection-under-FreeBSD-?product=36773



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-28 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #10 from Garrett Wollman  ---
(In reply to Ryan Stone from comment #7)
Doesn't seem to have made any difference.  (Had to wait for the post-Xmas
outage window now that this server is in production.)



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-28 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Wallace  changed:

   What|Removed |Added

 CC||in...@incin.me

--- Comment #9 from Wallace  ---
We are having the same issue on a new Supermicro server purchased this month.

ixl0@pci0:26:0:0:   class=0x02 card=0x37d215d9 chip=0x37d28086 rev=0x09
hdr=0x00
vendor = 'Intel Corporation'
device = 'Ethernet Connection X722 for 10GBASE-T'
class  = network
subclass   = ethernet

Dec 27 10:23:03 hostname kernel: ixl0: WARNING: queue 1 appears to be hung!
Dec 27 10:23:03 hostname kernel: ixl0: WARNING: Resetting!
Dec 27 10:23:10 hostname kernel: ixl0: Malicious Driver Detection event 14 on
TX queue 1, pf number 0
Dec 27 10:23:10 hostname kernel: ixl0: MDD TX event is for this function!

After playing with lro and tso things seemed to be better. No more errors in
the log files and NFS shares seemed more stable. Over the past week it seemed
it didn't matter if there was light or heavy traffic. 


Errors: 

ixl0: flags=8843 metric 0 mtu 1500
   
options=6407bb
ether ac:1f:6b:61:a3:80
hwaddr ac:1f:6b:61:a3:80
inet x.x.x.x netmask 0xf800 broadcast x.x.x.x
inet x.x.x.x netmask 0xf800 broadcast x.x.x.x 
nd6 options=29
media: Ethernet autoselect (10Gbase-T )
status: active

No Errors:

[root@backup0 ~]# ifconfig ixl0 -lro -tso
[root@backup0 ~]# ifconfig ixl0
ixl0: flags=8843 metric 0 mtu 1500
   
options=6400bb
ether ac:1f:6b:61:a3:80
hwaddr ac:1f:6b:61:a3:80
inet x.x.x.x netmask 0xf800 broadcast x.x.x.x 
inet x.x.x.x netmask 0xf800 broadcast x.x.x.x
nd6 options=29
media: Ethernet
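
ifconfig normally lists capability names (for example TSO4, TSO6, LRO) in angle brackets after the options value. A small check on that list can confirm whether the offloads are really off after a change like the one above. This is a sketch assuming that output format; `offloads_enabled` is an illustrative name.

```sh
# offloads_enabled: read `ifconfig IFACE` output on stdin and print the
# TSO/LRO capability names found on the interface "options=" line,
# one per line. Prints nothing if none of them are enabled.
offloads_enabled() {
    sed -n 's/^[[:space:]]*options=[0-9a-f]*<\(.*\)>.*/\1/p' |
        head -n 1 |
        tr ',' '\n' |
        grep -E '^(TSO4|TSO6|LRO)$' || true
}
```

Possible usage: `ifconfig ixl0 | offloads_enabled`. Empty output would suggest that `-lro -tso` took effect.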

[root@hostname /var/log]# freebsd-version -k
11.1-RELEASE-p4
[root@hostname /var/log]# freebsd-version -u
11.1-RELEASE-p6

I can provide more info if anyone needs it or help debug the issue more.

Thanks!



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-28 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

KurtC  changed:

   What|Removed |Added

 CC||k...@thinkcsc.com

--- Comment #8 from KurtC  ---
I am running into this exact Malicious Driver Detection event under high load
on a X710-DA2 running driver 1.7.12.  Disabling TSO does not fix the problem
for me.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-12 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #7 from Ryan Stone  ---
Sorry, there was a mistake in the patch.  I think that something got lost in
translation when I ported it forward.  I've regenerated the patch at the same
location, or you can replace this line in ixl_pf_main.c:

ifp->if_hw_tsomaxsegpermss = IXL_MAX_TX_SEGS;

with

ifp->if_hw_tsomaxsegpermss = IXL_SPARSE_CHAIN;
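
If it is easier than re-fetching the patch, the one-line replacement above could be scripted roughly as follows. This is a sketch: `swap_tso_limit` is an illustrative name, and the source tree location in the usage note is an assumption about the local setup.

```sh
# swap_tso_limit FILE: in FILE, change the if_hw_tsomaxsegpermss
# assignment from IXL_MAX_TX_SEGS to IXL_SPARSE_CHAIN. Other uses of
# IXL_MAX_TX_SEGS are left untouched.
swap_tso_limit() {
    file=$1
    tmp=$(mktemp) || return 1
    if sed 's/\(if_hw_tsomaxsegpermss = \)IXL_MAX_TX_SEGS;/\1IXL_SPARSE_CHAIN;/' "$file" > "$tmp"; then
        # Write back through cat so the file keeps its owner and mode.
        cat "$tmp" > "$file"
        rc=$?
    else
        rc=1
    fi
    rm -f "$tmp"
    return "$rc"
}
```

Possible usage, assuming the sources live under /usr/src: `swap_tso_limit /usr/src/sys/dev/ixl/ixl_pf_main.c`, followed by a kernel or module rebuild.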


Sorry for the confusion.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-12 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Garrett Wollman  changed:

   What|Removed |Added

 CC||woll...@freebsd.org

--- Comment #6 from Garrett Wollman  ---
Applied the patch from #c3 to my 11.1 source tree and found that it did not
improve matters.  It would be better if this "feature" could simply be
disabled, as the Linux drivers (apparently?) allow.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-07 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #5 from Peter Eriksson  ---
I haven't had time to test the patch yet (started on it but got side-tracked
with other bugs), but I'll make another attempt. Might not happen until this
weekend or early next week though.

One problem is that the issue takes some time to pop up. I tried creating a test
setup that would force it on our test system but so far it has only shown
itself on our production systems :-(

It would be nice to be able to trigger the bug on the non-production system :-)



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-07 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Eugene Grosbein  changed:

   What|Removed |Added

 CC||eu...@freebsd.org,
   ||freebsd-net@FreeBSD.org
   Assignee|freebsd-net@FreeBSD.org |e...@freebsd.org



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-12-07 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

--- Comment #4 from Eric Joyner  ---
We did find another bug in the function to detect packets that violate the HW
restriction on how many buffers each segment in a TSO can span (and the fix
will be in the next update to the driver in 12), but Ryan's patch should ensure
packets like those don't reach the driver.
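
To make the restriction concrete, here is a simplified sketch that computes how many buffers the widest MSS-sized segment of a payload touches. It is not the driver's check: it ignores protocol headers, treats the payload as a flat list of buffer lengths, and leaves the hardware limit to the caller. The function name is made up for illustration.

```sh
# max_bufs_per_segment MSS LEN...: given an MSS and the byte lengths of the
# buffers holding a TSO payload (in order), print the largest number of
# buffers that any single MSS-sized segment overlaps.
max_bufs_per_segment() {
    mss=$1
    shift
    printf '%s\n' "$@" | awk -v mss="$mss" '
    { len[NR] = $1 + 0 }
    END {
        # Record the starting offset of each buffer and the total length.
        total = 0
        for (i = 1; i <= NR; i++) { bstart[i] = total; total += len[i] }
        max = 0
        # Walk the payload one segment at a time: [s, e).
        for (s = 0; s < total; s += mss) {
            e = s + mss
            if (e > total) e = total
            n = 0
            for (i = 1; i <= NR; i++)
                if (bstart[i] < e && bstart[i] + len[i] > s) n++
            if (n > max) max = n
        }
        print max
    }'
}
```

For example, `max_bufs_per_segment 1000 1500 1500` prints 2, because the second segment straddles both buffers. A chain of many small buffers gives a larger value, which is the kind of packet the restriction is about.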

Could you report if it works for you guys, Nikita and Peter?



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-10-31 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Ryan Stone  changed:

   What|Removed |Added

 CC||rst...@freebsd.org

--- Comment #3 from Ryan Stone  ---
How reproducible is the hang?  Could you please try this patch and confirm
whether it fixes your issue?

https://people.freebsd.org/~rstone/patches/ixl_tsosegpermss.diff



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-10-31 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Peter Eriksson  changed:

   What|Removed |Added

 CC||pe...@ifm.liu.se

--- Comment #2 from Peter Eriksson  ---
This is a really annoying bug that we've also seen. I do not think it's related
to iSCSI though (since we aren't using it). Disabling TSO seems to help (but
also severely reduces transmission speed - in our case it drops from around
10Gbps to 3Gbps without TSO).

Our servers are SMB (and NFS, but not much yet) servers. Dell PowerEdge
730xd.

> FreeBSD 11.1
> ixl2: fw 5.40.47690 api 1.5 nvm 5.40 etid 80002d35 oem 18.4608.16

ixl0: Link is up, 10 Gbps Full Duplex, FEC: None, Autoneg: False, Flow Control:
Full
ixl0: link state changed to UP
ixl2: Link is up, 10 Gbps Full Duplex, FEC: None, Autoneg: False, Flow Control:
Full
ixl2: link state changed to UP

ixl0:  mem
0xc900-0xc9ff,0xca008000-0xca00 at device 0.0 numa-domain 1 on
pci15
ixl0: Using MSIX interrupts with 9 vectors
ixl0: fw 5.40.47690 api 1.5 nvm 5.40 etid 80002d35 oem 18.4608.16
ixl0: PF-ID[0]: VFs 64, MSIX 129, VF MSIX 5, QPs 768, I2C
ixl0: Allocating 8 queues for PF LAN VSI; 8 queues active
ixl0: Ethernet address: 3c:fd:fe:24:e7:e0
ixl0: PCI Express Bus: Speed 8.0GT/s Width x8
ixl0: Failed to initialize SR-IOV (error=2)
ixl0: netmap queues/slots: TX 8/1024, RX 8/1024

ixl2:  mem
0xcc00-0xccff,0xcd008000-0xcd00 at device 0.0 numa-domain 1 on
pci16
ixl2: Using MSIX interrupts with 9 vectors
ixl2: fw 5.40.47690 api 1.5 nvm 5.40 etid 80002d35 oem 18.4608.16
ixl2: PF-ID[0]: VFs 64, MSIX 129, VF MSIX 5, QPs 768, I2C
ixl2: Allocating 8 queues for PF LAN VSI; 8 queues active
ixl2: Ethernet address: 3c:fd:fe:24:d6:a0
ixl2: PCI Express Bus: Speed 8.0GT/s Width x8
ixl2: Failed to initialize SR-IOV (error=2)
ixl2: netmap queues/slots: TX 8/1024, RX 8/1024

ixl0: Link is up, 10 Gbps Full Duplex, FEC: None, Autoneg: False, Flow Control:
Full
ixl0: link state changed to UP
ixl2: Link is up, 10 Gbps Full Duplex, FEC: None, Autoneg: False, Flow Control:
Full
ixl2: link state changed to UP
ixl2: link state changed to DOWN
ixl0: link state changed to DOWN
ixl0: Link is up, 10 Gbps Full Duplex, FEC: None, Autoneg: False, Flow Control:
Full
ixl0: link state changed to UP
ixl2: Link is up, 10 Gbps Full Duplex, FEC: None, Autoneg: False, Flow Control:
Full
ixl2: link state changed to UP
ixl2: Malicious Driver Detection event 2 on TX queue 0, pf number 0
ixl2: MDD TX event is for this function!
ixl2: Interface stopped DISTRIBUTING, possible flapping
ixl2: Interface stopped DISTRIBUTING, possible flapping
ixl2: Interface stopped DISTRIBUTING, possible flapping
...repeat...
ixl2: WARNING: queue 0 appears to be hung!
ixl2: WARNING: Resetting!

I managed to login to the server after a while and disable TSO and then things
started working again.


Would using the Intel-provided (instead of the 11.1 one) driver and firmware
(from their web site) help with this issue?



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-08-30 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Eric Joyner  changed:

   What|Removed |Added

 CC||e...@freebsd.org

--- Comment #1 from Eric Joyner  ---
Try removing the ixl_init_locked() in ixl_local_timer(), right after it prints
the "WARNING: Resetting!" message -- the queues might actually not be hung and
don't need to be reinitialized.



[Bug 221919] ixl: TX queue hang when using TSO and having a high and mixed network load

2017-08-29 Thread bugzilla-noreply
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919

Mark Linimon  changed:

   What|Removed |Added

 CC||sbr...@freebsd.org
   Assignee|freebsd-b...@freebsd.org|freebsd-net@FreeBSD.org
   Keywords||IntelNetworking
