Dear developers,
We have a X8DTU Supermicro system with onboard lan gigabit interfaces, we
run debian stretch on it, we use it for virtualisation with xen.
Unfortunately, we periodically end up with igb tx timeouts on the host,
the driver starts resetting the interface without success, the only
solution is to reset the host.
Xen hypervisor version is:
# xl info
host : x
release : 4.9.0-6-amd64
version : #1 SMP Debian 4.9.88-1+deb9u1 (2018-05-07)
machine : x86_64
nr_cpus : 8
max_cpu_id : 23
nr_nodes : 2
cores_per_socket : 4
threads_per_core : 1
cpu_mhz : 3066
hw_caps :
b7ebfbff:029ee3ff:2c100800:00000001:00000000:00000000:00000000:00000100
5~virt_caps : hvm hvm_directio
total_memory : 196599
free_memory : 125241
sharing_freed_memory : 0
sharing_used_memory : 0
outstanding_claims : 0
free_cpus : 0
xen_major : 4
xen_minor : 8
xen_extra : .4-pre
xen_version : 4.8.4-pre
xen_caps : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32
hvm-3.0-x86_32p hvm-3.0-x86_64
xen_scheduler : credit
xen_pagesize : 4096
platform_params : virt_start=0xffff800000000000
xen_changeset :
xen_commandline : placeholder dom0_mem=4096M gnttab_max_frames=256
msi=off
cc_compiler : gcc (Debian 6.3.0-18+deb9u1) 6.3.0 20170516
cc_compile_by : ijackson
cc_compile_domain : chiark.greenend.org.uk
cc_compile_date : Tue May 22 17:44:34 UTC 2018
build_id : 2989c900c3cae88daf7e27570a3bc2239d7f7b76
xend_config_format : 4
Linux kernel version:
# uname -a Linux x 4.9.0-6-amd64 #1 SMP Debian 4.9.88-1+deb9u1
(2018-05-07) x86_64 GNU/Linux
Intel igb driver version:
# modinfo igb
filename:
/lib/modules/4.9.0-6-amd64/kernel/drivers/net/ethernet/intel/igb/igb.ko
version: 5.4.0-k
license: GPL
description: Intel(R) Gigabit Ethernet Network Driver
author: Intel Corporation, <e1000-devel@lists.sourceforge.net>
srcversion: 9F40B4B7DB12A56E9933EB8
alias: pci:v00008086d000010D6sv*sd*bc*sc*i*
alias: pci:v00008086d000010A9sv*sd*bc*sc*i*
alias: pci:v00008086d000010A7sv*sd*bc*sc*i*
alias: pci:v00008086d000010E8sv*sd*bc*sc*i*
alias: pci:v00008086d00001526sv*sd*bc*sc*i*
alias: pci:v00008086d0000150Dsv*sd*bc*sc*i*
alias: pci:v00008086d000010E7sv*sd*bc*sc*i*
alias: pci:v00008086d000010E6sv*sd*bc*sc*i*
alias: pci:v00008086d00001518sv*sd*bc*sc*i*
alias: pci:v00008086d0000150Asv*sd*bc*sc*i*
alias: pci:v00008086d000010C9sv*sd*bc*sc*i*
alias: pci:v00008086d00000440sv*sd*bc*sc*i*
alias: pci:v00008086d0000043Csv*sd*bc*sc*i*
alias: pci:v00008086d0000043Asv*sd*bc*sc*i*
alias: pci:v00008086d00000438sv*sd*bc*sc*i*
alias: pci:v00008086d00001516sv*sd*bc*sc*i*
alias: pci:v00008086d00001511sv*sd*bc*sc*i*
alias: pci:v00008086d00001510sv*sd*bc*sc*i*
alias: pci:v00008086d00001527sv*sd*bc*sc*i*
alias: pci:v00008086d0000150Fsv*sd*bc*sc*i*
alias: pci:v00008086d0000150Esv*sd*bc*sc*i*
alias: pci:v00008086d00001524sv*sd*bc*sc*i*
alias: pci:v00008086d00001523sv*sd*bc*sc*i*
alias: pci:v00008086d00001522sv*sd*bc*sc*i*
alias: pci:v00008086d00001521sv*sd*bc*sc*i*
alias: pci:v00008086d0000157Csv*sd*bc*sc*i*
alias: pci:v00008086d0000157Bsv*sd*bc*sc*i*
alias: pci:v00008086d00001538sv*sd*bc*sc*i*
alias: pci:v00008086d00001537sv*sd*bc*sc*i*
alias: pci:v00008086d00001536sv*sd*bc*sc*i*
alias: pci:v00008086d00001533sv*sd*bc*sc*i*
alias: pci:v00008086d00001539sv*sd*bc*sc*i*
alias: pci:v00008086d00001F45sv*sd*bc*sc*i*
alias: pci:v00008086d00001F41sv*sd*bc*sc*i*
alias: pci:v00008086d00001F40sv*sd*bc*sc*i*
depends: ptp,dca,i2c-algo-bit
retpoline: Y
intree: Y
vermagic: 4.9.0-6-amd64 SMP mod_unload modversions
parm: max_vfs:Maximum number of virtual functions to allocate
per physical function (uint)
parm: debug:Debug level (0=none,...,16=all) (int)
Please find the kernel messages attached.
There are ethtool dumps, in normal operating modes and when the tx unit
hang. We have tried with pci=nomsi kernel parameter also, but did not
help.
Unfortunately, we cannot force the bug arise, it just happens after a few
days of uptime.
Please help me debug and solve the issue. What more information should I
collect?
Thanks in advance,
Kojedzinszky Richárd
Euronet Magyarorszag Informatika Zrt.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
E1000-devel mailing list
E1000-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit
http://communities.intel.com/community/wired