Thank you Alex.

We need autoneg on out ports, but could do without flow control.  Is 
that possible to configure?   "autoneg on rx off" does not do it . It 
looks like autoneg enables/negotiates pause.

Best
John

On 10/13/2016 04:09 PM, Alexander Duyck wrote:
> On Wed, Oct 12, 2016 at 2:53 PM, John Osswald <john.ossw...@talari.com> wrote:
>> We have multiple systems (and system types)  experiencing a kernel panic,
>> when they upgrade to Debian 8. We tried the native Debian IGB driver, as
>> well as, IGB 5.3.5.3, with no difference in behavior.  The common element is
>> that they are attached to our Portwell  CAR-3000, running  Linux 2.6.26 and
>> e1000e: Intel(R) PRO/1000 Network Driver - 1.0.2.5-NAPI.
>> The CAR-3000 board from Portwell has one Intel Core 2 Duo E8400 CPU running
>> at 3.00 GHz, 4GB of memory, and an Intel  82574L controller.
>>
>> The setup is pretty simple, two nodes  (  CAR-3000 and the DUT), with one
>> cable between them, running iperf. The DUT connection is through ports
>> attached to an I354.  The PANICs have not been been observed, when using a
>> DUT's I210 ports.
>>
>>
>> During the test, the  CAR-3000 floods  pause frames back to the DUT.  The
>> CAR-3000 sees frequent unit hangs and, itself,  may PANIC.
>>
>> These issues are not seen, if autoneg is disabled on the CAR-3000 port
>>
>>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.890202] e1000e: eth2 NIC Link is Down
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894366] 0000:05:00.0: eth2: Detected
>> Tx Unit Hang: *(this is the port connected to the 7573)*
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894367] TDH                  <48>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894368] TDT                  <92>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894368] next_to_use          <92>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894369] next_to_clean        <48>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894369] buffer_info[next_to_clean]:
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894370] time_stamp
>> <10060025e>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894370] next_to_watch        <48>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894371] jiffies
>> <100600b8a>
>> Oct  5 22:09:04 CAR-3000kernel: [25609.894372] next_to_watch.status <0>
>> Oct  5 22:09:07 CAR-3000kernel: [25613.074687] e1000e: eth2 NIC Link is Up
>> 1000 Mbps Full Duplex, Flow Control: RX/TX
>>
>>
>> Coincidental with the above Unit Hangs on the  CAR-3000,  the DUT may  PANIC
>>
>> In this instance, The PANIC is occurred on a  Lannar 7573 (connection
>> through an I354), however it has been seen on other platforms from different
>> vendors.
>>
>>
>> [  110.986096] ------------[ cut here ]------------
>> [  110.990774] WARNING: CPU: 7 PID: 53 at net/sched/sch_generic.c:264
>> dev_watchdog+0xd9/0x13f()
>> [  110.999300] NETDEV WATCHDOG: tn-lbp0 (igb): transmit queue 0 timed out
>> [  111.005879] Modules linked in: dmi_sysfs ipv6 fuse coretemp kvm_intel kvm
>> crc32c_intel aesni_intel aes_x86_64 ablk_helper cryptd lrw gf128mul
>> glue_helper iTCO_wdt iTCO_vendor_support i2c_i801 microcode i2c_core lpc_ich
>> mfd_core ipmi_msghandler button ext3 mbcache jbd sd_mod crc_t10dif
>> crct10dif_generic crct10dif_common ahci ehci_pci libahci ehci_hcd igb(O)
>> libata dca ptp usbcore scsi_mod usb_common pps_core
>> [  111.043335] CPU: 7 PID: 53 Comm: ksoftirqd/7 Tainted: G           O
>> 3.16.7-e100v1 #1
>> [  111.051218] Hardware name: To be filled by O.E.M. To be filled by
>> O.E.M./To be filled by O.E.M., BIOS 5.6.5 06/30/2016
>> [  111.061966]  0000000000000006 ffffffff812fefff ffff88046ed03d20
>> 0000000000000009
>> [  111.069559]  ffffffff8103a0a9 ffffffff8128dec0 ffff88046e410000
>> ffff88046ed03d78
>> [  111.077122]  ffffffff8128dde7 ffff88046e410388 ffffffff8103a102
>> ffffffff8159d053
>> [  111.084686] Call Trace:
>> [  111.087147]  [<ffffffff812fefff>] ? dump_stack+0x46/0x59
>> [  111.092497]  [<ffffffff8103a0a9>] ? warn_slowpath_common+0x74/0x89
>> [  111.098731]  [<ffffffff8128dec0>] ? dev_watchdog+0xd9/0x13f
>> [  111.104334]  [<ffffffff8128dde7>] ? dev_deactivate_queue+0x53/0x53
>> [  111.110545]  [<ffffffff8103a102>] ? warn_slowpath_fmt+0x44/0x4d
>> [  111.116495]  [<ffffffff81302a8e>] ? _raw_spin_lock+0xe/0x17
>> [  111.122097]  [<ffffffff81302b05>] ? _raw_spin_unlock+0xe/0x20
>> [  111.127873]  [<ffffffff8128db84>] ? netif_tx_lock+0x6c/0x79
>> [  111.133491]  [<ffffffff8128dec0>] ? dev_watchdog+0xd9/0x13f
>> [  111.139098]  [<ffffffff81042ebb>] ? call_timer_fn+0x36/0x108
>> [  111.144787]  [<ffffffff8128dde7>] ? dev_deactivate_queue+0x53/0x53
>> [  111.150998]  [<ffffffff81043785>] ? run_timer_softirq+0x1dd/0x20a
>> [  111.157122]  [<ffffffff8103d8f9>] ? __do_softirq+0xf8/0x27d
>> [  111.162734]  [<ffffffff8103da93>] ? run_ksoftirqd+0x15/0x4d
>> [  111.168362]  [<ffffffff810582f5>] ? smpboot_thread_fn+0x207/0x226
>> [  111.174521]  [<ffffffff810580ee>] ? SyS_setgroups+0xe3/0xe3
>> [  111.180149]  [<ffffffff81053223>] ? kthread+0xa8/0xb0
>> [  111.185224]  [<ffffffff8105317b>] ? __kthread_parkme+0x5b/0x5b
>> [  111.191105]  [<ffffffff81303148>] ? ret_from_fork+0x58/0x90
>> [  111.196717]  [<ffffffff8105317b>] ? __kthread_parkme+0x5b/0x5b
>> [  111.202604] ---[ end trace 0242955cf46c53c7 ]---
>>
>>
>> Thank you
> The error messages you are seeing indicate that the Tx isn't moving.
> If the CAR-3000 is flooding pause frames this is somewhat expected
> behavior.  The warnings mean that the Tx path is full of packets and
> that the Tx queues are not draining.  By the way the "panic" you refer
> to is actually a netdev watchdog warning.  It has the same root cause.
> The stack and the driver think the Tx is hung because DMA is not
> moving so it is resetting the device.  If you are wanting to work
> around this you could disable flow control on the interface using
> "ethtool -A <iface> tx off rx off autoneg off" and the NIC will ignore
> the hangs on the switch.
>
> - Alex


------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most 
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
E1000-devel mailing list
E1000-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel&#174; Ethernet, visit 
http://communities.intel.com/community/wired

Reply via email to