We have an issue with X520-DA cards when they are connected to Arista 7124SX
switches, and only these switches. All our other Arista switches are fine. The
lsc_int counter reported by ethtool -S increases continually; sometimes the
link goes down, sometimes it does not. It is not the switch ports or cables,
because swapping in a non-Intel 10G card makes the problem go away. We have
seen this on dozens of cards, at widely varying rates: some log hundreds of
lsc_int events a minute, others about 100 a day. It has occurred on Dell R610s,
Dell 2950s, and Dell 1950s. We have hundreds of X520-DAs working perfectly on
3 other models of Arista switches (which have a higher latency). Nothing on
the switch shows any issues. We have seen this with the native CentOS 5.4 and
6.2 ixgbe drivers, as well as a compiled 3.9.17 driver.
Any help figuring out what would cause the driver to keep needing to re-enable
its IRQs would be great, and would hopefully lead to the root cause. Below is
some debug output from the driver, as well as ethtool -S output.
Thanks,
Brian
We compiled the driver with -DDBG and -DDEBUG to get more info, and this is
what we see in syslog:
Jun 22 10:54:26 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 0x80100000
Jun 22 10:54:26 hostname kernel: ixgbe_check_lsc: Reg - 0x00888, value - 0x00100000
Jun 22 10:54:26 hostname kernel: ixgbe_irq_enable: Reg - 0x00880, value - 0xD6080000
Jun 22 10:54:26 hostname kernel: ixgbe 0000:0c:00.0: eth1: NIC Link is Down
Jun 22 10:54:26 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, value - 0x00100000
Jun 22 10:54:26 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 0x80100000
Jun 22 10:54:26 hostname kernel: ixgbe_check_lsc: Reg - 0x00888, value - 0x00100000
Jun 22 10:54:26 hostname kernel: ixgbe_irq_enable: Reg - 0x00880, value - 0xD6080000
Jun 22 10:54:26 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, value - 0x00100000
...(the previous 4 lines repeat a hundred times or so)...
Jun 22 10:54:27 hostname kernel: ixgbe 0000:0c:00.0: eth1: NIC Link is Up 10 Gbps, Flow Control: RX/TX
Jun 22 10:54:27 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 0x80100000
Jun 22 10:54:27 hostname kernel: ixgbe_check_lsc: Reg - 0x00888, value - 0x00100000
Jun 22 10:54:27 hostname kernel: ixgbe_irq_enable: Reg - 0x00880, value - 0xD6080000
Jun 22 10:54:28 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, value - 0x00100000
Jun 22 10:54:28 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, value - 0x00100000
Jun 22 10:54:28 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 0x80100000
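
For anyone who wants to tally these debug lines rather than read them by eye,
here is a rough Python sketch. The register names (0x00800 = EICR, 0x00880 =
EIMS, 0x00888 = EIMC) and the two bit names (0x00100000 = LSC, 0x80000000 =
OTHER) are my reading of ixgbe_type.h and should be treated as assumptions to
check against the driver source. The helper names are made up for this example.

```python
import re
from collections import Counter

# Assumed from ixgbe_type.h; verify against your driver version.
REGISTERS = {0x00800: "EICR", 0x00880: "EIMS", 0x00888: "EIMC"}
BITS = {0x80000000: "OTHER", 0x00100000: "LSC"}

# \s+ lets the pattern match even if a mail client wrapped the line.
DEBUG_LINE = re.compile(
    r"(ixgbe_\w+): Reg - (0x[0-9A-Fa-f]+),\s+value -\s+(0x[0-9A-Fa-f]+)"
)


def decode_bits(value):
    """Name the bits we know about; report the rest as a raw hex mask."""
    names = [name for mask, name in BITS.items() if value & mask]
    known = sum(mask for mask in BITS if value & mask)
    rest = value & ~known
    if rest:
        names.append("unnamed:0x%08X" % rest)
    return names


def summarize(log_text):
    """Count (function, register, decoded bits) triples in a syslog excerpt."""
    counts = Counter()
    for func, reg, value in DEBUG_LINE.findall(log_text):
        reg_name = REGISTERS.get(int(reg, 16), reg)
        bits = "|".join(decode_bits(int(value, 16)))
        counts[(func, reg_name, bits)] += 1
    return counts
```

Calling summarize() on the syslog text and printing the counter shows how many
times each function touched each register. Under the assumptions above, the
0x80100000 read from EICR decodes to OTHER|LSC, i.e. a link status change
interrupt each time.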
ethtool -S output (the box has been up only 10 hours):
NIC statistics:
rx_packets: 128069
tx_packets: 6774870
rx_bytes: 12813602
tx_bytes: 731663757
rx_errors: 22535
tx_errors: 0
rx_dropped: 0
tx_dropped: 0
multicast: 0
collisions: 0
rx_over_errors: 0
rx_crc_errors: 21866
rx_frame_errors: 0
rx_fifo_errors: 0
rx_missed_errors: 0
tx_aborted_errors: 0
tx_carrier_errors: 0
tx_fifo_errors: 0
tx_heartbeat_errors: 0
rx_pkts_nic: 128122
tx_pkts_nic: 6774863
rx_bytes_nic: 13842955
tx_bytes_nic: 785896842
lsc_int: 1608249
tx_busy: 0
non_eop_descs: 1
broadcast: 322
rx_no_buffer_count: 0
tx_timeout_count: 0
tx_restart_queue: 0
rx_long_length_errors: 0
rx_short_length_errors: 0
tx_flow_control_xon: 0
rx_flow_control_xon: 0
tx_flow_control_xoff: 0
rx_flow_control_xoff: 0
rx_csum_offload_errors: 0
alloc_rx_page_failed: 0
alloc_rx_buff_failed: 0
lro_aggregated: 0
lro_flushed: 0
rx_no_dma_resources: 0
hw_rsc_aggregated: 546
hw_rsc_flushed: 536
os2bmc_rx_by_bmc: 0
os2bmc_tx_by_bmc: 0
os2bmc_tx_by_host: 0
os2bmc_rx_by_host: 0
tx_queue_0_packets: 6774870
tx_queue_0_bytes: 731663757
rx_queue_0_packets: 112597
rx_queue_0_bytes: 10211966
rx_queue_1_packets: 6041
rx_queue_1_bytes: 1333456
rx_queue_2_packets: 7719
rx_queue_2_bytes: 913256
rx_queue_3_packets: 1712
rx_queue_3_bytes: 354924
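
A rough Python sketch for putting these counters in perspective: it parses
ethtool -S style text and works out the average lsc_int rate over the 10 hours
of uptime, plus rx_crc_errors relative to rx_pkts_nic. The function names are
invented for this example, and whether rx_pkts_nic includes errored frames is
an assumption I have not checked.

```python
def parse_stats(text):
    """Turn 'name: value' lines from ethtool -S into a dict of ints."""
    stats = {}
    for line in text.splitlines():
        name, sep, value = line.partition(":")
        value = value.strip()
        if sep and value.isdigit():
            stats[name.strip()] = int(value)
    return stats


def per_second(count, uptime_hours):
    """Average events per second over the given uptime."""
    return count / (uptime_hours * 3600.0)


def crc_error_ratio(stats):
    """rx_crc_errors relative to the NIC's own received-packet counter."""
    return float(stats["rx_crc_errors"]) / stats["rx_pkts_nic"]


# Three of the counters from the output above.
sample = """NIC statistics:
rx_crc_errors: 21866
rx_pkts_nic: 128122
lsc_int: 1608249
"""

stats = parse_stats(sample)
print("lsc_int per second: %.1f" % per_second(stats["lsc_int"], 10))
print("CRC errors vs rx_pkts_nic: %.1f%%" % (100 * crc_error_ratio(stats)))
```

With these numbers the card averages roughly 45 link status change interrupts
per second, and CRC errors amount to about 17% of rx_pkts_nic.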
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit
http://communities.intel.com/community/wired