We have an issue with X520-DA cards when they are connected to Arista 7124SX 
switches, and only these switches. All our other Arista switches are fine. The 
issue is that we have continual increasing lsc_int values via ethtool, with it 
sometimes taking the link down, others not. It's not the switch ports or 
cables, as, replacing with a non-Intel 10G card has no issues. We have had this 
issue, with varying degrees of how often we get the lsc_int's, with dozens of 
cards, some do it 100's of times a minute, others do it 100 times a day. We 
have had this occur on Dell R610's, Dell 2950's, and Dell 1950's. We have 100's 
of X520-DA's working perfectly on 3 other models of Arista switches (which have 
a higher latency). Nothing on the switch shows any issues. We have seen this on 
the native CentOS 5.4 and 6.2 ixgbe drivers, as well as a compiled 3.9.17 
driver.

Any help with first figuring out what would cause it to keep needing to 
reenable it's irq's would great, and hopefully might lead to root cause. Below 
is some debug out from the driver as well as ethtool -S output.

Thanks,
Brian


We have compiled it with -DDBG and -DDEBUG to get more info, and this is what 
we see in syslog:

Jun 22 10:54:26 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 
0x80100000

Jun 22 10:54:26 hostname kernel: ixgbe_check_lsc: Reg - 0x00888, value - 
0x00100000

Jun 22 10:54:26 hostname kernel: ixgbe_irq_enable: Reg - 0x00880, value - 
0xD6080000

Jun 22 10:54:26 hostname kernel: ixgbe 0000:0c:00.0: eth1: NIC Link is Down

Jun 22 10:54:26 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, 
value - 0x00100000

Jun 22 10:54:26 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 
0x80100000

Jun 22 10:54:26 hostname kernel: ixgbe_check_lsc: Reg - 0x00888, value - 
0x00100000

Jun 22 10:54:26 hostname kernel: ixgbe_irq_enable: Reg - 0x00880, value - 
0xD6080000

Jun 22 10:54:26 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, 
value - 0x00100000

...(add repeating of the previous 4 lines a hundred times or so)...

Jun 22 10:54:27 hostname kernel: ixgbe 0000:0c:00.0: eth1: NIC Link is Up 10 
Gbps, Flow Control: RX/TX

Jun 22 10:54:27 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 
0x80100000

Jun 22 10:54:27 hostname kernel: ixgbe_check_lsc: Reg - 0x00888, value - 
0x00100000

Jun 22 10:54:27 hostname kernel: ixgbe_irq_enable: Reg - 0x00880, value - 
0xD6080000

Jun 22 10:54:28 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, 
value - 0x00100000

Jun 22 10:54:28 hostname kernel: ixgbe_watchdog_update_link: Reg - 0x00880, 
value - 0x00100000

Jun 22 10:54:28 hostname kernel: ixgbe_msix_other: Reg - 0x00800, value - 
0x80100000

Ethtool -S output (box being up only 10 hours):

NIC statistics:

     rx_packets: 128069

     tx_packets: 6774870

     rx_bytes: 12813602

     tx_bytes: 731663757

     rx_errors: 22535

     tx_errors: 0

     rx_dropped: 0

     tx_dropped: 0

     multicast: 0

     collisions: 0

     rx_over_errors: 0

     rx_crc_errors: 21866

     rx_frame_errors: 0

     rx_fifo_errors: 0

     rx_missed_errors: 0

     tx_aborted_errors: 0

     tx_carrier_errors: 0

     tx_fifo_errors: 0

     tx_heartbeat_errors: 0

     rx_pkts_nic: 128122

     tx_pkts_nic: 6774863

     rx_bytes_nic: 13842955

     tx_bytes_nic: 785896842

     lsc_int: 1608249

     tx_busy: 0

     non_eop_descs: 1

     broadcast: 322

     rx_no_buffer_count: 0

     tx_timeout_count: 0

     tx_restart_queue: 0

     rx_long_length_errors: 0

     rx_short_length_errors: 0

     tx_flow_control_xon: 0

     rx_flow_control_xon: 0

     tx_flow_control_xoff: 0

     rx_flow_control_xoff: 0

     rx_csum_offload_errors: 0

     alloc_rx_page_failed: 0

     alloc_rx_buff_failed: 0

     lro_aggregated: 0

     lro_flushed: 0

     rx_no_dma_resources: 0

     hw_rsc_aggregated: 546

     hw_rsc_flushed: 536

     os2bmc_rx_by_bmc: 0

     os2bmc_tx_by_bmc: 0

     os2bmc_tx_by_host: 0

     os2bmc_rx_by_host: 0

     tx_queue_0_packets: 6774870

     tx_queue_0_bytes: 731663757

     rx_queue_0_packets: 112597

     rx_queue_0_bytes: 10211966

     rx_queue_1_packets: 6041

     rx_queue_1_bytes: 1333456

     rx_queue_2_packets: 7719

     rx_queue_2_bytes: 913256

     rx_queue_3_packets: 1712

     rx_queue_3_bytes: 354924

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit 
http://communities.intel.com/community/wired

Reply via email to