[E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Joe Jin
Hi list, I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy a big file (500M) from another server will hit it at once. Would you please help on this? device info: # lspci -s 05:00.0 05:00.0

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
} I checked all driver codes I did not found anywhere will set the upper.data with E1000_TXD_STAT_DD, I guess upper.data be set by hardware? If OS is 32bit system, what which happen? Thanks in advance, Joe On 07/09/12 16:51, Joe Jin wrote: Hi list, I'm seeing a Unit Hang even

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 03:02, Dave, Tushar N wrote: -Original Message- From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] On Behalf Of Joe Jin Sent: Tuesday, July 10, 2012 12:40 AM To: Joe Jin Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- ker

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 11:22, Dave, Tushar N wrote: Thanks for info. I see that hang occurs right when HW processing first TX descriptor with TSO. Would you be able to reproduce issue with TSO off? Disable TSO by 'ethtool -K ethx tso off' Let all debug enabled as it is, that will help us debug

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:11, Dave, Tushar N wrote: -Original Message- From: Joe Jin [mailto:joe@oracle.com] Sent: Tuesday, July 10, 2012 10:03 PM To: Dave, Tushar N Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- ker...@vger.kernel.org Subject: Re: 82571EB: Detected Hardware

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:37, Dave, Tushar N wrote: -Original Message- From: Joe Jin [mailto:joe@oracle.com] Sent: Wednesday, July 11, 2012 12:18 AM To: Dave, Tushar N Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- ker...@vger.kernel.org Subject: Re: 82571EB: Detected

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:50, Dave, Tushar N wrote: Device status and AER sections show some errors that looks little suspicious to me but I'm not too sure. I will get back tomorrow. Thanks a lot, Tushar! Joe -- Oracle http://www.oracle.com Joe Jin | Software Development Senior Manager

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 02:51, Dave, Tushar N wrote: Joe, I see couple of errors in lspci output. Device capability status register shows UnCorrectable PCIe error. This means there is certainly something went wrong. The only way to recover from Uncorrectable errors is reset. DevSta:

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 10:52, Dave, Tushar N wrote: What is the exact error messages in BIOS log? Error message from BIOS event log: 07/12/12 05:54:00 PCI Express Non-Fatal Error Thanks, Joe -- Live Security Virtual

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Joe Jin
On 07/15/12 11:42, Dave, Tushar N wrote: -Original Message- From: Joe Jin [mailto:joe@oracle.com] Sent: Thursday, July 12, 2012 9:34 PM To: Dave, Tushar N Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- ker...@vger.kernel.org Subject: Re: 82571EB: Detected Hardware

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Joe Jin
On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy a big file (500M) from another server will hit it at once. All devices in path from root complex

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Joe Jin
On 11/09/12 04:35, Dave, Tushar N wrote: All devices in path from root complex to 82571, should have *same* max payload size otherwise it can cause hang. Can you double check this? Hi Tushar, Checked with hardware vendor and they said no way to modify the max payload size from BIOS, can I

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-18 Thread Joe Jin
On 11/16/12 04:26, Dave, Tushar N wrote: Would you please help to fine the offset of max payload size in eeprom? I'd like to have a try to modify it by ethtool. It is defined using bit 8 of word 0x1A. Bit value 0 = 128B , bit value 1 = 256B Hi Tushar, I checked one of my server which Max

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: Have you power off the system completely after modifying eeprom? If not please do so. Hi Tushar, Seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: Have you power off the system completely after modifying eeprom? If not please do so. seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset Values

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Joe Jin
On 11/27/12 00:23, Fujinaka, Todd wrote: If you look at the previous section, DevCap, you'll see that it's correctly advertising 256 bytes but the system is negotiating 128 for the link to the Ethernet controller. Things on the other side of the link are controlled outside of the e1000 driver.

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Joe Jin
On 11/28/12 02:10, Ben Hutchings wrote: On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: Forgive me if I'm being too repetitious as I think some of this has been mentioned in the past. We (and by we I mean the Ethernet part and driver) can only change the advertised availability of a

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin
(503) 712-4565 -Original Message- From: Ethan Zhao [mailto:ethan.ker...@gmail.com] Sent: Wednesday, November 28, 2012 7:10 PM To: Fujinaka, Todd Cc: Joe Jin; Ben Hutchings; Mary Mcgrath; net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin
Hi Yijing, Thanks for your reference, the patch looks good for me, but I have no chance to test it on customer's env. Best Regards, Joe On 12/19/12 13:52, Yijing Wang wrote: On 2012/12/19 11:04, Joe Jin wrote: Hi all, I backported mps commits and ask customer pass pci=pcie_bus_peer2pee