Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Yijing Wang
(LAD) >> Intel Corporation >> todd.fujin...@intel.com >> (503) 712-4565 >> >> >> -Original Message- >> From: Ethan Zhao [mailto:ethan.ker...@gmail.com] >> Sent: Wednesday, November 28, 2012 7:10 PM >> To: Fujinaka, Todd >> C

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin
>> LAN Access Division (LAD) >>> Intel Corporation >>> todd.fujin...@intel.com >>> (503) 712-4565 >>> >>> >>> -Original Message----- >>> From: Ethan Zhao [mailto:ethan.ker...@gmail.com] >>> Sent: Wednesday, Nov

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin
th; net...@vger.kernel.org; > e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci > Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang > > Joe, > Possibly your customer is running a kernel without source code on a > platform whose vendor wouldn't like to f

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-29 Thread Fujinaka, Todd
x-ker...@vger.kernel.org; linux-pci Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang Joe, Possibly your customer is running a kernel without source code on a platform whose vendor wouldn't like to fix BIOS issue( Is that a HP/Dell server ?). Anyway, to see if is a payloa

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Ethan Zhao
om] > Sent: Wednesday, November 28, 2012 12:31 AM > To: Ben Hutchings > Cc: Fujinaka, Todd; Mary Mcgrath; net...@vger.kernel.org; > e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci > Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang > > On 11/28/12

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Fujinaka, Todd
al Message- From: Joe Jin [mailto:joe@oracle.com] Sent: Wednesday, November 28, 2012 12:31 AM To: Ben Hutchings Cc: Fujinaka, Todd; Mary Mcgrath; net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci Subject: Re: [E1000-devel] 82571EB: Detected Hardware Uni

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Joe Jin
On 11/28/12 02:10, Ben Hutchings wrote: > On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: >> Forgive me if I'm being too repetitious as I think some of this has >> been mentioned in the past. >> >> We (and by we I mean the Ethernet part and driver) can only change the >> advertised availab

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-27 Thread Ben Hutchings
On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: > Forgive me if I'm being too repetitious as I think some of this has > been mentioned in the past. > > We (and by we I mean the Ethernet part and driver) can only change the > advertised availability of a larger MaxPayloadSize. The size is

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-27 Thread Fujinaka, Todd
ay, November 27, 2012 10:11 AM To: Fujinaka, Todd; Mary Mcgrath Cc: Joe Jin; net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci Subject: RE: [E1000-devel] 82571EB: Detected Hardware Unit Hang On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: > Forgi

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-27 Thread Fujinaka, Todd
2-4565 -Original Message- From: Mary Mcgrath [mailto:mary.mcgr...@oracle.com] Sent: Monday, November 26, 2012 6:07 PM To: Joe Jin Cc: net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang Joe Thank yo

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Mary Mcgrath
in, thank you. Regards Mary -Original Message- From: Joe Jin Sent: Monday, November 26, 2012 8:00 PM To: Fujinaka, Todd Cc: Dave, Tushar N; net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; Mary Mcgrath Subject: Re: [E1000-devel] 82571EB: Detected Hardware

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Joe Jin
On 11/27/12 00:23, Fujinaka, Todd wrote: > If you look at the previous section, DevCap, you'll see that it's > correctly advertising 256 bytes but the system is negotiating 128 for > the link to the Ethernet controller. Things on the "other" side of the > link are controlled outside of the e1000 dr

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Fujinaka, Todd
On Tue, 20 Nov 2012, Joe Jin wrote: > On 11/20/12 16:59, Dave, Tushar N wrote: >> Have you power off the system completely after modifying eeprom? If not >> please do so. > > Hi Tushar, > > Seems not works for me, would you please help to check what is wrong of my > operations? ... > # lspci -

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset Values ---

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. Hi Tushar, Seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Sunday, November 18, 2012 9:38 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 11/16/12 0

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-18 Thread Joe Jin
On 11/16/12 04:26, Dave, Tushar N wrote: >> Would you please help to find the offset of max payload size in eeprom? >> I'd like to have a try to modify it by ethtool. > > It is defined using bit 8 of word 0x1A. > Bit value 0 = 128B, bit value 1 = 256B Hi Tushar, I checked one of my servers which
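The bit layout Tushar describes (bit 8 of EEPROM word 0x1A; 0 = 128B, 1 = 256B) can be sketched as below. This is only an illustration, not anything posted in the thread: the helper names are hypothetical, and it assumes the EEPROM is a sequence of 16-bit little-endian words, so that word 0x1A occupies byte offsets 0x34 and 0x35 of a raw dump. Check that assumption against your own `ethtool -e` output before changing anything.

```python
MPS_WORD = 0x1A  # EEPROM word index, as stated in the thread
MPS_BIT = 8      # bit within that word, as stated in the thread


def read_word(eeprom: bytes, word_index: int) -> int:
    """Return one 16-bit word from a raw dump (assumes little-endian words)."""
    offset = word_index * 2
    return eeprom[offset] | (eeprom[offset + 1] << 8)


def advertised_max_payload(word: int) -> int:
    """Decode bit 8 of the word: 0 means 128 bytes, 1 means 256 bytes."""
    return 256 if (word >> MPS_BIT) & 1 else 128


def with_max_payload(word: int, size: int) -> int:
    """Return the word with bit 8 set for 256 bytes or cleared for 128 bytes."""
    if size == 256:
        return word | (1 << MPS_BIT)
    if size == 128:
        return word & ~(1 << MPS_BIT) & 0xFFFF
    raise ValueError("only 128 and 256 are encoded by this bit")
```

Under the little-endian assumption, bit 8 of the word is bit 0 of the byte at offset 0x35, which is the byte one would inspect in the dump.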

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-15 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, November 14, 2012 4:33 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 11/14/1

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-14 Thread Joe Jin
On 11/14/12 11:45, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Tuesday, November 13, 2012 6:48 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org; Mary Mcgrath >> Subject: R

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Dave, Tushar N
>-Original Message- >From: Li Yu [mailto:raise.s...@gmail.com] >Sent: Tuesday, November 13, 2012 7:37 PM >To: Dave, Tushar N >Cc: Joe Jin; e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >于 2

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, November 13, 2012 6:48 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 11/09/12

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Li Yu
On 2012-11-09 04:35, Dave, Tushar N wrote: >> -Original Message- >> From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >> On Behalf Of Joe Jin >> Sent: Wednesday, November 07, 2012 10:25 PM >> To: e1000-de...@lists.sf.net >> Cc: net...@vger.kernel.org; linux-ker...@vger.k

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Joe Jin
On 11/09/12 04:35, Dave, Tushar N wrote: > All devices in path from root complex to 82571, should have *same* max > payload size otherwise it can cause hang. > Can you double check this? Hi Tushar, Checked with hardware vendor and they said no way to modify the max payload size from BIOS, can
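The check Tushar asks for (every device on the path from the root complex to the 82571 using the same max payload size) could be automated roughly as follows. This is a hedged sketch: the function names are hypothetical, and it assumes each input string is the `lspci -vv` text for one device, in which the negotiated value appears as "MaxPayload N bytes" somewhere after the "DevCtl:" label (the DevCap value, which comes earlier, is deliberately skipped).

```python
import re

# First "MaxPayload N bytes" that follows "DevCtl:"; DOTALL lets it span lines.
_DEVCTL_MPS = re.compile(r"DevCtl:.*?MaxPayload (\d+) bytes", re.DOTALL)


def devctl_max_payload(lspci_text: str) -> int:
    """Extract the negotiated (DevCtl) max payload size from lspci -vv text."""
    match = _DEVCTL_MPS.search(lspci_text)
    if match is None:
        raise ValueError("no DevCtl MaxPayload found")
    return int(match.group(1))


def mismatched_devices(path):
    """Take (name, lspci_text) pairs ordered from root port to endpoint.

    Returns {name: size} for the whole path if the sizes differ,
    or an empty dict if every device agrees.
    """
    sizes = {name: devctl_max_payload(text) for name, text in path}
    return sizes if len(set(sizes.values())) > 1 else {}
```

An empty result means the path is consistent; a non-empty one lists each device with its size so the odd one out is easy to spot.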

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Joe Jin
On 11/09/12 04:35, Dave, Tushar N wrote: > Are you sure this is not similar issue as before that you reported. > i.e. Tushar, Thanks for your quick response, I'll check with customer if they can modify the Max payload size from BIOS, this time issue hit on HP's server. Thanks again, Joe > On

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Dave, Tushar N
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Wednesday, November 07, 2012 10:25 PM >To: e1000-de...@lists.sf.net >Cc: net...@vger.kernel.org; linux-ker...@vger.kernel.org; Mary Mcgrath >Subject: 82571EB: Detected

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-29 Thread Dave, Tushar N
@lists.sourceforge.net Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang This is the output: ~$ sudo ethtool -S eth1 | grep tx_timeout_count tx_timeout_count: 0 ~$ I will try new driver, but this is a production server. I don't have any actual problems with the nic, but I do

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-29 Thread Andrew Peng
I would suggest try latest e1000e driver > > *From:* Andrew Peng [mailto:peng...@gmail.com] > *Sent:* Friday, August 24, 2012 10:29 AM > > *To:* Dave, Tushar N > *Cc:* e1000-devel@lists.sourceforge.net > *Subject:* Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Nikolay Popov
Hi, Dave! Ok, I have set msglevel as you requested, let's wait for some logs. Also, about versions - we are using 1.11.3-NAPI on both 3.3.6 and 3.5.2 hosts. We were forced to do that because with the default kernel driver (at least 2.0.0 on 3.5.2) we see some mysterious drops and delays (~1-2%, and delay

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Dave, Tushar N
>-Original Message- >From: Nikolay Popov [mailto:niko...@popoff.net.ua] >Sent: Tuesday, August 28, 2012 9:00 PM >To: Dave, Tushar N >Cc: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >29.08.2012 6:29, Dave, Tus

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Nikolay Popov
29.08.2012 6:29, Dave, Tushar N wrote: > Have you tried disabling tso (ethtool -K tso off)? I also tried recompiling driver with DISABLE_PM, disabling gro and other offload types, boot kernel with acpi_aspm=off, increase ring buffers to 4096, playing around flow control - nothing helped. Regards

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Nikolay Popov
29.08.2012 6:29, Dave, Tushar N wrote: > Thanks for the info. > For both, 82571 and 80003ES2LAN, I see UnsuppReq+ and UncorrErr+ in lspci > (DevSta: CorrErr- UncorrErr+ FatalErr- UnsuppReq+ AuxPwr+ TransPend+) > > Have you tried disabling tso (ethtool -K tso off)? Yes, this doesn't help > Was thi
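The DevSta line quoted above uses a trailing "+" for a set flag and "-" for a clear one. A small sketch of reading it mechanically, assuming exactly that token format; the function names are hypothetical and not part of lspci or the driver:

```python
ERROR_FLAGS = ("CorrErr", "UncorrErr", "FatalErr", "UnsuppReq")


def parse_devsta(line: str) -> dict:
    """Map each flag name after 'DevSta:' to True ('+') or False ('-')."""
    _, _, rest = line.partition("DevSta:")
    flags = {}
    for token in rest.split():
        token = token.strip("()")  # tolerate the parentheses used in the mail
        if len(token) > 1 and token[-1] in "+-":
            flags[token[:-1]] = token[-1] == "+"
    return flags


def reported_errors(line: str) -> list:
    """Return the error flags that are set, in a fixed order."""
    flags = parse_devsta(line)
    return [name for name in ERROR_FLAGS if flags.get(name)]
```

Applied to the line quoted in the message, this reports UncorrErr and UnsuppReq, the same two flags Tushar points out.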

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-27 Thread Dave, Tushar N
>-Original Message- >From: Nikolay Popov [mailto:niko...@popoff.net.ua] >Sent: Saturday, August 25, 2012 1:29 AM >To: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Hi, All > >It seems that I'm getting

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-25 Thread Nikolay Popov
Hi, All It seems that I'm getting same problems with 3.5.2 kernel - 80003ES2LAN onboard NIC is going to reset from time to time under load Aug 25 10:27:53 bras2 kernel: [134612.808590] e1000e :05:00.0: eth2: Detected Hardware Unit Hang: Aug 25 10:27:53 bras2 kernel: [134612.808590] TDH A

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-24 Thread Dave, Tushar N
ethx | grep tx_timeout_count’ -Tushar PS: I would suggest try latest e1000e driver From: Andrew Peng [mailto:peng...@gmail.com] Sent: Friday, August 24, 2012 10:29 AM To: Dave, Tushar N Cc: e1000-devel@lists.sourceforge.net Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang Hi, in

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-24 Thread Andrew Peng
s mentioned by Flavio). > > -Tushar > > >-Original Message- > >From: Flavio Leitner [mailto:f...@redhat.com] > >Sent: Thursday, July 19, 2012 6:39 PM > >To: Andrew Peng > >Cc: Dave, Tushar N; e1000-devel@lists.sourceforge.net > >Subject: Re: [E1000-devel

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Dave, Tushar N
N; e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >On Thu, 19 Jul 2012 20:17:14 -0500 >Andrew Peng wrote: > >> Flavio; >> >> I am using the stock kernel driver with the stock Debian Squeeze kernel. >> > >

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Flavio Leitner
ool ethx' > > > > -Tushar > > > > > > > >>-Original Message- > >>From: Andrew Peng [mailto:peng...@gmail.com] > >>Sent: Thursday, July 19, 2012 4:42 PM > >>To: Dave, Tushar N > >>Cc: e1000-devel

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Andrew Peng
the log. > Please confirm that msglvl is set correctly by running 'ethtool ethx' > > -Tushar > > > >>-Original Message- >>From: Andrew Peng [mailto:peng...@gmail.com] >>Sent: Thursday, July 19, 2012 4:42 PM >>To: Dave, Tushar N >>Cc: e1

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Dave, Tushar N
ndrew Peng [mailto:peng...@gmail.com] >Sent: Thursday, July 19, 2012 4:42 PM >To: Dave, Tushar N >Cc: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Attached is the dmesg output. Please let me know if this looks right. >The

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Flavio Leitner
Please enable TSO back. > > Then run "ethtool -s ethx msglvl 0x2c01". This will enable debug code that > > logs HW ring data (into dmesg log) when Tx hang occurs. When issue occur > > next time please send me the full dmesg log. > > > > -Tushar > > > >>
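For readers wondering what the value 0x2c01 in "ethtool -s ethx msglvl 0x2c01" selects, the sketch below decodes it as a bitmask. The bit names are the NETIF_MSG_* message classes as I recall them from the kernel's include/linux/netdevice.h; treat the table as an assumption and confirm it against your kernel source. The function name is hypothetical.

```python
# Assumed NETIF_MSG_* bit assignments (verify against include/linux/netdevice.h).
NETIF_MSG_BITS = {
    0x0001: "drv",
    0x0002: "probe",
    0x0004: "link",
    0x0008: "timer",
    0x0010: "ifdown",
    0x0020: "ifup",
    0x0040: "rx_err",
    0x0080: "tx_err",
    0x0100: "tx_queued",
    0x0200: "intr",
    0x0400: "tx_done",
    0x0800: "rx_status",
    0x1000: "pktdata",
    0x2000: "hw",
    0x4000: "wol",
}


def decode_msglvl(value: int) -> list:
    """List the message classes enabled by a msglvl bitmask, lowest bit first."""
    return [name for bit, name in sorted(NETIF_MSG_BITS.items()) if value & bit]
```

With this table, 0x2c01 breaks down into 0x2000 + 0x0800 + 0x0400 + 0x0001, that is the hw, rx_status, tx_done and drv classes.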

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Andrew Peng
gs HW ring data (into dmesg log) when Tx hang occurs. When issue occur next > time please send me the full dmesg log. > > -Tushar > >>-Original Message- >>From: Andrew Peng [mailto:peng...@gmail.com] >>Sent: Wednesday, July 18, 2012 6:24 AM >>To: e1000-dev

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-18 Thread Dave, Tushar N
ge- >From: Andrew Peng [mailto:peng...@gmail.com] >Sent: Wednesday, July 18, 2012 6:24 AM >To: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Thus far disabling TSO via ethtool has seemed to work - can anyone explain >the te

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-18 Thread Andrew Peng
Thus far disabling TSO via ethtool has seemed to work - can anyone explain the technical reason why this appears to have fixed the issue? --Andrew On Mon, Jul 16, 2012 at 3:47 PM, Andrew Peng wrote: > Sorry folks, but I just realized that I hadn't been replying to the > list properly and instead

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Jon Mason
On Mon, Jul 16, 2012 at 9:08 AM, Henrique de Moraes Holschuh wrote: > On Mon, 16 Jul 2012, Ben Hutchings wrote: >> On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: >> > On Sun, 15 Jul 2012, Dave, Tushar N wrote: >> > > Somehow setting max payload to 256 from BIOS does not set

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Jon Mason
On Mon, Jul 16, 2012 at 8:47 AM, Ben Hutchings wrote: > On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: >> On Sun, 15 Jul 2012, Dave, Tushar N wrote: >> > Somehow setting max payload to 256 from BIOS does not set this value for >> > all devices. I believe this is a BIOS bug.

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-16 Thread Andrew Peng
Sorry folks, but I just realized that I hadn't been replying to the list properly and instead I was mistakenly emailing Dave directly. I'm consolidating and re-sending the information to the list. BIOS on the HP N40L does not specify any options for AER or PCIe error management, or packet size (

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Henrique de Moraes Holschuh
On Mon, 16 Jul 2012, Ben Hutchings wrote: > On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: > > On Sun, 15 Jul 2012, Dave, Tushar N wrote: > > > Somehow setting max payload to 256 from BIOS does not set this value for > > > all devices. I believe this is a BIOS bug. > > > >

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Ben Hutchings
On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: > On Sun, 15 Jul 2012, Dave, Tushar N wrote: > > Somehow setting max payload to 256 from BIOS does not set this value for > > all devices. I believe this is a BIOS bug. > > And preferably, Linux should complain about it. Since

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-15 Thread Henrique de Moraes Holschuh
On Sun, 15 Jul 2012, Dave, Tushar N wrote: > Somehow setting max payload to 256 from BIOS does not set this value for all > devices. I believe this is a BIOS bug. And preferably, Linux should complain about it. Since we know it is going to cause problems, and since we know it does happen, we sho

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Joe Jin
On 07/15/12 11:42, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Thursday, July 12, 2012 9:34 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Detec

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 9:34 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/13/12 12:10, Dave, Tush

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-12 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 4:46 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > Thanks for sending full dmesg

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-12 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 12:11 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 14:41, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>On 07/12/12 13:57, Dave, Tushar N wrote: >>> -Original Message- >>> From: Joe Jin [mailto:joe@oracle.com] >>> Sent: Wednesday, July 11, 2012 8:13 PM >>> To: Dave, Tushar N >>> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >>> ker...@vger.kernel.org >>> Subject: Re: 82571

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 13:57, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 8:13 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Dete

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 8:13 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 11:07, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 11:07, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 7:58 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Dete

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 7:58 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 10:52, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 10:52, Dave, Tushar N wrote: > What is the exact error messages in BIOS log? Error message from BIOS event log: 07/12/12 05:54:00 PCI Express Non-Fatal Error Thanks, Joe -- Live Security Virtual Conferenc

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 7:23 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 02:51, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 02:51, Dave, Tushar N wrote: > > Joe, > > I see couple of errors in lspci output. > Device capability status register shows UnCorrectable PCIe error. This means > there is certainly something went wrong. The only way to recover from > Uncorrectable errors is reset. > > Dev

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Andrew Peng [mailto:peng...@gmail.com] >Sent: Wednesday, July 11, 2012 8:50 AM >To: e1000-devel@lists.sourceforge.net >Subject: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Folks, I've been getting some strange error messages in my home server / >router

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 10:03 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 12:05, Dave, Tush

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:50, Dave, Tushar N wrote: > Device status and AER sections show some errors that looks little suspicious > to me but I'm not too sure. I will get back tomorrow. > Thanks a lot, Tushar! Joe -- Oracle Joe Jin | Software Development Senior Manager | +8610.

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 12:39 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 15:37, Dave, Tu

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:37, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 12:18 AM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Det

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 12:18 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 15:11, Dave, Tu

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:11, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Tuesday, July 10, 2012 10:03 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Detec

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 10:03 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 12:05, Dave, Tush

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 12:05, Dave, Tushar N wrote: > When you said you had this issue with RHEL5 and RHEL6 drivers, have you > install RHEl5/6 kernel and reproduced it? If so I think I should install > RHEL6 and try reproduce it locally! > Yes I reproduced this on both RHEL5 and RHEL6. So far I tried to

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 8:29 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 11:22, Dave, Tusha

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 11:22, Dave, Tushar N wrote: > Thanks for info. I see that hang occurs right when HW processing first TX > descriptor with TSO. > Would you be able to reproduce issue with TSO off? Disable TSO by 'ethtool > -K ethx tso off' > Let all debug enabled as it is, that will help us debug f

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 5:35 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 03:02, Dave, Tush

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 03:02, Dave, Tushar N wrote: >> -Original Message- >> From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >> On Behalf Of Joe Jin >> Sent: Tuesday, July 10, 2012 12:40 AM >> To: Joe Jin >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker..

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Dave, Tushar N >Sent: Tuesday, July 10, 2012 12:02 PM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Dave, Tushar N >Subject: RE: 82571EB: Detected Hardware Unit Hang > >>-Original Message- >>From: netd

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Tuesday, July 10, 2012 12:40 AM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardw

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Wyborny, Carolyn
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Tuesday, July 10, 2012 12:40 AM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hard

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
When I debug the driver I found before Detected HW hang, driver unable to clean and reclaim the resources: 1457 while ((eop_desc->upper.data & cpu_to_le32(E1000_TXD_STAT_DD)) && <== at here upper.data always is 0x300 1458(count < tx_ring->count)) { <--- snip ---> 148

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Joe Jin
On 07/09/12 17:21, Eric Dumazet wrote: > On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: >> Hi list, >> >> I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing >> scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy >> a big file (>500M) from another se

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Eric Dumazet
On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: > Hi list, > > I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing > scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy > a big file (>500M) from another server will hit it at once. > > Would you ple

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-11-03 Thread Michael Wang
On 11/03/2011 09:39 PM, Flavio Leitner wrote: > (moving the discussion back to the list) > > Hi, > > I am sorry, I didn't receive your patch as we discussed in private > and ended up writing one patch myself which essentially does the > same thing. > > The patch is available at: > https://bugzilla.

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-11-03 Thread Flavio Leitner
(moving the discussion back to the list) Hi, I am sorry, I didn't receive your patch as we discussed in private and ended up writing one patch myself which essentially does the same thing. The patch is available at: https://bugzilla.redhat.com/show_bug.cgi?id=746272#c13 It schedules a workqueue

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-26 Thread Michael Wang
Hi, Flavio, Jesse I have sent out the patch, which I hope will be of some help. Because this is my first time sending a patch, I am sorry if I have done something silly. Please tell me if there are any problems with it. Thanks & Best regards, Michael Wang

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-25 Thread Michael Wang
On 10/25/2011 11:57 PM, Jesse Brandeburg wrote: > On Mon, 24 Oct 2011 23:29:34 -0700 > Michael Wang wrote: >> May be you can just search macro >> "E1000_TXDCTL_DMA_BURST_ENABLE" >> in "drivers/net/e1000e/e1000.h", change it to: >> >> #define E1000_TXDCTL_DMA_BURST_ENABLE \ >> (E1000_TXDCTL_GRAN |

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-25 Thread Jesse Brandeburg
On Mon, 24 Oct 2011 23:29:34 -0700 Michael Wang wrote: > May be you can just search macro > "E1000_TXDCTL_DMA_BURST_ENABLE" > in "drivers/net/e1000e/e1000.h", change it to: > > #define E1000_TXDCTL_DMA_BURST_ENABLE \ > (E1000_TXDCTL_GRAN | /* set descriptor granularity */ \ > E1000_TXDCTL_COUNT_D

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-24 Thread Michael Wang
On 10/25/2011 12:26 AM, Flavio Leitner wrote: > On Mon, 24 Oct 2011 16:26:28 +0800 > Michael Wang wrote: > >> On 10/21/2011 10:03 PM, Flavio Leitner wrote: >>> On Fri, 21 Oct 2011 14:15:12 +0800 >>> Michael Wang wrote: >>> On 10/19/2011 08:16 PM, Flavio Leitner wrote: > On Wed, 19 Oct 2

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-24 Thread Flavio Leitner
On Mon, 24 Oct 2011 16:26:28 +0800 Michael Wang wrote: > On 10/21/2011 10:03 PM, Flavio Leitner wrote: > > On Fri, 21 Oct 2011 14:15:12 +0800 > > Michael Wang wrote: > > > >> On 10/19/2011 08:16 PM, Flavio Leitner wrote: > >>> On Wed, 19 Oct 2011 12:49:48 +0800 > >>> wangyun wrote: > >>> > >>>

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-24 Thread Michael Wang
On 10/21/2011 10:03 PM, Flavio Leitner wrote: > On Fri, 21 Oct 2011 14:15:12 +0800 > Michael Wang wrote: > >> On 10/19/2011 08:16 PM, Flavio Leitner wrote: >>> On Wed, 19 Oct 2011 12:49:48 +0800 >>> wangyun wrote: >>> Hi, Flavio I am new to join the community, work on e1000e drive

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-21 Thread Flavio Leitner
On Fri, 21 Oct 2011 14:15:12 +0800 Michael Wang wrote: > On 10/19/2011 08:16 PM, Flavio Leitner wrote: > > On Wed, 19 Oct 2011 12:49:48 +0800 > > wangyun wrote: > > > >> Hi, Flavio > >> > >> I am new to join the community, work on e1000e driver currently, > >> And I found a thing strange in this

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-20 Thread Michael Wang
On 10/19/2011 08:16 PM, Flavio Leitner wrote: > On Wed, 19 Oct 2011 12:49:48 +0800 > wangyun wrote: > >> Hi, Flavio >> >> I am new to join the community, work on e1000e driver currently, >> And I found a thing strange in this issue, please check below. >> >> Thanks, >> Michael Wang >> >> On 10/18/

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-19 Thread Flavio Leitner
On Wed, 19 Oct 2011 12:49:48 +0800 wangyun wrote: > Hi, Flavio > > I am new to join the community, work on e1000e driver currently, > And I found a thing strange in this issue, please check below. > > Thanks, > Michael Wang > > On 10/18/2011 10:42 PM, Flavio Leitner wrote: > > On Mon, 17 Oct 2

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-18 Thread wangyun
Hi, Flavio I am new to join the community, work on e1000e driver currently, And I found a thing strange in this issue, please check below. Thanks, Michael Wang On 10/18/2011 10:42 PM, Flavio Leitner wrote: > On Mon, 17 Oct 2011 11:48:22 -0700 > Jesse Brandeburg wrote: > >> On Fri, 14 Oct 2011 1

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-18 Thread Flavio Leitner
On Mon, 17 Oct 2011 11:48:22 -0700 Jesse Brandeburg wrote: > On Fri, 14 Oct 2011 10:04:26 -0700 > Flavio Leitner wrote: > > > > > Hi, > > > > I got few reports so far that 82571EB models are having the > > "Detected Hardware Unit Hang" issue after upgrading the kernel. > > > > Further debugg

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-17 Thread Jesse Brandeburg
On Fri, 14 Oct 2011 10:04:26 -0700 Flavio Leitner wrote: > > Hi, > > I got few reports so far that 82571EB models are having the > "Detected Hardware Unit Hang" issue after upgrading the kernel. > > Further debugging with an instrumented kernel revealed that the > socket buffer time stamp matc