On 11/03/2011 09:39 PM, Flavio Leitner wrote: > (moving the discussion back to the list) > > Hi, > > I am sorry, I didn't receive your patch as we discussed in private > and ended up writing one patch myself which essentially does the > same thing. > > The patch is available at: > https://bugzilla.redhat.com/show_bug.cgi?id=746272#c13 > > It schedules a workqueue to flush the descriptors 500ms after > sent the first packet. This ensures that there will be a write-back > and enough time before the watchdog detects it as an old entry. > > Time: 0 ms -> x ms -> y ms ->...-> 500ms -> > Pkts: pkt#1 -> pkt#2 -> pkt#3 ->...-> pkt#n ->pkt(n+1) > Event:schedule -> -> -> flush ->schedule > workqueue workqueue > > Customer reported that it works, so IMHO, the root cause is confirmed. > There is no enough packets to cause the write-back and writing to FPD > fixes it. > > That patch will flush every 500ms with high traffic too which > isn't good for performance, though it would be a flush of up to > 4 descriptors as far as I understand. > > I like Michael's approach to let the watchdog detects the hang first, > then try to flush. Michael told me that we could flush and use the > interrupt raised when the write-back ends to clean up. I think if > there is a real TX hang (i.e. no interrupt event), it will take another > watchdog cycle to detect that. It seems to me too much time without > taking any action. > > Maybe something like this would work: > 1) watchdog detects the hang > 2) check for FLAG2_DMA_BURST flag > 3) if yes, force flush, set a bit flag in the TX ring and schedule > watchdog with a short period > 4) if the TXDW interrupt happens, cleans up and reset the bit flag. > 5) if not, the watchdog will expire, that bit flag will remain set > then it will take any action assuming a real hang has occurred. > May be this is a way, let's try. > thanks, > fbl > > On Wed, 26 Oct 2011 17:27:04 +0800 > Michael Wang<wang...@linux.vnet.ibm.com> wrote: > >> Hi, Flavio, Jesse >> >> I have send out the patch, which I hope can do some help. >> >> Because this is my first time to send a patch, I am sorry if >> I have done some silly thing. >> >> And please tell me if there are some problem about it. >> >> Thanks& Best regards, >> Michael Wang >> >> ------------------------------------------------------------------------------ >> The demand for IT networking professionals continues to grow, and the >> demand for specialized networking skills is growing even more rapidly. >> Take a complimentary Learning@Cisco Self-Assessment and learn >> about Cisco certifications, training, and career opportunities. >> http://p.sf.net/sfu/cisco-dev2dev >> _______________________________________________ >> E1000-devel mailing list >> E1000-devel@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/e1000-devel >> To learn more about Intel® Ethernet, visit >> http://communities.intel.com/community/wired
------------------------------------------------------------------------------ RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 _______________________________________________ E1000-devel mailing list E1000-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/e1000-devel To learn more about Intel® Ethernet, visit http://communities.intel.com/community/wired