On Wed, Mar 3, 2010 at 4:45 PM, Jesse Brandeburg
<jesse.brandeb...@gmail.com> wrote:
>
> On Mon, Mar 1, 2010 at 3:37 AM, Thrash Dude <thrash.d...@gmail.com> wrote:
> > Seems to be a rather common issue with the e1000 module. I searched the
> > archives back to 2005. Plenty of reports, no solutions.
>
> There are some solutions, one of which is to try loading the driver
> with TxDescriptorStep=4 TxDescriptors=1024

Hi Jesse thanks for the response!

>
>
> for some of the recent times have you been streaming using cifs or
> NFS?  what version of NFS?  what client machine /os did you test with?
>  What streaming software were you using to play the movie on the
> remote machine?

MythTV is the streaming software. Though, the Tx Hangs happen with cp,
scp, even just browsing the 'net.
Client/Server is NFS V3, also CIFS. I tried using CIFS instead of NFS
to rule out protocol issues. Also tried different NFS mount options.
One did not perform better than the other.
All systems are Slackware. I have not tried any other Linux systems.
If you can recommend a Live CD I could do that. Or - if really needed
- I can do a quick install of another OS on a spare partition if it
would help.

I have other older Motherboard chipsets available if that would help as well.


> so maybe its not actually related to traffic levels?

I don't believe so. Nor is it related to CPU usage. Top can report
0,0,0 or 3,3.5,4. All other equipment appears to functioning
optimally. As a precaution I did memtest all RAM sticks for 3 passes.



>
> This is actually a promising development because we might actually
> have something close to that system here.  What slot did you plug in?
> what is the barcode number on your adapter? XXXXXX-XXX.  The other
> (bad) option is that since the problem follows the adapter it could be
> the adapter.

It was the bottom PCI slot with both mother boards (GA-MA785GM-US2H,
GA-EP45-DS3L)

Numbers on the card -
PN SN: 95P2157 YL10256AE0N6
Manufactured for IBM by Intel
FRU 95P2156

Another sticker has -
000E0C****** (MAC ID ;) ) 416SC D26000-003




>
> have you double checked cooling of the NIC?  Do you have another
> identical NIC you can try?  You can probably get warranty support for
> the one you have, to get a replacement.

With the last Tx Hang unit, I promptly shut the PC down and removed
the card. It was not warm to the touch. The PC has decent ventilation.


>
> wow, thats a mess, please fix your mail client next time.  What I do
> see in the above is is appears to be a legitimate tx hang.  We have
> some debug code you can run that can help us diagnose, would you be
> able to run that?

Sorry about that. That message was posted with Pan VIA GMANE.

Sure I'd be more than willing to run some debug code.




> > If I upgrade to version e1000-8.0.19 Tx Unit Hang appears immediately
> > after the e1000 module is loaded.
>
> but does the part work at that point or is it completely dead?

If I rmmod, modprobe, rc.inet1 restart, I do get a link for a few
seconds/minutes, before it hangs again.

>
> >
> > Using ethtool to turn off rx,tx,sg,tso, and gso things appear to work
> > better. But - in that case, a $5 r8169 performs just as well.
> >
> > Full dmesg in next post.
>
> Still waiting for the next post...  please also include the output of
> ethtool -S eth0 after the next hang you get.

As noted above, I went ahead and removed the card for now. Latter
tonight, first thing in the morning I will put the Intel Nic back in.
And I'll link to pastebin.ca for the dmesg so it will hopefully be
legible. I do have /var/log/messages - and did notice -

[ 2893.699264] WARNING: at net/sched/sch_generic.c:261
dev_watchdog+0x236/0x250()
[ 2893.699266] Hardware name: GA-MA785GM-US2H
[ 2893.699268] NETDEV WATCHDOG: eth0 (e1000): transmit queue 0 timed
out
[ 2893.699270] Modules linked in: vmnet vmblock vsock vmci vmmon
snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device
snd_pcm_oss snd_mixer_oss ipv6 fuse joydev usbhid hid
snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm
snd_timer nvidia(P) firewire_ohci snd soundcore ohci_hcd e1000
firewire_core ssb mmc_core snd_page_alloc crc_itu_t i2c_piix4 ehci_hcd
agpgart rtc_cmos processor rtc_core rtc_lib thermal i2c_core
thermal_sys psmouse serio_raw button hwmon evdev sg

I can also put that to pastebin if you'd like.

( Off to find my receipt for possible RMA )

------------------------------------------------------------------------------
Download Intel&#174; Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
E1000-devel mailing list
E1000-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel&#174; Ethernet, visit 
http://communities.intel.com/community/wired

Reply via email to