Re: [#24529544] Re: Network sort of stops working with the em (intel) driver under load.

2010-06-01 Thread mark rowlands
 Ticket subject: Re: Network sort of stops working with the em (intel) driver 
 under  load.
 Ticket number: 24529544
 Ticket link: https://secure.mpcustomer.com/ticket.php?ticket=24529544
 Ticket body: On Mon, May 31, 2010 at 9:30 PM, mark rowlands 
 rowlands.m...@gmail.com wrote:
 Newly built and cvsupped system,with GENERIC kernel,  when copying
 large amounts of data over a gigabit link via scp  the network  will
 hang after a couple of gig.  I can then no longer login via ssh. If I
 leave it be, after about 12-24 hours I can then login again. (A reboot
 of course fixes the issue immediately...) .

 snip...

 I can now confirm that a similar copy to the vr card does not cause
 the same problem... so it would appear
 more likely that it might be an em related problem.

and after replacing the em0 with a realtek gigabit card, the problem
also does not occur. (the vr0 was only 100mb)
So it would seem there is something hinky with at least the em (82540) card .
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: [#24529544] Re: Network sort of stops working with the em (intel) driver under load.

2010-06-01 Thread Carl Chave
On Tue, Jun 1, 2010 at 3:34 PM, mark rowlands rowlands.m...@gmail.com wrote:
 Ticket subject: Re: Network sort of stops working with the em (intel) driver 
 under  load.
 Ticket number: 24529544
 Ticket link: https://secure.mpcustomer.com/ticket.php?ticket=24529544
 Ticket body: On Mon, May 31, 2010 at 9:30 PM, mark rowlands 
 rowlands.m...@gmail.com wrote:
 Newly built and cvsupped system,with GENERIC kernel,  when copying
 large amounts of data over a gigabit link via scp  the network  will
 hang after a couple of gig.  I can then no longer login via ssh. If I
 leave it be, after about 12-24 hours I can then login again. (A reboot
 of course fixes the issue immediately...) .

 snip...

 I can now confirm that a similar copy to the vr card does not cause
 the same problem... so it would appear
 more likely that it might be an em related problem.

 and after replacing the em0 with a realtek gigabit card, the problem
 also does not occur. (the vr0 was only 100mb)
 So it would seem there is something hinky with at least the em (82540) card .
 ___
 freebsd-questions@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-questions
 To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: Network sort of stops working with the em (intel) driver under load.

2010-06-01 Thread Carl Chave
On Mon, May 31, 2010 at 3:30 PM, mark rowlands rowlands.m...@gmail.com wrote:
 Newly built and cvsupped system,with GENERIC kernel,  when copying
 large amounts of data over a gigabit link via scp  the network  will
 hang after a couple of gig.  I can then no longer login via ssh. If I
 leave it be, after about 12-24 hours I can then login again. (A reboot
 of course fixes the issue immediately...) .

I've had this behavior also with Intel Pro 1000 cards using the em
driver.  I picked up a lot of 4 of them off ebay and never figured out
if it was a driver problem or faulty hardware.  I went back to FreeNAS
on that same box (7.2-RELEASE-p4) and it seems ok except my Netgear
gigabit switch also died so I'm currently plugged into a 100Mbps
switch.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Network sort of stops working with the em (intel) driver under load.

2010-05-31 Thread mark rowlands
Newly built and cvsupped system,with GENERIC kernel,  when copying
large amounts of data over a gigabit link via scp  the network  will
hang after a couple of gig.  I can then no longer login via ssh. If I
leave it be, after about 12-24 hours I can then login again. (A reboot
of course fixes the issue immediately...) .

The system is very vanilla, only accf_http in loader.conf. No sysctl
tuning done, no firewall hankypanky.

pciconf -lv | grep -A4 ^em
e...@pci0:3:4:0: class=0x02 card=0x001e8086 chip=0x100e8086 rev=0x02 
hdr=0x00
vendor = 'Intel Corporation'
device = 'Gigabit Ethernet Controller (82540EM)'
class  = network
subclass   = ethernet

em0: Intel(R) PRO/1000 Legacy Network Connection 1.0.1 port
0xec00-0xec3f mem 0xfeae-0xfeaf,0xff irq 17 at device 4.0 on
pci3

Suggestions as to where I could look for more information as to the
precise nature of the problem gratefully received.  Current plan is to
purchase another variety of gigabit card to see if it is specific to
the intel card.

Copyright (c) 1992-2010 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 8.1-PRERELEASE #0: Sun May 30 10:00:58 CEST 2010
r...@temp.mwrwin2k.se:/usr/obj/usr/src/sys/MARK2 i386
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Pentium(R) 4 CPU 3.40GHz (3391.53-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf34  Family = f  Model = 3  Stepping = 4
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Features2=0x441dSSE3,DTES64,MON,DS_CPL,CNXT-ID,xTPR
  TSC: P-state invariant
real memory  = 2147483648 (2048 MB)
avail memory = 2090823680 (1993 MB)
ACPI APIC Table: A M I  OEMAPIC 
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
FreeBSD/SMP: 1 package(s) x 1 core(s) x 2 HTT threads
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP/HT): APIC ID:  1
ioapic0: Changing APIC ID to 2
ioapic1: Changing APIC ID to 3
ioapic0 Version 0.3 irqs 0-23 on motherboard
ioapic1 Version 0.3 irqs 24-47 on motherboard
kbd1 at kbdmux0
acpi0: A M I OEMRSDT on motherboard
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
acpi0: reservation of fec0, 1000 (3) failed
acpi0: reservation of fee0, 1000 (3) failed
acpi0: reservation of 0, a (3) failed
acpi0: reservation of 10, 7ff0 (3) failed
Timecounter ACPI-fast frequency 3579545 Hz quality 1000
acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
cpu0: ACPI CPU on acpi0
cpu1: ACPI CPU on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge at device 1.0 on pci0
pci1: ACPI PCI bus on pcib1
vgapci0: VGA-compatible display port 0xd000-0xd0ff mem
0xc000-0xcfff,0xfe9e-0xfe9e irq 16 at device 0.0 on
pci1
vgapci1: VGA-compatible display mem
0xd000-0xdfff,0xfe9f-0xfe9f at device 0.1 on pci1
pcib2: ACPI PCI-PCI bridge irq 27 at device 2.0 on pci0
pci2: ACPI PCI bus on pcib2
atapci0: VIA 8237 SATA300 controller port
0xcc00-0xcc07,0xc880-0xc883,0xc800-0xc807,0xc480-0xc483,0xc400-0xc40f,0xc000-0xc0ff
irq 21 at device 15.0 on pci0
atapci0: [ITHREAD]
ata2: ATA channel 0 on atapci0
ata2: [ITHREAD]
ata3: ATA channel 1 on atapci0
ata3: [ITHREAD]
atapci1: VIA 8237 UDMA133 controller port
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device 15.1 on
pci0
ata0: ATA channel 0 on atapci1
ata0: [ITHREAD]
ata1: ATA channel 1 on atapci1
ata1: [ITHREAD]
uhci0: VIA 83C572 USB controller port 0xb480-0xb49f irq 20 at device
16.0 on pci0
uhci0: [ITHREAD]
usbus0: VIA 83C572 USB controller on uhci0
uhci1: VIA 83C572 USB controller port 0xb800-0xb81f irq 22 at device
16.1 on pci0
uhci1: [ITHREAD]
usbus1: VIA 83C572 USB controller on uhci1
uhci2: VIA 83C572 USB controller port 0xb880-0xb89f irq 21 at device
16.2 on pci0
uhci2: [ITHREAD]
usbus2: VIA 83C572 USB controller on uhci2
uhci3: VIA 83C572 USB controller port 0xbc00-0xbc1f irq 23 at device
16.3 on pci0
uhci3: [ITHREAD]
usbus3: VIA 83C572 USB controller on uhci3
ehci0: VIA VT6202 USB 2.0 controller mem 0xfe8ffc00-0xfe8ffcff irq
21 at device 16.4 on pci0
ehci0: [ITHREAD]
usbus4: EHCI version 1.0
usbus4: VIA VT6202 USB 2.0 controller on ehci0
isab0: PCI-ISA bridge at device 17.0 on pci0
isa0: ISA bus on isab0
vr0: VIA VT6102 Rhine II 10/100BaseTX port 0xb000-0xb0ff mem
0xfe8ff800-0xfe8ff8ff irq 23 at device 18.0 on pci0
vr0: Quirks: 0x0
vr0: Revision: 0x7c
miibus0: MII bus on vr0
ukphy0: Generic IEEE 802.3u media interface PHY 1 on miibus0
ukphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
vr0: Ethernet address: 00:19:66:92:a0:3f
vr0: [ITHREAD]
pcib3: ACPI PCI-PCI bridge at device 19.1 on pci0
pci3: ACPI PCI bus on pcib3
em0: Intel(R) PRO/1000 Legacy Network Connection 1.0.1 port
0xec00-0xec3f mem 

Re: Network sort of stops working with the em (intel) driver under load.

2010-05-31 Thread Michael Powell
mark rowlands wrote:

 Newly built and cvsupped system,with GENERIC kernel,  when copying
 large amounts of data over a gigabit link via scp  the network  will
 hang after a couple of gig.  I can then no longer login via ssh. If I
 leave it be, after about 12-24 hours I can then login again. (A reboot
 of course fixes the issue immediately...) .
 
 The system is very vanilla, only accf_http in loader.conf. No sysctl
 tuning done, no firewall hankypanky.
 
 pciconf -lv | grep -A4 ^em
 e...@pci0:3:4:0: class=0x02 card=0x001e8086 chip=0x100e8086 rev=0x02
 hdr=0x00
 vendor = 'Intel Corporation'
 device = 'Gigabit Ethernet Controller (82540EM)'
 class  = network
 subclass   = ethernet
 
 em0: Intel(R) PRO/1000 Legacy Network Connection 1.0.1 port
 0xec00-0xec3f mem 0xfeae-0xfeaf,0xff irq 17 at device 4.0 on
 pci3
 
 Suggestions as to where I could look for more information as to the
 precise nature of the problem gratefully received.  Current plan is to
 purchase another variety of gigabit card to see if it is specific to
 the intel card.
 


If memory serves, I think there may have been some traffic about something 
like this on the -CURRENT list. You might look/search there and see if it 
sounds similar. If it seems like it might be the same thing, look for an MFC 
back to -STABLE. Sometimes the fix for very a specific item which has been 
addressed is to take a system to -STABLE in order to obtain the fixed bits.

Research and confirm first, before considering such an update. My policy on 
-STABLE in the past is I only think about going there for a very narrow and 
specific situation where I know I have a problem that the devs have seen, 
analyzed, and fixed, with subsequent MFC.

Something else too - if you can disable the vr and the USB chips completely, 
it might provide a data point. IRQ sharing is supposed to work well, and it 
is something that may be eliminated from the scenario easily if you do not 
need these things.

-Mike
 


___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: Network sort of stops working with the em (intel) driver under load.

2010-05-31 Thread mark rowlands
On Mon, May 31, 2010 at 9:30 PM, mark rowlands rowlands.m...@gmail.com wrote:
 Newly built and cvsupped system,with GENERIC kernel,  when copying
 large amounts of data over a gigabit link via scp  the network  will
 hang after a couple of gig.  I can then no longer login via ssh. If I
 leave it be, after about 12-24 hours I can then login again. (A reboot
 of course fixes the issue immediately...) .

snip...

I can now confirm that a similar copy to the vr card does not cause
the same problem... so it would appear
more likely that it might be an em related problem.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org