Re: bge problems under 7.2-PRERELEASE

2009-06-15 Thread Henri-Pierre Charles
Hello,

On Fri, May 29, 2009 at 8:16 PM, Marius Stroblmar...@alchemy.franken.de wrote:
 On Fri, May 29, 2009 at 08:02:42PM +0200, Henri-Pierre Charles wrote:
 Hello Guys

 On Thu, Apr 23, 2009 at 10:05 PM, Jeff Blank jb000...@mr-happy.com wrote:
  On Thu, Apr 23, 2009 at 07:19:49PM +0200, Marius Strobl wrote:
  So in combination with
  the low number of problem reports this really doesn't
  look like a generic problem of this Broadcom hardware
  or bge(4) and I just can suggest to disable MSI by
  setting the hw.pci.enable_msi tuneable to 0 for now.
 
  Great, this does the trick, and I'm now running today's RELENG_7.
  Thanks for your help, and let me know if ever you'd like me to help
  test possible fixes.

 I have the same computer (Dell optiplex 740) but the sysctl turnaround
 doesn't work, the bge interface seem up but no interruptions.

 I have to download / recompile with sys/dev/bge/if_bge.c rev
 1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6

 Is there anything else to try ?

 You need to set hw.pci.enable_msi in the loader f.e. via
 loader.conf rather than the corresponding sysctl as the
 latter has no effect during boot.

Ok, thanks it works.

-- 
HPC
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-05-29 Thread Henri-Pierre Charles
Hello Guys

On Thu, Apr 23, 2009 at 10:05 PM, Jeff Blank jb000...@mr-happy.com wrote:
 On Thu, Apr 23, 2009 at 07:19:49PM +0200, Marius Strobl wrote:
 So in combination with
 the low number of problem reports this really doesn't
 look like a generic problem of this Broadcom hardware
 or bge(4) and I just can suggest to disable MSI by
 setting the hw.pci.enable_msi tuneable to 0 for now.

 Great, this does the trick, and I'm now running today's RELENG_7.
 Thanks for your help, and let me know if ever you'd like me to help
 test possible fixes.

I have the same computer (Dell optiplex 740) but the sysctl turnaround
doesn't work, the bge interface seem up but no interruptions.

I have to download / recompile with sys/dev/bge/if_bge.c rev
1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6

Is there anything else to try ?

H-P

-- 
HPC
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-05-29 Thread Marius Strobl
On Fri, May 29, 2009 at 08:02:42PM +0200, Henri-Pierre Charles wrote:
 Hello Guys
 
 On Thu, Apr 23, 2009 at 10:05 PM, Jeff Blank jb000...@mr-happy.com wrote:
  On Thu, Apr 23, 2009 at 07:19:49PM +0200, Marius Strobl wrote:
  So in combination with
  the low number of problem reports this really doesn't
  look like a generic problem of this Broadcom hardware
  or bge(4) and I just can suggest to disable MSI by
  setting the hw.pci.enable_msi tuneable to 0 for now.
 
  Great, this does the trick, and I'm now running today's RELENG_7.
  Thanks for your help, and let me know if ever you'd like me to help
  test possible fixes.
 
 I have the same computer (Dell optiplex 740) but the sysctl turnaround
 doesn't work, the bge interface seem up but no interruptions.
 
 I have to download / recompile with sys/dev/bge/if_bge.c rev
 1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6
 
 Is there anything else to try ?

You need to set hw.pci.enable_msi in the loader f.e. via
loader.conf rather than the corresponding sysctl as the
latter has no effect during boot.

Marius

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-23 Thread Annelise Anderson

On Wed, 22 Apr 2009, Marius Strobl wrote:


On Wed, Apr 22, 2009 at 11:06:12AM -0400, Jeff Blank wrote:

On Tue, Apr 21, 2009 at 09:55:10PM +0200, Marius Strobl wrote:

Use the sources you have but plug in sys/dev/bge/if_bge.c
rev 1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6, this
combination should work.


correct.


Then update if_bge.c to rev
1.198.2.14 and check whether things still work,


this causes the problem to resurface.



David,

this is the second report that using MSI with a BCM5754/
chip=0x167a14e4 results in no interrupts, is there any
errata about this?

Thanks,
Marius


I wanted to upgrade my stable and but now am afraid to
reboot with the new kernel as I've got one of these bge0 cards.


dmesg reveals:

brgphy0: BCM5750 10/100/1000baseTX PHY PHY 1 on miibus0

a different version of BCM, not BCM 5754

and the log of cvsup shows:

Edit src/sys/dev/bge/if_bge.c
Add delta 1.198.2.12 2009.01.15.20.13.22 marius
Add delta 1.198.2.13 2009.01.15.20.19.53 marius
Add delta 1.198.2.14 2009.01.15.20.23.38 marius
Add delta 1.198.2.15 2009.03.19.16.44.37 marius
Add delta 1.198.2.16 2009.03.23.20.53.38 marius
Add delta 1.198.2.17 2009.03.27.21.21.22 marius

Would you advise using the old kernel?  Should I have
posted to -questions?

Annelise





___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-23 Thread Jeff Blank
On Thu, Apr 23, 2009 at 12:41:36AM -0700, Annelise Anderson wrote:
 I wanted to upgrade my stable and but now am afraid to
 reboot with the new kernel as I've got one of these bge0 cards.

You should be able to back out fairly easily in case of most problems.
When you reboot to single-user mode with the new kernel, you can test
whatever you think you need to (device presence/functionality,
networking, etc) before installing world.  If you don't like the
results, just rename or delete /boot/kernel, rename /boot/kernel.old
to /boot/kernel, and reboot, no damage done (usually).

 dmesg reveals:
 
 brgphy0: BCM5750 10/100/1000baseTX PHY PHY 1 on miibus0
 
 a different version of BCM, not BCM 5754

I can't speak authoritatively, but I expect you're fine.  My problem
appears to have a very narrow scope.

Jeff
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-23 Thread Marius Strobl
On Wed, Apr 22, 2009 at 11:06:12AM -0400, Jeff Blank wrote:
 On Tue, Apr 21, 2009 at 09:55:10PM +0200, Marius Strobl wrote:
  Use the sources you have but plug in sys/dev/bge/if_bge.c
  rev 1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6, this
  combination should work.
 
 correct.
 
  Then update if_bge.c to rev
  1.198.2.14 and check whether things still work,
 
 this causes the problem to resurface.
 

FYI, I've found a notebook equipped with the exact same
Broadcom chip and ASIC revision but which works without
problems with current sources. So in combination with
the low number of problem reports this really doesn't
look like a generic problem of this Broadcom hardware
or bge(4) and I just can suggest to disable MSI by
setting the hw.pci.enable_msi tuneable to 0 for now.
As David pointed out this might be a problem with the
Nvidia chipset. Linux seems to have a workaround
related to this but I still need to check that in
detail.

Marius

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-23 Thread Jeff Blank
On Thu, Apr 23, 2009 at 07:19:49PM +0200, Marius Strobl wrote:
 So in combination with
 the low number of problem reports this really doesn't
 look like a generic problem of this Broadcom hardware
 or bge(4) and I just can suggest to disable MSI by
 setting the hw.pci.enable_msi tuneable to 0 for now.

Great, this does the trick, and I'm now running today's RELENG_7.
Thanks for your help, and let me know if ever you'd like me to help
test possible fixes.

Jeff
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-22 Thread Jeff Blank
On Tue, Apr 21, 2009 at 09:55:10PM +0200, Marius Strobl wrote:
 Use the sources you have but plug in sys/dev/bge/if_bge.c
 rev 1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6, this
 combination should work.

correct.

 Then update if_bge.c to rev
 1.198.2.14 and check whether things still work,

this causes the problem to resurface.

Jeff
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-22 Thread Marius Strobl
On Wed, Apr 22, 2009 at 11:06:12AM -0400, Jeff Blank wrote:
 On Tue, Apr 21, 2009 at 09:55:10PM +0200, Marius Strobl wrote:
  Use the sources you have but plug in sys/dev/bge/if_bge.c
  rev 1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6, this
  combination should work.
 
 correct.
 
  Then update if_bge.c to rev
  1.198.2.14 and check whether things still work,
 
 this causes the problem to resurface.
 

David,

this is the second report that using MSI with a BCM5754/
chip=0x167a14e4 results in no interrupts, is there any
errata about this?

Thanks,
Marius

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-22 Thread Marius Strobl
On Wed, Apr 22, 2009 at 01:21:58PM -0700, David Christensen wrote:
Then update if_bge.c to rev
1.198.2.14 and check whether things still work,
   
   this causes the problem to resurface.
   
  
  David,
  
  this is the second report that using MSI with a BCM5754/
  chip=0x167a14e4 results in no interrupts, is there any errata 
  about this?
 
 Nope, no errata for MSI on the 5754, nor can I recall any
 MSI problems on any of the PCIe based devices.  Do the 
 systems share a common root complex that might be the source
 of the issue?
 

This one useses a:
no...@pci0:0:0:0:   class=0x05 card=0x02f010de chip=0x02f010de rev=0xa2 
hdr=0x00
vendor = 'Nvidia Corp'
device = 'C51 Host Bridge'
class  = memory
subclass   = RAM

At least Linux doesn't seem to avoid using MSI with these.
I didn't get any further regarding the other report so far.

Marius

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


RE: bge problems under 7.2-PRERELEASE

2009-04-22 Thread David Christensen
   Then update if_bge.c to rev
   1.198.2.14 and check whether things still work,
  
  this causes the problem to resurface.
  
 
 David,
 
 this is the second report that using MSI with a BCM5754/
 chip=0x167a14e4 results in no interrupts, is there any errata 
 about this?

Nope, no errata for MSI on the 5754, nor can I recall any
MSI problems on any of the PCIe based devices.  Do the 
systems share a common root complex that might be the source
of the issue?

Dave

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


RE: bge problems under 7.2-PRERELEASE

2009-04-22 Thread David Christensen
   this is the second report that using MSI with a BCM5754/
   chip=0x167a14e4 results in no interrupts, is there any 
 errata about 
   this?
  
  Nope, no errata for MSI on the 5754, nor can I recall any 
 MSI problems 
  on any of the PCIe based devices.  Do the systems share a 
 common root 
  complex that might be the source of the issue?
  
 
 This one useses a:
 no...@pci0:0:0:0:   class=0x05 card=0x02f010de 
 chip=0x02f010de rev=0xa2 hdr=0x00
 vendor = 'Nvidia Corp'
 device = 'C51 Host Bridge'
 class  = memory
 subclass   = RAM
 
 At least Linux doesn't seem to avoid using MSI with these.
 I didn't get any further regarding the other report so far.

Are there any other errors reported by the 5754 such as a PCIe 
completion timeout in the Advanced Error Uncorrectable Status
Register?  I recall an issue with another Nvidia chipset and 
the 5754 where the system BIOS incorrectly configured the
Hypertransport unitID field to a value of 0x20 which would
prevent our diagnostics from running.  PCI configuration and
memory-mapped I/O worked correctly but DMA operations such as
sending RX'd frames would fail.  MSI might fall into this latter
category and could be causing this error.

The important thing would be to check for a Completion Timeout
in the 5754.

Dave

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-21 Thread Marius Strobl
On Mon, Apr 20, 2009 at 04:46:38PM -0400, Jeff Blank wrote:
 Hi,
 
 csup today around 13:00 UTC.  'make buildworld', 'make buildkernel',
 'make installkernel', reboot to single-user.
 
 # ifconfig bge0 141.219.5.33/22 up
 # ping 141.219.4.1
 PING 141.219.4.1 (141.219.4.1): 56 data bytes
 ^C
 --- 141.219.4.1 ping statistics ---
 3 packets transmitted, 0 packets received, 100.0% packet loss
 
 # ifconfig bge0
 bge0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500
 options=9bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM
 ether 00:1e:c9:44:30:14
 inet6 fe80::21e:c9ff:fe44:3014%bge0 prefixlen 64 scopeid 0x1 
 inet 141.219.5.33 netmask 0xfc00 broadcast 141.219.7.255
 media: Ethernet autoselect (100baseTX full-duplex)
 status: active
 
 I have no trouble with networking using a kernel from 20090115.  This

Does that working kernel include r187309 (if_bge.c rev 1.198.2.14)?
If it predates it could you please check whether this is the change
which breaks things for you?

 is a Dell Optiplex 740, about 9 months old.  make.conf and src.conf
 are below.  dmesg and pciconf output are attached.  I ran 'tcpdump -i
 bge0 -nn' while pinging, and this captured no packets, not even this
 host's ARP broadcasts.
 
 I also encountered this problem in late March as well, just after
 RELENG_7 became 7.2-PRERELEASE, but couldn't find time to put this
 information together.
 
 If there's anything else I can run to troubleshoot this, please let me
 know.
 
 thank you,
 Jeff Blank
 
 /etc/make.conf:
 CPUTYPE ?= k8
 WITHOUT_NLS=true
 [port-specific options elided for brevity]
 
 /etc/src.conf:
 WITHOUT_ATM=true
 WITHOUT_GPIB=true
 WITHOUT_I4B=true
 WITHOUT_IPFILTER=true
 WITHOUT_IPFW=true
 WITHOUT_IPX=true
 WITHOUT_LIB32=true
 WITHOUT_NCP=true
 WITHOUT_NDIS=true
 WITHOUT_PPP=true
 WITHOUT_QUOTAS=true
 WITHOUT_RCMDS=true
 WITHOUT_SENDMAIL=true
 WITHOUT_SLIP=true
 WITHOUT_WIRELESS=true

 # dmesg
 Copyright (c) 1992-2009 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
   The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 7.2-PRERELEASE #0: Mon Apr 20 12:27:26 EDT 2009
 r...@bender.tc.mtu.edu:/usr/obj/usr/src/sys/GENERIC
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: AMD Athlon(tm) 64 X2 Dual Core Processor 5600+ (2906.08-MHz K8-class CPU)
   Origin = AuthenticAMD  Id = 0x60fb2  Stepping = 2
   
 Features=0x178bfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT
   Features2=0x2001SSE3,CX16
   AMD Features=0xea500800SYSCALL,NX,MMX+,FFXSR,RDTSCP,LM,3DNow!+,3DNow!
   AMD Features2=0x11fLAHF,CMP,SVM,ExtAPIC,CR8,Prefetch
   TSC: P-state invariant
   Cores per package: 2
 usable memory = 4278718464 (4080 MB)
 avail memory  = 4098314240 (3908 MB)
 ACPI APIC Table: DELL   bMk
 FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
  cpu0 (BSP): APIC ID:  0
  cpu1 (AP): APIC ID:  1
 This module (opensolaris) contains code covered by the
 Common Development and Distribution License (CDDL)
 see http://opensolaris.org/os/licensing/opensolaris_license/
 ioapic0: Changing APIC ID to 4
 ioapic0 Version 1.1 irqs 0-23 on motherboard
 kbd1 at kbdmux0
 acpi0: DELL bMk on motherboard
 acpi0: [ITHREAD]
 acpi0: Power Button (fixed)
 acpi0: reservation of 0, a (3) failed
 acpi0: reservation of 10, bfdd (3) failed
 Timecounter ACPI-fast frequency 3579545 Hz quality 1000
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
 acpi_hpet0: High Precision Event Timer iomem 0xfeff-0xfeff03ff on acpi0
 Timecounter HPET frequency 2500 Hz quality 900
 acpi_button0: Power Button on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pci0: memory, RAM at device 0.0 (no driver attached)
 pci0: memory, RAM at device 0.1 (no driver attached)
 pci0: memory, RAM at device 0.2 (no driver attached)
 pci0: memory, RAM at device 0.3 (no driver attached)
 pci0: memory, RAM at device 0.4 (no driver attached)
 pci0: memory, RAM at device 0.5 (no driver attached)
 pci0: memory, RAM at device 0.6 (no driver attached)
 pci0: memory, RAM at device 0.7 (no driver attached)
 pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
 pci1: ACPI PCI bus on pcib1
 pcib2: ACPI PCI-PCI bridge at device 3.0 on pci0
 pci2: ACPI PCI bus on pcib2
 bge0: Broadcom NetXtreme Gigabit Ethernet Controller, ASIC rev. 0xb002 mem 
 0xfd8f-0xfd8f irq 16 at device 0.0 on pci2
 miibus0: MII bus on bge0
 brgphy0: BCM5787 10/100/1000baseTX PHY PHY 1 on miibus0
 brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 
 1000baseT-FDX, auto
 bge0: Ethernet address: 00:1e:c9:44:30:14
 bge0: [ITHREAD]
 pcib3: ACPI PCI-PCI bridge at device 4.0 on pci0
 pci3: ACPI PCI bus on pcib3
 vgapci0: VGA-compatible display port 0xbc00-0xbcff mem 
 

Re: bge problems under 7.2-PRERELEASE

2009-04-21 Thread Jeff Blank
On Tue, Apr 21, 2009 at 08:49:24PM +0200, Marius Strobl wrote:
 On Mon, Apr 20, 2009 at 04:46:38PM -0400, Jeff Blank wrote:
  I have no trouble with networking using a kernel from 20090115.  This
 Does that working kernel include r187309 (if_bge.c rev 1.198.2.14)?

nope, 1.198.2.11.

 If it predates it could you please check whether this is the change
 which breaks things for you?

what's the best way to go about this?  I no longer have the source
tree from the working kernel.  should I plug 1.198.2.11 into the
sources I have and then try .14?  any other files I'd need previous
revisions of?  or is there a different, better way to go about it?  (I
don't know much about time-travelling through source.)

thanks,
Jeff
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: bge problems under 7.2-PRERELEASE

2009-04-21 Thread Marius Strobl
On Tue, Apr 21, 2009 at 03:31:29PM -0400, Jeff Blank wrote:
 On Tue, Apr 21, 2009 at 08:49:24PM +0200, Marius Strobl wrote:
  On Mon, Apr 20, 2009 at 04:46:38PM -0400, Jeff Blank wrote:
   I have no trouble with networking using a kernel from 20090115.  This
  Does that working kernel include r187309 (if_bge.c rev 1.198.2.14)?
 
 nope, 1.198.2.11.
 
  If it predates it could you please check whether this is the change
  which breaks things for you?
 
 what's the best way to go about this?  I no longer have the source
 tree from the working kernel.  should I plug 1.198.2.11 into the
 sources I have and then try .14?  any other files I'd need previous
 revisions of?  or is there a different, better way to go about it?  (I
 don't know much about time-travelling through source.)
 

Use the sources you have but plug in sys/dev/bge/if_bge.c
rev 1.198.2.13 and sys/dev/pci/pci.c rev 1.355.2.6, this
combination should work. Then update if_bge.c to rev
1.198.2.14 and check whether things still work, if they do
revert to pci.c rev 1.355.2.9 and check again.

Marius

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org