Re: SunFire X2200 ilo's bge1 DOWN/UP

2013-05-30 Thread Daniel Braniss
 
 On Tue, May 28, 2013 at 09:55:24AM +0300, Daniel Braniss wrote:
   On Tue, May 28, 2013 at 09:28:00AM +0300, Daniel Braniss wrote:
 On Mon, May 27, 2013 at 10:59:28AM +0300, Daniel Braniss wrote:
   On Fri, May 24, 2013 at 05:31:13PM +0300, Daniel Braniss wrote:
hi, after upgrading to 9.1-stable, this particular hardware - 
SunFire X2200,
   
   Show me dmesg(bge(4) and brgphy(4) only) and 'ifconfig bge1' 
   output.
   
  
  bge0: <Broadcom NetXtreme Gigabit Ethernet Controller, ASIC rev. 0x009003> mem 0xfdff-0xfdff,0xfdfe-0xfdfe irq 17 at device 4.0 on pci6
  bge0: CHIP ID 0x9003; ASIC REV 0x09; CHIP REV 0x90; PCI-X 133 MHz
  miibus2: <MII bus> on bge0
  brgphy0: <BCM5714 1000BASE-T media interface> PHY 1 on miibus2
  brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
  bge0: Ethernet address: 00:1b:24:5d:5b:bd
  bge1: <Broadcom NetXtreme Gigabit Ethernet Controller, ASIC rev. 0x009003> mem 0xfdfc-0xfdfc,0xfdfb-0xfdfb irq 18 at device 4.1 on pci6
  bge1: CHIP ID 0x9003; ASIC REV 0x09; CHIP REV 0x90; PCI-X 133 MHz
  miibus3: <MII bus> on bge1
  brgphy1: <BCM5714 1000BASE-T media interface> PHY 1 on miibus3
  brgphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
  bge1: Ethernet address: 00:1b:24:5d:5b:be

  sf-10 ifconfig bge1
  bge1: flags=8802<BROADCAST,SIMPLEX,MULTICAST> metric 0 mtu 1500
          options=8009b<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,LINKSTATE>
          ether 00:1b:24:5d:5b:be
          nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
          media: Ethernet autoselect (100baseTX full-duplex)
          status: active
  
 
 Because bge1 is not UP, I wonder how you get link UP/DOWN events.
 Do you have some network script run by cron?

no scripts.
this port is shared with the ILO/IPMI, and back in March you fixed a
problem where it was hanging soon after it was initialized by the driver
(r248226, but I'm not sure if it was ever MFC'ed).
   
   It was MFCed.
   
Initially I thought it could be caused by connections to it from other
hosts (either via the web, or ssh) so I killed them, but it didn't help.
Without that patch the connection fails, and I don't see any DOWN/UP.
   
   Could you check how many interrupts you get from bge1?
   Ideally you shouldn't get any interrupts for bge1.
  
  it's not even mentioned :-)
  sf-04 vmstat -i
  interrupt                          total       rate
  irq3: uart1                          964          0
  irq4: uart0                            6          0
  irq14: ata0                       227354          0
  irq17: bge0                      1021981          2
  irq21: ohci0                          28          0
  irq22: ehci0                           2          0
  irq23: atapci1                    293228          0
  cpu0:timer                     383244076       1124
  cpu1:timer                       2225144          6
  cpu2:timer                       2056087          6
  cpu3:timer                       2093943          6
  Total                          391162813       1147
  
 
 Then the only way link UP/DOWN event could be generated for DOWN
 interface would be invocation of media status query
 (i.e. ifconfig -a) triggered by an external application.  Most
 drivers I touched check IFF_UP flag before poking media status
 register. However I'm not sure you're seeing this issue because you
 do not use any network script run by cron.
 Anyway, try attached patch and let me know whether it makes any
 difference.
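
 A minimal userland sketch of the guard described above, assuming mock
 types in place of the real kernel structures. Every name prefixed with
 mock_ or MOCK_ is hypothetical and exists only for illustration; the
 real handler is bge_ifmedia_sts() in sys/dev/bge/if_bge.c. The point it
 tries to show is that a media status query on a DOWN interface returns
 before the PHY is touched, and that the lock is released on every path.

```c
#include <stdbool.h>

/* Hypothetical stand-ins for kernel definitions; values are illustrative. */
#define MOCK_IFF_UP      0x1
#define MOCK_IFM_AVALID  0x1
#define MOCK_IFM_ACTIVE  0x2

struct mock_softc {
	int  if_flags;      /* interface flags; MOCK_IFF_UP when configured up */
	bool locked;        /* stands in for the driver mutex */
	int  phy_polls;     /* counts how often the PHY was queried */
	bool link_present;  /* what the PHY would report if asked */
};

struct mock_mediareq {
	int ifm_status;     /* status bits handed back to the caller */
};

static void mock_lock(struct mock_softc *sc)   { sc->locked = true; }
static void mock_unlock(struct mock_softc *sc) { sc->locked = false; }

/*
 * Media status handler with the guard: when the interface is down,
 * drop the lock and return without querying the PHY, leaving the
 * caller's request structure untouched.
 */
static void
mock_ifmedia_sts(struct mock_softc *sc, struct mock_mediareq *ifmr)
{
	mock_lock(sc);
	if ((sc->if_flags & MOCK_IFF_UP) == 0) {
		mock_unlock(sc);
		return;
	}
	sc->phy_polls++;
	ifmr->ifm_status = MOCK_IFM_AVALID;
	if (sc->link_present)
		ifmr->ifm_status |= MOCK_IFM_ACTIVE;
	mock_unlock(sc);
}
```

 Calling mock_ifmedia_sts() on a softc whose if_flags lacks MOCK_IFF_UP
 leaves phy_polls at zero, which is the behaviour the patch aims for.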
 
   

 
is toggling bge1 DOWN/UP every few hours; this port is being
used by the ILO.
To check, I upgraded another identical host, and the same 
problem appears. 
   
   What is the last known working revision?
  
  I have no idea, but I have older versions, and I'll start from the
  oldest
  (9.1-prerelease), but
  it will take time, since it takes hours until it happens.
  
 
 ok.


  
  
 
 --/04w6evG8XlLl3ft
 Content-Type: text/x-diff; charset=us-ascii
 Content-Disposition: attachment; filename=bge.media_sts.diff
 
 Index: sys/dev/bge/if_bge.c
 ===
 --- sys/dev/bge/if_bge.c  (revision 251021)
 +++ sys/dev/bge/if_bge.c  (working copy)
 @@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct ifmediar
  
   BGE_LOCK(sc);
  
 + if ((ifp->if_flags & IFF_UP) == 0) {
 +  

Re: SunFire X2200 ilo's bge1 DOWN/UP

2013-05-30 Thread John Baldwin
On Thursday, May 30, 2013 2:44:35 am Daniel Braniss wrote:
  --/04w6evG8XlLl3ft
  Content-Type: text/x-diff; charset=us-ascii
  Content-Disposition: attachment; filename=bge.media_sts.diff
  
  Index: sys/dev/bge/if_bge.c
  ===
  --- sys/dev/bge/if_bge.c(revision 251021)
  +++ sys/dev/bge/if_bge.c(working copy)
  @@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct ifmediar
   
  BGE_LOCK(sc);
   
  +   if ((ifp->if_flags & IFF_UP) == 0) {
  +       BGE_UNLOCK(sc);
  +       return;
  +   }
  if (sc->bge_flags & BGE_FLAG_TBI) {
      ifmr->ifm_status = IFM_AVALID;
      ifmr->ifm_active = IFM_ETHER;
  
  --/04w6evG8XlLl3ft--
 after 18hs, the logs are empty!
 it seems the patch fixes the problem.
 
 now maybe it's time to hunt for who is randomly calling for bge_ifmedia_sts
 ...

It could be any number of daemons that query interface state such as an
SNMP server, ladvd, etc.

If you wanted help you could modify the patch so that it does something like 
this:

if (/* test for IFF_UP */) {
	BGE_UNLOCK(sc);
	if_printf(ifp, "state queried on down interface by pid %d (%s)",
	    curthread->td_proc->p_pid, curthread->td_proc->p_comm);
	return;
}
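
For readers without a kernel tree at hand, here is a rough userland
sketch of the log line such a patch would produce. It assumes that
if_printf() prefixes its message with the interface name and a colon;
struct mock_caller and format_query_log() are hypothetical names made up
for this illustration, not kernel APIs. The format string ends with a
newline so that each query shows up as its own log line.

```c
#include <stdio.h>

/* Hypothetical stand-in for the calling process; not a kernel type. */
struct mock_caller {
	int         pid;   /* process ID of whoever issued the query */
	const char *comm;  /* command name of that process */
};

/*
 * Build the diagnostic line for a media status query that arrived
 * while the interface was down. Returns what snprintf() returns:
 * the length the full message needs, excluding the terminating NUL.
 */
static int
format_query_log(char *buf, size_t len, const char *ifname,
    const struct mock_caller *who)
{
	return (snprintf(buf, len,
	    "%s: state queried on down interface by pid %d (%s)\n",
	    ifname, who->pid, who->comm));
}
```

With a caller of { 1234, "snmpd" } and the interface name "bge1", the
buffer holds "bge1: state queried on down interface by pid 1234 (snmpd)"
followed by a newline.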

-- 
John Baldwin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: System doesn't dump

2013-05-30 Thread John Baldwin
On Wednesday, May 29, 2013 2:41:38 am Dominic Fandrey wrote:
 I have a number of actions that reliably panic the system, such as
 performing shutdown -p (yes I'm booting into an inconsistent file
 system every time). Both with my notebook and my workstation.
 
 However I cannot get the system to dump.
 
 dumpdir=/var/crash
 and I've tried ada0s2b, /dev/ada0s2b, label/5swap, /dev/label/5swap and AUTO
 for dumpdev to no avail.
 
 The swap partition is 16g, the machines have 8g RAM and there's plenty
 of hard disk space available for /var/crash.
 
 I'm looking for that secret, undocumented trigger, that makes the
 system dump if a panic occurs. Once upon a time dumping just worked
 if the swap partition was large enough. I miss those olden days.

Does /dev/dumpdev exist and point to your swap partition after booting?

-- 
John Baldwin


[HEADSUP] New pkg-devel 1.1.0 beta1

2013-05-30 Thread Baptiste Daroussin
Hi,

The pkg development team is proud to announce the new 1.1.0 beta1 release of
pkg.

Here is the list of new features that happened in pkg 1.1:
- new simpler and more reliable solver
- shared libraries are now always tracked
- ssh:// is supported as a protocol to distribute packages (needs pkg 1.1+ on
  the server hosting the packages)
- multirepository is no longer considered experimental and works by default.
- incremental update of the catalog (only if the repository was created by pkg
  1.1+)
- simplification of the public API
- stabilisation of the public API (we will now try to keep it stable, and if
  changes are needed there will be a deprecation period before old functions
  are removed)
- new experimental pkg convert (can convert from and to legacy pkg database)
  pkg2ng now uses pkg convert (still recommended to use pkg2ng)
- new pkg lock/unlock to prevent any manipulation of a given package (no
  upgrade,delete,etc)
- improved UI (now you can see the progress of an upgrade and what is left to
  be done)
- new pkg annotation to allow one to add annotations (free form key/value) to a
  package
- pkg audit is now able to directly parse the vuxml native format and not only
  the compact version
- pkg -vv now shows all available options and their current settings
- pkg -vvv now shows a description of all the available options
- pkg info now automatically considers the query as globbing if * is in the
  requested pattern
- new hook plugin interface (allows users to create hooks that get called at
  any time during an upgrade/installation/deletion of a package)
- new cmd plugin interface (allows users to create new subcommands for
  pkg)
- pkg register can now register a port installation in the legacy database
  format
- repository can be defined in simple yaml files

Internal:
- massive usage of hash tables (uthash), which simplifies a lot of the code
  and improves performance
- lots of optimisation in plist and manifest parsing
- lots of optimisation in loading packages (mmap used when possible)
- lots of cleanup in memory usage
- regression test framework is now ready (using atf); regression tests are
  slowly being added and populated.

To use this new version:
  Ports users (or in building factories: poudriere/tinderbox):
    Add WITH_PKGNG=devel to your make.conf
    pkg set -o ports-mgmt/pkg:ports-mgmt/pkg-devel

  Binary package users, if the remote repository is providing pkg 1.1:
    pkg set -o ports-mgmt/pkg:ports-mgmt/pkg-devel
    pkg upgrade


Note that pkg 1.1 can use a repository created for pkg 1.0 and vice versa.

Huge thanks to all the people who have contributed to pkg development,
be it through:
- code
- documentation
- bug report
- feedback
- ideas

List of people who contributed code:
Baptiste Daroussin, Matthew Seaman, Bryan Drewery, Vsevolod Stakhov,
Marin Atanasov Nikolov, Alexandre Perrin, Romain Tartière, Julien Laffaye,
Glen Barber, John Marino, Alex Kozlov, Roman Naumann, Sofian Brabez,
Alberto Villa, Will Andrews, Eitan Adler, Dan McGregor, namor, niamtokik,
Arthur Gautier, Garrett Cooper, Andrew Turner, Jeremy Chadwick,
Hajimu UMEMOTO, Mark Lokowich, Eygene Ryabinkin, Pietro Cerutti,
Rolf Grossmann, Ed Schouten, Dimitry Andric, David Forsythe, Stefan Grundmann,
Craig Rodrigues, Antoine Brodin, Andrey Zonov, Joel Dahl

Stats between 1.0 and 1.1:
287 files changed, 63418 insertions(+), 18763 deletions(-)

1198 commits

regards,
Bapt




Re: [HEADSUP] New pkg-devel 1.1.0 beta1

2013-05-30 Thread Bryan Drewery
On 5/30/2013 10:20 AM, Baptiste Daroussin wrote:
   Ports users (or in building factories: poudriere/tinderbox):
 Add WITH_PKGNG=devel to your make.conf
 pkg set -o ports-mgmt/pkg:ports-mgmt/pkg-devel

FYI this will not currently work with portupgrade. I plan to address it
soon.

-- 
Regards,
Bryan Drewery





[releng_9 tinderbox] failure on i386/i386

2013-05-30 Thread FreeBSD Tinderbox
TB --- 2013-05-30 22:50:22 - tinderbox 2.10 running on freebsd-stable.sentex.ca
TB --- 2013-05-30 22:50:22 - FreeBSD freebsd-stable.sentex.ca 8.3-STABLE 
FreeBSD 8.3-STABLE #0: Tue Oct 16 17:37:58 UTC 2012 
mdtan...@freebsd-stable.sentex.ca:/usr/obj/usr/src/sys/server  amd64
TB --- 2013-05-30 22:50:22 - starting RELENG_9 tinderbox run for i386/i386
TB --- 2013-05-30 22:50:22 - cleaning the object tree
TB --- 2013-05-30 22:50:22 - /usr/local/bin/svn stat /src
TB --- 2013-05-30 22:50:28 - At svn revision 251168
TB --- 2013-05-30 22:50:29 - building world
TB --- 2013-05-30 22:50:29 - CROSS_BUILD_TESTING=YES
TB --- 2013-05-30 22:50:29 - MAKEOBJDIRPREFIX=/obj
TB --- 2013-05-30 22:50:29 - PATH=/usr/bin:/usr/sbin:/bin:/sbin
TB --- 2013-05-30 22:50:29 - SRCCONF=/dev/null
TB --- 2013-05-30 22:50:29 - TARGET=i386
TB --- 2013-05-30 22:50:29 - TARGET_ARCH=i386
TB --- 2013-05-30 22:50:29 - TZ=UTC
TB --- 2013-05-30 22:50:29 - __MAKE_CONF=/dev/null
TB --- 2013-05-30 22:50:29 - cd /src
TB --- 2013-05-30 22:50:29 - /usr/bin/make -B buildworld
 World build started on Thu May 30 22:50:29 UTC 2013
 Rebuilding the temporary build tree
 stage 1.1: legacy release compatibility shims
 stage 1.2: bootstrap tools
 stage 2.1: cleaning up the object tree
 stage 2.2: rebuilding the object tree
 stage 2.3: build tools
 stage 3: cross tools
 stage 4.1: building includes
 stage 4.2: building libraries
 stage 4.3: make dependencies
 stage 4.4: building everything
 World build completed on Fri May 31 01:44:34 UTC 2013
TB --- 2013-05-31 01:44:34 - generating LINT kernel config
TB --- 2013-05-31 01:44:34 - cd /src/sys/i386/conf
TB --- 2013-05-31 01:44:34 - /usr/bin/make -B LINT
TB --- 2013-05-31 01:44:34 - cd /src/sys/i386/conf
TB --- 2013-05-31 01:44:34 - /usr/sbin/config -m LINT
TB --- 2013-05-31 01:44:34 - building LINT kernel
TB --- 2013-05-31 01:44:34 - CROSS_BUILD_TESTING=YES
TB --- 2013-05-31 01:44:34 - MAKEOBJDIRPREFIX=/obj
TB --- 2013-05-31 01:44:34 - PATH=/usr/bin:/usr/sbin:/bin:/sbin
TB --- 2013-05-31 01:44:34 - SRCCONF=/dev/null
TB --- 2013-05-31 01:44:34 - TARGET=i386
TB --- 2013-05-31 01:44:34 - TARGET_ARCH=i386
TB --- 2013-05-31 01:44:34 - TZ=UTC
TB --- 2013-05-31 01:44:34 - __MAKE_CONF=/dev/null
TB --- 2013-05-31 01:44:34 - cd /src
TB --- 2013-05-31 01:44:34 - /usr/bin/make -B buildkernel KERNCONF=LINT
 Kernel build for LINT started on Fri May 31 01:44:34 UTC 2013
 stage 1: configuring the kernel
 stage 2.1: cleaning up the object tree
 stage 2.2: rebuilding the object tree
 stage 2.3: build tools
 stage 3.1: making dependencies
 stage 3.2: building everything
[...]
cc -c -O2 -pipe -fno-strict-aliasing  -std=c99  -Wall -Wredundant-decls 
-Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes -Wpointer-arith 
-Winline -Wcast-qual  -Wundef -Wno-pointer-sign -fformat-extensions  
-Wmissing-include-dirs -fdiagnostics-show-option   -nostdinc  -I. -I/src/sys 
-I/src/sys/contrib/altq -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include 
opt_global.h -fno-common -finline-limit=8000 --param inline-unit-growth=100 
--param large-function-growth=1000 -DGPROF -falign-functions=16 -DGPROF4 
-DGUPROF -fno-builtin -mno-align-long-strings -mpreferred-stack-boundary=2 
-mno-mmx -mno-sse -msoft-float -ffreestanding -fstack-protector -Werror -pg 
-mprofiler-epilogue /src/sys/dev/aha/aha_isa.c
cc -c -O2 -pipe -fno-strict-aliasing  -std=c99  -Wall -Wredundant-decls 
-Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes -Wpointer-arith 
-Winline -Wcast-qual  -Wundef -Wno-pointer-sign -fformat-extensions  
-Wmissing-include-dirs -fdiagnostics-show-option   -nostdinc  -I. -I/src/sys 
-I/src/sys/contrib/altq -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include 
opt_global.h -fno-common -finline-limit=8000 --param inline-unit-growth=100 
--param large-function-growth=1000 -DGPROF -falign-functions=16 -DGPROF4 
-DGUPROF -fno-builtin -mno-align-long-strings -mpreferred-stack-boundary=2 
-mno-mmx -mno-sse -msoft-float -ffreestanding -fstack-protector -Werror -pg 
-mprofiler-epilogue /src/sys/dev/aha/aha_mca.c
In file included from /src/sys/dev/aha/aha_mca.c:49:
/src/sys/dev/aha/ahareg.h:300: error: field 'timer' has incomplete type
/src/sys/dev/aha/aha_mca.c: In function 'aha_mca_attach':
/src/sys/dev/aha/aha_mca.c:194: error: 'aha' undeclared (first use in this 
function)
/src/sys/dev/aha/aha_mca.c:194: error: (Each undeclared identifier is reported 
only once
/src/sys/dev/aha/aha_mca.c:194: error: for each function it appears in.)
*** Error code 1

Stop in /obj/i386.i386/src/sys/LINT.
*** Error code 1

Stop in /src.
*** Error code 1

Stop in /src.
TB --- 2013-05-31 01:51:07 - WARNING: /usr/bin/make returned exit code  1 
TB --- 2013-05-31 01:51:07 - ERROR: failed to build LINT kernel
TB --- 2013-05-31 01:51:07 - 8352.52 user 914.26 system 10844.89 real


http://tinderbox.freebsd.org/tinderbox-freebsd9-build-RELENG_9-i386-i386.full

Re: SunFire X2200 ilo's bge1 DOWN/UP

2013-05-30 Thread Daniel Braniss
 On Thursday, May 30, 2013 2:44:35 am Daniel Braniss wrote:
   --/04w6evG8XlLl3ft
   Content-Type: text/x-diff; charset=us-ascii
   Content-Disposition: attachment; filename=bge.media_sts.diff
   
   Index: sys/dev/bge/if_bge.c
   ===
   --- sys/dev/bge/if_bge.c  (revision 251021)
   +++ sys/dev/bge/if_bge.c  (working copy)
   @@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct ifmediar

 BGE_LOCK(sc);

   + if ((ifp->if_flags & IFF_UP) == 0) {
   +     BGE_UNLOCK(sc);
   +     return;
   + }
 if (sc->bge_flags & BGE_FLAG_TBI) {
     ifmr->ifm_status = IFM_AVALID;
     ifmr->ifm_active = IFM_ETHER;
   
   --/04w6evG8XlLl3ft--
  after 18hs, the logs are empty!
  it seems the patch fixes the problem.
  
  now maybe it's time to hunt for who is randomly calling for bge_ifmedia_sts
  ...
 
 It could be any number of daemons that query interface state such as an
 SNMP server, ladvd, etc.
 
 If you wanted help you could modify the patch so that it does something like 
 this:
 
#include <sys/proc.h>
   if (/* test for IFF_UP */) {
       BGE_UNLOCK(sc);
       if_printf(ifp, "state queried on down interface by pid %d (%s)",
                                                                     ^-- add a \n
           curthread->td_proc->p_pid, curthread->td_proc->p_comm);
       return;
   }
 
 -- 
 John Baldwin
snmpd calls this several times a second (difficult to measure, since syslog
just says "last message repeated 22 times").
In any case, the DOWN/UP appears once every few hours, oh well.
I have now stopped the snmpd daemon; maybe there is someone else ...

thanks,
danny




Re: SunFire X2200 ilo's bge1 DOWN/UP

2013-05-30 Thread YongHyeon PYUN
On Fri, May 31, 2013 at 08:24:47AM +0300, Daniel Braniss wrote:
  On Thursday, May 30, 2013 2:44:35 am Daniel Braniss wrote:
--/04w6evG8XlLl3ft
Content-Type: text/x-diff; charset=us-ascii
Content-Disposition: attachment; filename=bge.media_sts.diff

Index: sys/dev/bge/if_bge.c
===
--- sys/dev/bge/if_bge.c(revision 251021)
+++ sys/dev/bge/if_bge.c(working copy)
@@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct 
ifmediar
 
BGE_LOCK(sc);
 
+   if ((ifp->if_flags & IFF_UP) == 0) {
+       BGE_UNLOCK(sc);
+       return;
+   }
if (sc->bge_flags & BGE_FLAG_TBI) {
    ifmr->ifm_status = IFM_AVALID;
    ifmr->ifm_active = IFM_ETHER;

--/04w6evG8XlLl3ft--
   after 18hs, the logs are empty!
   it seems the patch fixes the problem.
   
   now maybe it's time to hunt for who is randomly calling for 
   bge_ifmedia_sts
   ...
  
  It could be any number of daemons that query interface state such as an
  SNMP server, ladvd, etc.
  
  If you wanted help you could modify the patch so that it does something 
  like 
  this:
  
 #include <sys/proc.h>
  if (/* test for IFF_UP */) {
      BGE_UNLOCK(sc);
      if_printf(ifp, "state queried on down interface by pid %d (%s)",
                                                                    ^-- add a \n
          curthread->td_proc->p_pid, curthread->td_proc->p_comm);
      return;
  }
  
  -- 
  John Baldwin
 snmpd calls this several times a second (difficult to measure, since syslog
 just says "last message repeated 22 times").
 In any case, the DOWN/UP appears once every few hours, oh well.
 I have now stopped the snmpd daemon; maybe there is someone else ...

I have no idea why snmpd wants to know the media status of interfaces
that have been put into the down state. The media status resolved after
bringing up the interface may differ from the one that was seen
before.
The patch also makes dhclient think the driver got a valid link
regardless of link establishment. I guess that wouldn't be an
issue though. I'll commit the patch after some more testing.
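
To illustrate the dhclient remark, here is a small sketch of how a
client might interpret the status word after the early return. This is
an assumption about the general pattern, not a copy of dhclient's code:
mock_link_usable() and the MOCK_ constants are hypothetical, and their
values are chosen only for illustration. The idea is that a caller that
sees no "status valid" bit cannot tell whether a link exists, so it
proceeds as if the link were usable.

```c
#include <stdbool.h>

/* Hypothetical status bits; values are illustrative only. */
#define MOCK_IFM_AVALID 0x1  /* the status bits are meaningful */
#define MOCK_IFM_ACTIVE 0x2  /* a link is established */

/*
 * Hypothetical link check: trust the active bit only when the driver
 * marked the status as valid; otherwise assume the link is usable
 * rather than refuse to proceed.
 */
static bool
mock_link_usable(int ifm_status)
{
	if (ifm_status & MOCK_IFM_AVALID)
		return ((ifm_status & MOCK_IFM_ACTIVE) != 0);
	return (true);
}
```

Under this assumption, a status word of 0 (what a guarded handler leaves
behind on a down interface) is treated the same as a valid, active link.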

Thanks for reporting and testing!

 
 thanks,
   danny
 
 