Re: SunFire X2200 ilo's bge1 DOWN/UP
--/04w6evG8XlLl3ft Content-Type: text/plain; charset=us-ascii Content-Disposition: inline On Tue, May 28, 2013 at 09:55:24AM +0300, Daniel Braniss wrote: On Tue, May 28, 2013 at 09:28:00AM +0300, Daniel Braniss wrote: On Mon, May 27, 2013 at 10:59:28AM +0300, Daniel Braniss wrote: On Fri, May 24, 2013 at 05:31:13PM +0300, Daniel Braniss wrote: hi, after upgrading to 9.1-stable, this particular hardware - SunFire X2200, Show me dmesg(bge(4) and brgphy(4) only) and 'ifconfig bge1' output. bge0: Broadcom NetXtreme Gigabit Ethernet Controller, ASIC rev. 0x009003 mem 0xfdff-0xfdff,0xfdfe-0xfdfe irq 17 at device 4.0 on pci6 bge0: CHIP ID 0x9003; ASIC REV 0x09; CHIP REV 0x90; PCI-X 133 MHz miibus2: MII bus on bge0 brgphy0: BCM5714 1000BASE-T media interface PHY 1 on miibus2 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow bge0: Ethernet address: 00:1b:24:5d:5b:bd bge1: Broadcom NetXtreme Gigabit Ethernet Controller, ASIC rev. 0x009003 mem 0xfdfc-0xfdfc,0xfdfb-0xfdfb irq 18 at device 4.1 on pci6 bge1: CHIP ID 0x9003; ASIC REV 0x09; CHIP REV 0x90; PCI-X 133 MHz miibus3: MII bus on bge1 brgphy1: BCM5714 1000BASE-T media interface PHY 1 on miibus3 brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow bge1: Ethernet address: 00:1b:24:5d:5b:be sf-10 ifconfig bge1 bge1: flags=8802BROADCAST,SIMPLEX,MULTICAST metric 0 mtu 1500 options=8009bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,LINKSTA TE ether 00:1b:24:5d:5b:be nd6 options=21PERFORMNUD,AUTO_LINKLOCAL media: Ethernet autoselect (100baseTX full-duplex) status: active Because bge1 is not UP, I wonder how you get link UP/DOWN events. Do you have some network script run by cron? no scripts. this port is shared with the ILO/IPMI, and back in March you fixed a problem that it was hanging soon after it was initialized by the driver, (r248226 - but I'm not sure if it was ever MFC'ed). It was MFCed. Initialy I thought it could be caused by connections to it from other hosts (either via the web, or ssh) so I killed them, but it didn't help. without that patch the connection fails, and I don't see any DOWN/UP. Could you check how many number of interrupts you get from bge1? Ideally you shouldn't get any interrupts for bge1. it's not even mentioned :-) sf-04 vmstat -i interrupt total rate irq3: uart1 964 0 irq4: uart06 0 irq14: ata0 227354 0 irq17: bge0 1021981 2 irq21: ohci0 28 0 irq22: ehci0 2 0 irq23: atapci1293228 0 cpu0:timer 383244076 1124 cpu1:timer 2225144 6 cpu2:timer 2056087 6 cpu3:timer 2093943 6 Total 391162813 1147 Then the only way link UP/DOWN event could be generated for DOWN interface would be invocation of media status query (i.e. ifconfig -a) triggered by an external application. Most drivers I touched check IFF_UP flag before poking media status register. However I'm not sure you're seeing this issue because you do not use any network script run by cron. Anyway, try attached patch and let me know whether it makes any difference. is toggeling bge1 DOWN/UP every few hours, this port is being used by the ILO. To check, I upgraded another identical host, and the same problem appears. What is the last known working revision? I have no idea, but I have older versions, and ill start from the oldets (9.1-prerelease), but it will take time, since it takes hours till it happens. ok. --/04w6evG8XlLl3ft Content-Type: text/x-diff; charset=us-ascii Content-Disposition: attachment; filename=bge.media_sts.diff Index: sys/dev/bge/if_bge.c === --- sys/dev/bge/if_bge.c (revision 251021) +++ sys/dev/bge/if_bge.c (working copy) @@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct ifmediar BGE_LOCK(sc); + if ((ifp-if_flags IFF_UP) == 0) { +
Re: SunFire X2200 ilo's bge1 DOWN/UP
On Thursday, May 30, 2013 2:44:35 am Daniel Braniss wrote: --/04w6evG8XlLl3ft Content-Type: text/x-diff; charset=us-ascii Content-Disposition: attachment; filename=bge.media_sts.diff Index: sys/dev/bge/if_bge.c === --- sys/dev/bge/if_bge.c(revision 251021) +++ sys/dev/bge/if_bge.c(working copy) @@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct ifmediar BGE_LOCK(sc); + if ((ifp-if_flags IFF_UP) == 0) { + BGE_UNLOCK(sc); + return; + } if (sc-bge_flags BGE_FLAG_TBI) { ifmr-ifm_status = IFM_AVALID; ifmr-ifm_active = IFM_ETHER; --/04w6evG8XlLl3ft-- after 18hs, the logs are empty! it seems the patch fixes the problem. now maybe it's time to hunt for who is randomly calling for bge_ifmedia_sts ... It could be any number of daemons that query interface state such as an SNMP server, ladvd, etc. If you wanted help you could modify the patch so that it does something like this: if (/* test for IFF_UP */) { BGE_UNLOCK(sc); if_printf(ifp, state queried on down interface by pid %d (%s), curthread-td_proc-p_pid, curthread-td_proc-p_comm); return; } -- John Baldwin ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org
Re: System doesn't dump
On Wednesday, May 29, 2013 2:41:38 am Dominic Fandrey wrote: I have a number of actions that reliably panic the system, such as performing shutdown -p (yes I'm booting into an inconsistent file system every time). Both with my notebook and my workstation. However I cannot get the system to dump. dumpdir=/var/crash and I've tried ada0s2b, /dev/ada0s2b, label/5swap, /dev/label/5swap and AUTO for dumpdev to no avail. The swap partition is 16g, the machines have 8g RAM and there's plenty of hard disk space available for /var/crash. I'm looking for that secret, undocumented trigger, that makes the system dump if a panic occurs. Once upon a time dumping just worked if the swap partition was large enough. I miss those olden days. Does /dev/dumpdev exist and point to your swap partition after booting? -- John Baldwin ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org
[HEADSUP] New pkg-devel 1.1.0 beta1
Hi, The pkg developement team is proud to announce the new 1.1.0 beta1 release of pkg. Here is the list of new features that happened in pkg 1.1: - new simpler and more reliable solver - shared libraries are now always tracked - ssh:// is supported as a protocol to distribute packages (needs pkg 1.1+ on the server hosting the packages) - multirepository is no longer considered experimental and works by default. - incremental update of the catalog (only if the repository was created by pkg 1.1+) - simplification of the public API - stabilisation of the public API (we will now try to keep it stable and if change are needed there will be deprecation time before removal of some old functions) - new experimental pkg convert (can convert from and to legacy pkg database) pkg2ng now uses pkg convert (still recommanded to use pkg2ng) - new pkg lock/unlock to prevent any manipulation of a given package (no upgrade,delete,etc) - improved UI (now you can see the progress of an upgrade what is left to be done) - new pkg annotation to allow one to add annotations (free form key/value) to a package) - pkg audit is now able to directly parse the vuxml native format and not only the compact version - pkg -vv now shows all available options and their current settings - pkg -vvv now shows a description of all the available options - pkg info now automatically considers the query as globbing if * is in the requested pattern - new hook plugin interface (allows users to create hooks that get called at anytime during and upgrade/installation/deletion of a package) - new cmd plugin interface (allows users to create new sub command available for pkg) - pkg register can now register a port installation in the legacy database format - repository can be defined in simple yaml files Internal: - massive usage of hash tables (uthash), which simplifies a lot of the code, and improves performances - lots of optimisation in plist and manifest parsing - lots of optimisation in loading packages (mmap used when possible) - lots of cleanup in memory usage - regression test framework is now ready (using atf) regression test are slowly being added and populated. To use this new version: Ports users (or in building factories: poudriere/tinderbox): Add WITH_PKGNG=devel to your make.conf pkg set -o ports-mgmt/pkg:ports-mgmt/pkg-devel Binary package users, if the remote repository is providing pkg 1.1: pkg set -o ports-mgmt/pkg:ports-mgmt/pkg-devel pkg upgrade Note that pkg 1.1 can use a repository created for pkg 1.0 and vis versa. Huge thanks to all the people that have contributed to the pkg developement: - may that be by code - documentation - bug report - feedback - ideas List of people who contributed code: Baptiste Daroussin, Matthew Seaman, Bryan Drewery, Vsevolod Stakhov, Marin Atanasov Nikolov, Alexandre Perrin, Romain Tartière, Julien Laffaye, Glen Barber, John Marino, Alex Kozlov, Roman Naumann, Sofian Brabez, Alberto Villa, Will Andrews, Eitan Adler, Dan McGregor, namor, niamtokik, Arthur Gautier, Garrett Cooper, Andrew Turner, Jeremy Chadwick, Hajimu UMEMOTO, Mark Lokowich, Eygene Ryabinkin, Pietro Cerutti, Rolf Grossmann, Ed Schouten, Dimitry Andric, David Forsythe, Stefan Grundmann, Craig Rodrigues, Antoine Brodin, Andrey Zonov, Joel Dahl Stats between 1.0 and 1.1: 287 files changed, 63418 insertions(+), 18763 deletions(-) 1198 commits regards, Bapt pgp5ZK50yeDyl.pgp Description: PGP signature
Re: [HEADSUP] New pkg-devel 1.1.0 beta1
On 5/30/2013 10:20 AM, Baptiste Daroussin wrote: Ports users (or in building factories: poudriere/tinderbox): Add WITH_PKGNG=devel to your make.conf pkg set -o ports-mgmt/pkg:ports-mgmt/pkg-devel FYI this will not currently work with portupgrade. I plan to address it soon. -- Regards, Bryan Drewery signature.asc Description: OpenPGP digital signature
[releng_9 tinderbox] failure on i386/i386
TB --- 2013-05-30 22:50:22 - tinderbox 2.10 running on freebsd-stable.sentex.ca TB --- 2013-05-30 22:50:22 - FreeBSD freebsd-stable.sentex.ca 8.3-STABLE FreeBSD 8.3-STABLE #0: Tue Oct 16 17:37:58 UTC 2012 mdtan...@freebsd-stable.sentex.ca:/usr/obj/usr/src/sys/server amd64 TB --- 2013-05-30 22:50:22 - starting RELENG_9 tinderbox run for i386/i386 TB --- 2013-05-30 22:50:22 - cleaning the object tree TB --- 2013-05-30 22:50:22 - /usr/local/bin/svn stat /src TB --- 2013-05-30 22:50:28 - At svn revision 251168 TB --- 2013-05-30 22:50:29 - building world TB --- 2013-05-30 22:50:29 - CROSS_BUILD_TESTING=YES TB --- 2013-05-30 22:50:29 - MAKEOBJDIRPREFIX=/obj TB --- 2013-05-30 22:50:29 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2013-05-30 22:50:29 - SRCCONF=/dev/null TB --- 2013-05-30 22:50:29 - TARGET=i386 TB --- 2013-05-30 22:50:29 - TARGET_ARCH=i386 TB --- 2013-05-30 22:50:29 - TZ=UTC TB --- 2013-05-30 22:50:29 - __MAKE_CONF=/dev/null TB --- 2013-05-30 22:50:29 - cd /src TB --- 2013-05-30 22:50:29 - /usr/bin/make -B buildworld World build started on Thu May 30 22:50:29 UTC 2013 Rebuilding the temporary build tree stage 1.1: legacy release compatibility shims stage 1.2: bootstrap tools stage 2.1: cleaning up the object tree stage 2.2: rebuilding the object tree stage 2.3: build tools stage 3: cross tools stage 4.1: building includes stage 4.2: building libraries stage 4.3: make dependencies stage 4.4: building everything World build completed on Fri May 31 01:44:34 UTC 2013 TB --- 2013-05-31 01:44:34 - generating LINT kernel config TB --- 2013-05-31 01:44:34 - cd /src/sys/i386/conf TB --- 2013-05-31 01:44:34 - /usr/bin/make -B LINT TB --- 2013-05-31 01:44:34 - cd /src/sys/i386/conf TB --- 2013-05-31 01:44:34 - /usr/sbin/config -m LINT TB --- 2013-05-31 01:44:34 - building LINT kernel TB --- 2013-05-31 01:44:34 - CROSS_BUILD_TESTING=YES TB --- 2013-05-31 01:44:34 - MAKEOBJDIRPREFIX=/obj TB --- 2013-05-31 01:44:34 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2013-05-31 01:44:34 - SRCCONF=/dev/null TB --- 2013-05-31 01:44:34 - TARGET=i386 TB --- 2013-05-31 01:44:34 - TARGET_ARCH=i386 TB --- 2013-05-31 01:44:34 - TZ=UTC TB --- 2013-05-31 01:44:34 - __MAKE_CONF=/dev/null TB --- 2013-05-31 01:44:34 - cd /src TB --- 2013-05-31 01:44:34 - /usr/bin/make -B buildkernel KERNCONF=LINT Kernel build for LINT started on Fri May 31 01:44:34 UTC 2013 stage 1: configuring the kernel stage 2.1: cleaning up the object tree stage 2.2: rebuilding the object tree stage 2.3: build tools stage 3.1: making dependencies stage 3.2: building everything [...] cc -c -O2 -pipe -fno-strict-aliasing -std=c99 -Wall -Wredundant-decls -Wnested-externs -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Winline -Wcast-qual -Wundef -Wno-pointer-sign -fformat-extensions -Wmissing-include-dirs -fdiagnostics-show-option -nostdinc -I. -I/src/sys -I/src/sys/contrib/altq -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common -finline-limit=8000 --param inline-unit-growth=100 --param large-function-growth=1000 -DGPROF -falign-functions=16 -DGPROF4 -DGUPROF -fno-builtin -mno-align-long-strings -mpreferred-stack-boundary=2 -mno-mmx -mno-sse -msoft-float -ffreestanding -fstack-protector -Werror -pg -mprofiler-epilogue /src/sys/dev/aha/aha_isa.c cc -c -O2 -pipe -fno-strict-aliasing -std=c99 -Wall -Wredundant-decls -Wnested-externs -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Winline -Wcast-qual -Wundef -Wno-pointer-sign -fformat-extensions -Wmissing-include-dirs -fdiagnostics-show-option -nostdinc -I. -I/src/sys -I/src/sys/contrib/altq -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common -finline-limit=8000 --param inline-unit-growth=100 --param large-function-growth=1000 -DGPROF -falign-functions=16 -DGPROF4 -DGUPROF -fno-builtin -mno-align-long-strings -mpreferred-stack-boundary=2 -mno-mmx -mno-sse -msoft-float -ffreestanding -fstack-protector -Werror -pg -mprofiler-epilogue /src/sys/dev/aha/aha_mca.c In file included from /src/sys/dev/aha/aha_mca.c:49: /src/sys/dev/aha/ahareg.h:300: error: field 'timer' has incomplete type /src/sys/dev/aha/aha_mca.c: In function 'aha_mca_attach': /src/sys/dev/aha/aha_mca.c:194: error: 'aha' undeclared (first use in this function) /src/sys/dev/aha/aha_mca.c:194: error: (Each undeclared identifier is reported only once /src/sys/dev/aha/aha_mca.c:194: error: for each function it appears in.) *** Error code 1 Stop in /obj/i386.i386/src/sys/LINT. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. TB --- 2013-05-31 01:51:07 - WARNING: /usr/bin/make returned exit code 1 TB --- 2013-05-31 01:51:07 - ERROR: failed to build LINT kernel TB --- 2013-05-31 01:51:07 - 8352.52 user 914.26 system 10844.89 real http://tinderbox.freebsd.org/tinderbox-freebsd9-build-RELENG_9-i386-i386.full ___ freebsd-stable@freebsd.org mailing list
Re: SunFire X2200 ilo's bge1 DOWN/UP
On Thursday, May 30, 2013 2:44:35 am Daniel Braniss wrote: --/04w6evG8XlLl3ft Content-Type: text/x-diff; charset=us-ascii Content-Disposition: attachment; filename=bge.media_sts.diff Index: sys/dev/bge/if_bge.c === --- sys/dev/bge/if_bge.c (revision 251021) +++ sys/dev/bge/if_bge.c (working copy) @@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct ifmediar BGE_LOCK(sc); + if ((ifp-if_flags IFF_UP) == 0) { + BGE_UNLOCK(sc); + return; + } if (sc-bge_flags BGE_FLAG_TBI) { ifmr-ifm_status = IFM_AVALID; ifmr-ifm_active = IFM_ETHER; --/04w6evG8XlLl3ft-- after 18hs, the logs are empty! it seems the patch fixes the problem. now maybe it's time to hunt for who is randomly calling for bge_ifmedia_sts ... It could be any number of daemons that query interface state such as an SNMP server, ladvd, etc. If you wanted help you could modify the patch so that it does something like this: #include sys/proc.h if (/* test for IFF_UP */) { BGE_UNLOCK(sc); if_printf(ifp, state queried on down interface by pid %d (%s), --| add a \n curthread-td_proc-p_pid, curthread-td_proc-p_comm); return; } -- John Baldwin snmpd call this several times a second, (difficult to measeure since sysolog just says last message repeated 22 times in any case, the DOWN/UP appears once every few hours, oh well. I have now stopped the snmpd daemon, maybe there is someone else ... thanks, danny ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org
Re: SunFire X2200 ilo's bge1 DOWN/UP
On Fri, May 31, 2013 at 08:24:47AM +0300, Daniel Braniss wrote: On Thursday, May 30, 2013 2:44:35 am Daniel Braniss wrote: --/04w6evG8XlLl3ft Content-Type: text/x-diff; charset=us-ascii Content-Disposition: attachment; filename=bge.media_sts.diff Index: sys/dev/bge/if_bge.c === --- sys/dev/bge/if_bge.c(revision 251021) +++ sys/dev/bge/if_bge.c(working copy) @@ -5583,6 +5583,10 @@ bge_ifmedia_sts(struct ifnet *ifp, struct ifmediar BGE_LOCK(sc); + if ((ifp-if_flags IFF_UP) == 0) { + BGE_UNLOCK(sc); + return; + } if (sc-bge_flags BGE_FLAG_TBI) { ifmr-ifm_status = IFM_AVALID; ifmr-ifm_active = IFM_ETHER; --/04w6evG8XlLl3ft-- after 18hs, the logs are empty! it seems the patch fixes the problem. now maybe it's time to hunt for who is randomly calling for bge_ifmedia_sts ... It could be any number of daemons that query interface state such as an SNMP server, ladvd, etc. If you wanted help you could modify the patch so that it does something like this: #include sys/proc.h if (/* test for IFF_UP */) { BGE_UNLOCK(sc); if_printf(ifp, state queried on down interface by pid %d (%s), --| add a \n curthread-td_proc-p_pid, curthread-td_proc-p_comm); return; } -- John Baldwin snmpd call this several times a second, (difficult to measeure since sysolog just says last message repeated 22 times in any case, the DOWN/UP appears once every few hours, oh well. I have now stopped the snmpd daemon, maybe there is someone else ... I have no idea why snmpd wants to know media status for interfaces that are put into down state. The media status resolved after bringing up the interface may be different one that was seen before. The patch also makes dhclient think driver got a valid link regardless of link establishment. I guess that wouldn't be issue though. I'll commit the patch after some more testing. Thanks for reporting and testing! thanks, danny ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org