On 05/11/12 03:21, Stuart Henderson wrote:
On 2012-05-11, Adam Jacob Muller<adam-openbsd-m...@adam.gs> wrote:
On 5/10/12 4:24 AM, JC)rC)mie CourrC(ges-Anglas wrote:
Please see
http://www.openbsd.org/faq/faq2.html
and
http://www.openbsd.org/report.html
Regards.
Hi,
I did do a sendbug, but i'm not sure if gnats@ goes anywhere (seems
query-pr page is broken?).
Possibly not at the moment.
In any event, this is the ddb output of ps/show registers.
I'm fairly reliably able to reproduce this, if there is any more
information I can gather, let me know.
Dmesg (no. 3 on http://www.openbsd.org/report.html) is really
important. Ideally send one from the working previous version too
which you might find in old logs (/var/log/messages*).
Sorry, that was in the sendbug, I removed it when I sent to the list.
Unfortunately/fortunately the box was up for so long prior to upgrading
that there's no dmesg and the remote syslog archives don't catch things
from so early on in the boot so I only have the 5.1 dmesg :/
OpenBSD 5.1 (GENERIC) #181: Sun Feb 12 09:35:53 MST 2012
dera...@amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC
real mem = 2146172928 (2046MB)
avail mem = 2074972160 (1978MB)
mainbus0 at root
bios0 at mainbus0: SMBIOS rev. 2.3 @ 0xfa3d0 (48 entries)
bios0: vendor Secure Computing version "A02" date 03/29/2006
bios0: Secure Computing Sidewinder G2
acpi0 at bios0: rev 0
acpi0: sleep states S0 S4 S5
acpi0: tables DSDT FACP APIC SPCR HPET MCFG
acpi0: wakeup devices PCI0(S5) PES1(S5) PEP0(S5) PXHA(S5) PEP1(S5)
PEP2(S5) PCIS(S5)
acpitimer0 at acpi0: 3579545 Hz, 24 bits
acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: Intel(R) Celeron(R) CPU 2.66GHz, 2667.13 MHz
cpu0:
FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,TM2,CNXT-ID,CX16,xTPR,NXE,LONG,LAHF
cpu0: 256KB 64b/line 4-way L2 cache
cpu0: apic clock running at 133MHz
ioapic0 at mainbus0: apid 1 pa 0xfec00000, version 20, 24 pins
ioapic0: misconfigured as apic 0, remapped to apid 1
ioapic1 at mainbus0: apid 2 pa 0xfec10000, version 20, 24 pins
ioapic1: misconfigured as apic 0, remapped to apid 2
acpihpet0 at acpi0: 14318179 Hz
acpimcfg0 at acpi0 addr 0xf0000000, bus 0-63
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 1 (PES1)
acpiprt2 at acpi0: bus 2 (PEP0)
acpiprt3 at acpi0: bus 3 (PXHA)
acpiprt4 at acpi0: bus 5 (PEP1)
acpiprt5 at acpi0: bus 6 (PEP2)
acpiprt6 at acpi0: bus 7 (PCIS)
acpicpu0 at acpi0
ipmi at mainbus0 not configured
pci0 at mainbus0 bus 0
pchb0 at pci0 dev 0 function 0 "Intel E7230 Host" rev 0x00
ppb0 at pci0 dev 1 function 0 "Intel E7230 PCIE" rev 0x00: msi
pci1 at ppb0 bus 1
ppb1 at pci0 dev 28 function 0 "Intel 82801GB PCIE" rev 0x01
pci2 at ppb1 bus 2
ppb2 at pci2 dev 0 function 0 "Intel 6702PXH PCIE-PCIX" rev 0x09
pci3 at ppb2 bus 3
ppb3 at pci3 dev 2 function 0 "IBM 133 PCIX-PCIX" rev 0x02
pci4 at ppb3 bus 4
em0 at pci4 dev 4 function 0 "Intel PRO/1000MT QP (82546EB)" rev 0x01:
apic 2 int 3, address 00:04:23:c2:9f:24
em1 at pci4 dev 4 function 1 "Intel PRO/1000MT QP (82546EB)" rev 0x01:
apic 2 int 2, address 00:04:23:c2:9f:25
em2 at pci4 dev 6 function 0 "Intel PRO/1000MT QP (82546EB)" rev 0x01:
apic 2 int 1, address 00:04:23:c2:9f:26
em3 at pci4 dev 6 function 1 "Intel PRO/1000MT QP (82546EB)" rev 0x01:
apic 2 int 0, address 00:04:23:c2:9f:27
ppb4 at pci0 dev 28 function 4 "Intel 82801G PCIE" rev 0x01
pci5 at ppb4 bus 5
bge0 at pci5 dev 0 function 0 "Broadcom BCM5721" rev 0x11, BCM5750 B1
(0x4101): apic 1 int 16, address 00:13:72:fc:ae:1b
brgphy0 at bge0 phy 1: BCM5750 10/100/1000baseT PHY, rev. 0
ppb5 at pci0 dev 28 function 5 "Intel 82801G PCIE" rev 0x01
pci6 at ppb5 bus 6
bge1 at pci6 dev 0 function 0 "Broadcom BCM5721" rev 0x11, BCM5750 B1
(0x4101): apic 1 int 17, address 00:13:72:fc:ae:1c
brgphy1 at bge1 phy 1: BCM5750 10/100/1000baseT PHY, rev. 0
ppb6 at pci0 dev 30 function 0 "Intel 82801BA Hub-to-PCI" rev 0xe1
pci7 at ppb6 bus 7
vga1 at pci7 dev 5 function 0 "XGI Technology Volari Z7" rev 0x00
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
pcib0 at pci0 dev 31 function 0 "Intel 82801GB LPC" rev 0x01
pciide0 at pci0 dev 31 function 1 "Intel 82801GB IDE" rev 0x01: DMA,
channel 0 configured to compatibility, channel 1 configured to compatibility
atapiscsi0 at pciide0 channel 0 drive 0
scsibus0 at atapiscsi0: 2 targets
cd0 at scsibus0 targ 0 lun 0: <HL-DT-ST, CD-ROM GCR-8240N, 1.10> ATAPI
5/cdrom removable
cd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
pciide0: channel 1 ignored (disabled)
pciide1 at pci0 dev 31 function 2 "Intel 82801GB SATA" rev 0x01: DMA,
channel 0 configured to native-PCI, channel 1 configured to native-PCI
pciide1: using apic 1 int 20 for native-PCI interrupt
wd0 at pciide1 channel 0 drive 0: <ST31500341AS>
wd0: 16-sector PIO, LBA48, 1430799MB, 2930277168 sectors
wd0(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 6
wd1 at pciide1 channel 1 drive 0: <ST31500341AS>
wd1: 16-sector PIO, LBA48, 1430799MB, 2930277168 sectors
wd1(pciide1:1:0): using PIO mode 4, Ultra-DMA mode 6
ichiic0 at pci0 dev 31 function 3 "Intel 82801GB SMBus" rev 0x01: polling
iic0 at ichiic0
adt0 at iic0 addr 0x2e: lm96000 rev 0x68
spdmem0 at iic0 addr 0x50: 1GB DDR2 SDRAM ECC PC2-4200CL5
spdmem1 at iic0 addr 0x52: 1GB DDR2 SDRAM ECC PC2-4200CL5
isa0 at pcib0
isadma0 at isa0
com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
com0: console
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pms0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pcppi0 at isa0 port 0x61
spkr0 at pcppi0
mtrr: Pentium Pro MTRR support
vscsi0 at root
scsibus1 at vscsi0: 256 targets
softraid0 at root
scsibus2 at softraid0: 256 targets
root on wd0a (54087ac4cbd9c0f8.a) swap on wd0b dump on wd0b
-=[~]=- -=[Thu May 10]=- -=[21:30:46]=-
[root@charon]# ifconfig em2 up
-=[~]=- -=[Thu May 10]=- -=[21:30:49]=-
[root@charon]# uptime
9:30PM up 2 mins, 1 user, load averages: 1.38, 0.50, 0.19
-=[~]=- -=[Thu May 10]=- -=[21:30:52]=-
[root@charon]# ifconfig em3 up
^EB^EStopped at Debugger+0x5: leave
ddb> show panic
the kernel did not panic
Happens for em3 and not em2? Or does this vary?
So the actual trigger behaviour is a bit hard to track, the only thing I
can say for 100% sure is that some combination of ifconfig up/down/up
(sometimes just two ups after boot) causes a hang.
This does however appear to require some combination of multiple
interfaces being up. For example, if I boot then bring up em3 alone,
nothing happens. If I boot then bring up em2/em3 up, i'm almost
guaranteed a hang! (the inverse is true as well, em3 then em2 also hangs
-- ordering seems unimportant).
I figured out / fixed the issue with savecore (/var too full) so I do
have a core dump from the last crash, if that's useful.
Thanks for your time and help,
-Adam
I won't trim the quote below so you can reply to the whole lot
and keep the information together (but no more text from me below :)
ddb> ps
PID PPID PGRP UID S FLAGS WAIT COMMAND
*31458 2782 31458 0 7 0 ifconfig
2782 1 2782 0 3 0x80 wait bash
9835 1 9835 0 3 0x80 ttyin getty
28249 1 28249 0 3 0x80 ttyin getty
1429 1 1429 0 3 0x80 ttyin getty
12859 1 12859 0 3 0x80 ttyin getty
15689 1 15689 0 3 0x80 ttyin getty
21720 1 21720 0 3 0x80 select cron
22103 15791 15791 0 3 0x80 nanosleep perl
15791 1 15791 0 3 0x80 poll collectd
17486 1711 1711 77 3 0x80 poll dhcpd
32181 15104 27517 90 3 0x80 kqread ospf6d
22133 15104 27517 90 3 0x80 kqread ospf6d
4380 27517 27517 0 3 0x80 piperd tee
15104 27517 27517 0 2 0x80 ospf6d
27517 11636 27517 0 3 0x88 pause sh
7865 22621 4001 83 3 0x80 poll ntpd
22621 4001 4001 83 3 0x80 poll ntpd
11636 1 11636 0 3 0x80 select screen
1711 22145 1711 77 3 0x80 poll dhcpd
4001 26301 4001 0 3 0x80 poll ntpd
22145 1 22145 0 3 0x80 select screen
20753 11069 20753 0 3 0x80 netcon php
11069 1 11069 0 3 0x80 select screen
26301 1 26301 0 3 0x80 select screen
23181 1 23181 556 3 0x80 select nrpe
13812 30502 30502 91 2 0x80 snmpd
30502 23345 30502 0 3 0x80 kqread snmpd
24114 6566 24114 0 3 0x80 nanosleep php
24896 12320 24896 0 3 0x80 nanosleep php
30324 26717 30324 0 3 0x80 nanosleep php
23345 1 23345 0 3 0x80 select screen
2939 17720 2939 0 3 0x80 nanosleep php
26717 1 26717 0 3 0x80 select screen
12320 1 12320 0 3 0x80 select screen
6566 1 6566 0 3 0x80 select screen
17720 1 17720 0 3 0x80 select screen
20349 31546 20349 0 3 0x80 poll syslog-ng
31546 1 13174 0 3 0x80 wait syslog-ng
22116 1 22116 99 3 0x80 poll sndiod
12536 1 12536 0 3 0x80 select inetd
21142 13495 13495 507 3 0x80 kqread qmgr
16697 13495 13495 507 3 0x80 kqread pickup
13495 1 13495 0 3 0x80 kqread master
17383 15889 15889 75 3 0x80 poll bgpd
2491 15889 15889 75 3 0x80 poll bgpd
15889 1 15889 0 2 0x80 bgpd
30554 15678 15678 90 3 0x80 kqread ospf6d
19811 15678 15678 90 3 0x80 kqread ospf6d
15678 1 15678 0 2 0x80 ospf6d
29524 1 29524 0 3 0x80 select sshd
26501 5231 5231 70 3 0x80 select named
5231 1 5231 0 3 0x80 netio named
21867 29781 29781 74 3 0x80 bpf pflogd
29781 1 29781 0 3 0x80 netio pflogd
9811 2867 2867 73 3 0x80 poll syslogd
2867 1 2867 0 3 0x80 netio syslogd
11 0 0 0 3 0x100200 aiodoned aiodoned
10 0 0 0 3 0x100200 syncer update
9 0 0 0 3 0x100200 cleaner cleaner
8 0 0 0 3 0x100200 reaper reaper
7 0 0 0 3 0x100200 pgdaemon pagedaemon
6 0 0 0 3 0x100200 bored crypto
5 0 0 0 3 0x100200 pftm pfpurge
4 0 0 0 3 0x100200 acpi0 acpi0
3 0 0 0 3 0x100200 bored syswq
2 0 0 0 3 0x40100200 idle0
1 0 1 0 3 0x80 wait init
0 -1 0 0 3 0x200 scheduler swapper
ddb> show registers
ds 0
es 0
fs 0x11
gs 0
rdi 0
rsi 0x3f8
rbp 0xffff80002140f838
rbx 0xf9
rdx 0x3f8
rcx 0x282
rax 0
r8 0xffffffff80d5fc30 x86_soft_intrs+0x50
r9 0x1
r10 0
r11 0xffffffff802bee60 pool_page_alloc
r12 0xffff800000168110
r13 0xffff800000164000
r14 0xffff800000144dc0
r15 0x3f8
rip 0xffffffff80431a15 Debugger+0x5
cs 0x8
rflags 0x202
rsp 0xffff80002140f838
ss 0x10
Debugger+0x5: leave
ddb> trace
Debugger() at Debugger+0x5
comintr() at comintr+0x268
Xintr_ioapic_edge4() at Xintr_ioapic_edge4+0xe0
--- interrupt ---
Bad frame pointer: 0xffff80002140faa0
end trace frame: 0xffff80002140faa0, count: -3
Xdoreti:
ddb> boot dump
^T
\
^T^TStopped at Debugger+0x5: leave
ddb>
dumping to dev 0,1 offset 523047
dump 2047 2046 2045 2044 2043 2042 2041 2040 2039 2038 2037 2036 2035 2034 2033
2032 2031 2030 2029 2028 2027 2026 2025 2024 2023 2022 2021 2020 2019 2010
rebooting...