Need help reporting kernel panic

2008-04-14 Thread Dag Richards

Understand that I am not (quite) reporting a panic without a ps and trace.

I had kernel panic this weekend on my standby vpn firewall, this is the 
third time this has happened in the last 300 days or so, always with the 
same panic.


I run with ddb.log=1
I ran ps and trace expecting the output to be put in a log erm ... 
'somewhere'.


Then I performed a boot dump, I have
drwxrwx---   2 root  wheel 512 Apr 14 07:53 ./
drwxr-xr-x  25 root  wheel 512 Aug 28  2007 ../
-rw---   1 root  wheel   2 Apr 14 07:51 bounds
-rw---   1 root  wheel 6229740 Apr 14 07:53 bsd.0
-rw---   1 root  wheel  1048568340 Apr 14 07:53 bsd.0.core
-rw-r--r--   1 root  wheel   5 Sep 10  2005 minfree

But am clueless ... yes you are thinking it, I said for you.
Clueless about where to find the output from my ps and trace.
My guess at this point is that it is gone.


So what should I have done? Or where do I expect to find this output?
Besides logging in from the console server and getting a screen scrape 
of the output,  I could not get the ilo to respond to me.



I have included as much of the panic message as I have and the dmesg 
from the system in case anyone is curious about the system in question,

as well as a ps of what is normally left running on the system.






panic: pmap_pinit: kernel_map out of virtual space!
Stopped at Debugger + 0x4: leave

--




OpenBSD 4.2 (GENERIC) #375: Tue Aug 28 10:38:44 MDT 2007
[EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Xeon(TM) CPU 3.00GHz (GenuineIntel 686-class) 3.01 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,CNXT-ID,CX16,xTPR

real mem  = 1073258496 (1023MB)
avail mem = 1030156288 (982MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, 
SMBIOS rev. 2.3 @ 0xec000 (54 entries)

bios0: vendor HP version P52 date 04/14/2005
bios0: HP ProLiant DL360 G4
pcibios0 at bios0: rev 2.1 @ 0xf/0x2000
pcibios0: PCI BIOS has 7 Interrupt Routing table entries
pcibios0: PCI Interrupt Router at 000:31:0 (Intel 6300ESB LPC rev 0x00)
pcibios0: PCI bus #13 is the last bus
bios0: ROM list: 0xc/0x8000 0xc8000/0x4000! 0xcc000/0x1800 
0xcd800/0x1600 0xee000/0x2000!

acpi at mainbus0 not configured
cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 Intel E7520 MCH rev 0x0c
ppb0 at pci0 dev 2 function 0 Intel MCH PCIE rev 0x0c
pci1 at ppb0 bus 13
ppb1 at pci0 dev 4 function 0 Intel MCH PCIE rev 0x0c
pci2 at ppb1 bus 6
ppb2 at pci2 dev 0 function 0 Intel PCIE-PCIE rev 0x09
pci3 at ppb2 bus 7
ppb3 at pci2 dev 0 function 2 Intel PCIE-PCIE rev 0x09
pci4 at ppb3 bus 10
bge0 at pci4 dev 1 function 0 Broadcom BCM5703 Alt rev 0x10, BCM5703 
B0 (0x1100): irq 5, address 00:10:18:0c:44:6b

brgphy0 at bge0 phy 1: BCM5703 10/100/1000baseT PHY, rev. 3
ppb4 at pci0 dev 6 function 0 Intel MCH PCIE rev 0x0c
pci5 at ppb4 bus 3
ppb5 at pci0 dev 28 function 0 Intel 6300ESB PCIX rev 0x02
pci6 at ppb5 bus 2
ciss0 at pci6 dev 1 function 0 Compaq Smart Array 64xx rev 0x01: irq 5
ciss0: 1 LD, HW rev 1, FW 2.36/2.36
scsibus0 at ciss0: 1 targets
sd0 at scsibus0 targ 0 lun 0: HP, LOGICAL VOLUME, 2.36 SCSI0 0/direct 
fixed

sd0: 34727MB, 4427 cyl, 255 head, 63 sec, 512 bytes/sec, 71122560 sec total
bge1 at pci6 dev 2 function 0 Broadcom BCM5704C rev 0x10, BCM5704 B0 
(0x2100): irq 5, address 00:14:38:4b:ef:fe

brgphy1 at bge1 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0
bge2 at pci6 dev 2 function 1 Broadcom BCM5704C rev 0x10, BCM5704 B0 
(0x2100): irq 5, address 00:14:38:4b:ef:fd

brgphy2 at bge2 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0
uhci0 at pci0 dev 29 function 0 Intel 6300ESB USB rev 0x02: irq 5
uhci1 at pci0 dev 29 function 1 Intel 6300ESB USB rev 0x02: irq 5
Intel 6300ESB WDT rev 0x02 at pci0 dev 29 function 4 not configured
Intel 6300ESB APIC rev 0x02 at pci0 dev 29 function 5 not configured
ehci0 at pci0 dev 29 function 7 Intel 6300ESB USB rev 0x02: irq 5
usb0 at ehci0: USB revision 2.0
uhub0 at usb0: Intel EHCI root hub, rev 2.00/1.00, addr 1
ppb6 at pci0 dev 30 function 0 Intel 82801BA AGP rev 0x0a
pci7 at ppb6 bus 1
vga1 at pci7 dev 3 function 0 ATI Rage XL rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
Compaq iLO rev 0x01 at pci7 dev 4 function 0 not configured
Compaq iLO rev 0x01 at pci7 dev 4 function 2 not configured
ichpcib0 at pci0 dev 31 function 0 Intel 6300ESB LPC rev 0x02: 24-bit 
timer at 3579545Hz
pciide0 at pci0 dev 31 function 1 Intel 6300ESB IDE rev 0x02: DMA, 
channel 0 configured to compatibility, channel 1 configured to compatibility

atapiscsi0 at pciide0 channel 0 drive 0
scsibus1 at atapiscsi0: 2 targets
cd0 at scsibus1 targ 0 lun 0: COMPAQ, CD-ROM SN-124, N104 SCSI0 
5/cdrom removable

cd0(pciide0:0:0): using PIO mode 4

Re: Need help reporting kernel panic

2008-04-14 Thread Josh Grosse
On Mon, 14 Apr 2008 08:57:55 -0700, Dag Richards wrote

 Then I performed a boot dump, I have
 drwxrwx---   2 root  wheel 512 Apr 14 07:53 ./
 drwxr-xr-x  25 root  wheel 512 Aug 28  2007 ../
 -rw---   1 root  wheel   2 Apr 14 07:51 bounds
 -rw---   1 root  wheel 6229740 Apr 14 07:53 bsd.0
 -rw---   1 root  wheel  1048568340 Apr 14 07:53 bsd.0.core
 -rw-r--r--   1 root  wheel   5 Sep 10  2005 minfree
 
 But am clueless ... yes you are thinking it, I said for you.
 Clueless about where to find the output from my ps and trace.
 My guess at this point is that it is gone.

Your ddb console ouput should be in the dmesg contained within the bsd.0.core
file.  You will want to use the dmesg command with -M and -N operands.  See
the dmesg(1) man page.

 So what should I have done? Or where do I expect to find this output?
 Besides logging in from the console server and getting a screen 
 scrape of the output,  I could not get the ilo to respond to me.

A great place to start is the crash(8) man page; I've found it a helpful
reference.



Re: Need help reporting kernel panic

2008-04-14 Thread Dag Richards

Josh Grosse wrote:

On Mon, 14 Apr 2008 08:57:55 -0700, Dag Richards wrote


Then I performed a boot dump, I have
drwxrwx---   2 root  wheel 512 Apr 14 07:53 ./
drwxr-xr-x  25 root  wheel 512 Aug 28  2007 ../
-rw---   1 root  wheel   2 Apr 14 07:51 bounds
-rw---   1 root  wheel 6229740 Apr 14 07:53 bsd.0
-rw---   1 root  wheel  1048568340 Apr 14 07:53 bsd.0.core
-rw-r--r--   1 root  wheel   5 Sep 10  2005 minfree

But am clueless ... yes you are thinking it, I said for you.
Clueless about where to find the output from my ps and trace.
My guess at this point is that it is gone.


Your ddb console ouput should be in the dmesg contained within the bsd.0.core
file.  You will want to use the dmesg command with -M and -N operands.  See
the dmesg(1) man page.


So what should I have done? Or where do I expect to find this output?
Besides logging in from the console server and getting a screen 
scrape of the output,  I could not get the ilo to respond to me.


A great place to start is the crash(8) man page; I've found it a helpful
reference.



Thanks, I had read crash. I started to question my comprehension as the 
result I was getting looked like this:


hsdcert1:root:/root #dmesg -N /var/crash/bsd.0 -M /var/crash/bsd.0.core
dmesg: kvm_read:  (d0932000)
hsdcert1:root:/root #ps -N /var/crash/bsd.0 -M /var/crash/bsd.0.core -O 
paddr

  PID  PADDR TT  STAT   TIME COMMAND
 3257 d773781c p0- I   0:00.00 (tcpdump)
12147 d7ae0564 C0  Is+ 0:00.00 (ksh)
21336 d77ee970 C1  Is+ 0:08.00 (getty)
22401 d773700c C2  Is+ 0:08.00 (getty)
25004 d7737164 C3  Is+ 0:08.00 (getty)
 3004 d77372bc C5  Is+ 0:08.00 (getty)

So it would seem that my ps and trace did not get appended to the file.
Or maybe ... appended you say?

I did the ps, then trace, then the boot dump.

So did the dump then overwrite my ps and trace?

Boot dump seems like it should be the last thing run, as it ends with a 
boot, right?

Can I just perform a dump? Guess I will find out is a few weeks.