Hello,

I have a strange problem: A machine I have to tend locks up once or
twice a day.


The situation:

 LAN --- box1 (OpenBSD 4.0-stable)
            |          |
            |          |
            |          |
          pppoe0     pppoe1
          1.2.3.4    1.2.3.5

          --- The Internet, 6.7.8.9 ---

           |          |
           |          |
           box2       box3
           1.2.3.6    1.2.3.7

On box1, I have a default route to the Internet at large over pppoe0
and two host routes over pppoe1, but all machines use the same remote
gateway (ie, using the same concentrator on the remote side). "-stable"
on box1 means as of 2007-02-16 whereas on the other two machines, it
means kernels as of 2007-01-17 and userland as of 2006-12-18 (all are
i386 machines).

The problem: box1 locks up about once or twice a day. If not,
everything works mostly as expected (box3 hangs sometimes, too, but
that may be hardware related).

When box1 locks up, the cursor sits blinking in the corner of a dark
screen, and no three-finger-salute or other key press brings the
machine back to life. A hard reset is required. I have swapped in new
hardware already. A dmesg of the new hardware is attached. 

A similar setup was working nicely for months, but with user-mode PPP,
older code (of course), and with an ISP that wasn't IPv6 capable
although I don't use that - it's IPv4 only, and the ICMP6 patch should
be in on all machines, anyway.

The only messages I find in the logs are concerned with "BAD state"
(from pf), and this:

Feb 21 04:02:17 box1 /bsd: pppoe0: LCP keepalive timeout
Feb 21 04:04:58 box1 /bsd: pppoe1: LCP keepalive timeout
Feb 21 07:25:39 box1 syslogd: restart

The time gap shows the period while the machine hung.

What gives, and how to debug?


Thank you!


Best,
--Toni++
OpenBSD 4.0-stable (GENERIC.MP) #1: Thu Feb 15 22:48:55 CET 2007
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC.MP
cpu0: Intel(R) Xeon(TM) CPU 3.00GHz ("GenuineIntel" 686-class) 3.01 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,CNXT-ID,CX16
real mem  = 1073258496 (1048104K)
avail mem = 970964992 (948208K)
using 4256 buffers containing 53764096 bytes (52504K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(00) BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf0000, 
SMBIOS rev. 2.3 @ 0xec000 (77 entries)
bios0: HP ProLiant DL380 G4
pcibios0 at bios0: rev 2.1 @ 0xf0000/0x2000
pcibios0: PCI BIOS has 7 Interrupt Routing table entries
pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82801EB/ER LPC" rev 0x00)
pcibios0: PCI bus #10 is the last bus
bios0: ROM list: 0xc0000/0x8000 0xc8000/0x4000! 0xcc000/0x1800 0xcd800/0x1600 
0xee000/0x2000!
mainbus0: Intel MP Specification (Version 1.4) (HP       PROLIANT    )
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: apic clock running at 200 MHz
mainbus0: bus 0 is type PCI   
mainbus0: bus 1 is type PCI   
mainbus0: bus 2 is type PCI   
mainbus0: bus 3 is type PCI   
mainbus0: bus 4 is type PCI   
mainbus0: bus 5 is type PCI   
mainbus0: bus 6 is type PCI   
mainbus0: bus 10 is type PCI   
mainbus0: bus 32 is type ISA   
ioapic0 at mainbus0: apid 8 pa 0xfec00000, version 20, 24 pins
ioapic1 at mainbus0: apid 9 pa 0xfec80000, version 20, 24 pins
ioapic2 at mainbus0: apid 10 pa 0xfec80400, version 20, 24 pins
ioapic3 at mainbus0: apid 11 pa 0xfec84000, version 20, 24 pins
ioapic4 at mainbus0: apid 12 pa 0xfec84400, version 20, 24 pins
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel E7520 MCH" rev 0x0c
ppb0 at pci0 dev 2 function 0 "Intel MCH PCIE" rev 0x0c
pci1 at ppb0 bus 2
ppb1 at pci1 dev 0 function 0 "Intel PCIE-PCIE" rev 0x09
pci2 at ppb1 bus 3
bge0 at pci2 dev 1 function 0 "Broadcom BCM5704C" rev 0x10, BCM5704 B0 
(0x2100): apic 9 int 1 (irq 5), address 00:18:fe:fc:5e:11
brgphy0 at bge0 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0
bge1 at pci2 dev 1 function 1 "Broadcom BCM5704C" rev 0x10, BCM5704 B0 
(0x2100): apic 9 int 2 (irq 5), address 00:18:fe:fc:5e:10
brgphy1 at bge1 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0
ppb2 at pci1 dev 0 function 2 "Intel PCIE-PCIE" rev 0x09
pci3 at ppb2 bus 4
ciss0 at pci3 dev 3 function 0 "Compaq Smart Array 64xx" rev 0x01: apic 10 int 
3 (irq 5)
ciss0: 1 LD, HW rev 1, FW 2.68/2.68
scsibus0 at ciss0: 1 targets
sd0 at scsibus0 targ 0 lun 0: <HP, LOGICAL VOLUME, 2.68> SCSI0 0/direct fixed
sd0: 69459MB, 69459 cyl, 64 head, 32 sec, 512 bytes/sec, 142253280 sec total
ppb3 at pci0 dev 6 function 0 "Intel MCH PCIE" rev 0x0c
pci4 at ppb3 bus 5
ppb4 at pci4 dev 0 function 0 "Intel PCIE-PCIE" rev 0x09
pci5 at ppb4 bus 6
em0 at pci5 dev 2 function 0 "Intel PRO/1000MT (82546EB)" rev 0x01: apic 11 int 
6 (irq 5), address 00:11:0a:63:a5:52
em1 at pci5 dev 2 function 1 "Intel PRO/1000MT (82546EB)" rev 0x01: apic 11 int 
5 (irq 5), address 00:11:0a:63:a5:53
ppb5 at pci4 dev 0 function 2 "Intel PCIE-PCIE" rev 0x09
pci6 at ppb5 bus 10
uhci0 at pci0 dev 29 function 0 "Intel 82801EB/ER USB" rev 0x02: apic 8 int 16 
(irq 5)
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: Intel UHCI root hub, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1 at pci0 dev 29 function 1 "Intel 82801EB/ER USB" rev 0x02: apic 8 int 19 
(irq 5)
usb1 at uhci1: USB revision 1.0
uhub1 at usb1
uhub1: Intel UHCI root hub, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2 at pci0 dev 29 function 2 "Intel 82801EB/ER USB" rev 0x02: apic 8 int 18 
(irq 5)
usb2 at uhci2: USB revision 1.0
uhub2 at usb2
uhub2: Intel UHCI root hub, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhci3 at pci0 dev 29 function 3 "Intel 82801EB/ER USB" rev 0x02: apic 8 int 16 
(irq 5)
usb3 at uhci3: USB revision 1.0
uhub3 at usb3
uhub3: Intel UHCI root hub, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
ehci0 at pci0 dev 29 function 7 "Intel 82801EB/ER USB2" rev 0x02: apic 8 int 23 
(irq 5)
usb4 at ehci0: USB revision 2.0
uhub4 at usb4
uhub4: Intel EHCI root hub, rev 2.00/1.00, addr 1
uhub4: 8 ports with 8 removable, self powered
ppb6 at pci0 dev 30 function 0 "Intel 82801BA AGP" rev 0xc2
pci7 at ppb6 bus 1
vga1 at pci7 dev 3 function 0 "ATI Rage XL" rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
"Compaq iLO" rev 0x01 at pci7 dev 4 function 0 not configured
"Compaq iLO" rev 0x01 at pci7 dev 4 function 2 not configured
ichpcib0 at pci0 dev 31 function 0 "Intel 82801EB/ER LPC" rev 0x02
pciide0 at pci0 dev 31 function 1 "Intel 82801EB/ER IDE" rev 0x02: DMA, channel 
0 configured to compatibility, channel 1 configured to compatibility
atapiscsi0 at pciide0 channel 0 drive 0
scsibus1 at atapiscsi0: 2 targets
cd0 at scsibus1 targ 0 lun 0: <TSSTcorp, CD-ROM TS-L162C, N204> SCSI0 5/cdrom 
removable
cd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
pciide0: channel 1 disabled (no drives)
isa0 at ichpcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pms0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
spkr0 at pcppi0
npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
biomask 0 netmask 0 ttymask 0
pctr: user-level cycle counter enabled
dkcsum: sd0 matches BIOS drive 0x80
root on sd0a
rootdev=0x400 rrootdev=0xd00 rawdev=0xd02
WARNING: / was not properly unmounted

Reply via email to