Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-08-02 Thread ropers
2009/6/26 Jussi Peltola <pe...@pelzi.net>:
> memtest86+: it can prove it's broken, but if it doesn't
> find problems it doesn't guarantee there are none.

I've said this before, but I've yet to see faulty RAM whose problems
memtest86+ will not detect during a 24hr burn-in test. Sure, I've seen
faulty RAM that memtest86+ said was ok during a single-pass test, but
if you let memtest86+ run for 24 hours it'll probably find just about
any error.
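
A rough way to see why a long burn-in catches what a single pass misses, sketched under a strong simplifying assumption: each pass detects a given intermittent fault independently with the same probability. The function name and the numbers are invented for illustration; real faults (heat- or timing-dependent ones in particular) need not behave this way.

```python
def detection_probability(p_per_pass, passes):
    """Chance that at least one of `passes` independent passes flags the fault.

    p_per_pass is a hypothetical per-pass detection probability; the
    independence between passes is an assumption, not a measured property.
    """
    if not 0.0 <= p_per_pass <= 1.0:
        raise ValueError("p_per_pass must be between 0 and 1")
    if passes < 0:
        raise ValueError("passes must be non-negative")
    # The fault is missed only if every single pass misses it.
    return 1.0 - (1.0 - p_per_pass) ** passes


if __name__ == "__main__":
    # Invented example: a fault that one pass catches only 5% of the time.
    for n in (1, 10, 60):
        print(n, "passes:", round(detection_probability(0.05, n), 3))
```

Under this model the probability approaches 1 as passes accumulate but never reaches it, which fits both the "24 hours finds nearly everything" experience and the "no guarantee" caveat.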

Of course, depending on your circumstances it may be more economical
to just chuck the suspect RAM instead of wasting 24 hours. And
granted, YMMV. But if anyone has ever seen any faulty RAM whose
problems a 24hr burn-in test with memtest86+ could not detect, I'd be
very interested in hearing that.

regards,
--ropers



Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-08-02 Thread Mattieu Baptiste
On Sun, Aug 2, 2009 at 10:45 AM, ropers <rop...@gmail.com> wrote:

> Of course, depending on your circumstances it may be more economical
> to just chuck the suspect RAM instead of wasting 24 hours. And
> granted, YMMV. But if anyone has ever seen any faulty RAM whose
> problems a 24hr burn-in test with memtest86+ could not detect, I'd be
> very interested in hearing that.
>
> regards,
> --ropers



I have seen errors appear after more than 24 hours: 36 or 48 hours.
Yes, it can happen.

Cheers,

-- 
Mattieu Baptiste
/earth is 102% full ... please delete anyone you can.



Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-08-02 Thread Nick Holland
ropers wrote:
> 2009/6/26 Jussi Peltola <pe...@pelzi.net>:
>> memtest86+: it can prove it's broken, but if it doesn't
>> find problems it doesn't guarantee there are none.

This is correct.

> I've said this before, but I've yet to see faulty RAM whose problems
> memtest86+ will not detect during a 24hr burn-in test. Sure, I've seen
> faulty RAM that memtest86+ said was ok during a single-pass test, but
> if you let memtest86+ run for 24 hours it'll probably find just about
> any error.
>
> Of course, depending on your circumstances it may be more economical
> to just chuck the suspect RAM instead of wasting 24 hours. And
> granted, YMMV. But if anyone has ever seen any faulty RAM whose
> problems a 24hr burn-in test with memtest86+ could not detect, I'd be
> very interested in hearing that.

How about this...
Some years ago, Walmart had some Athlon-based $200 PCs that used
100MHz SDRAM, IIRC.

For giggles one day, I stuck some oddball SDRAM modules in one
of them, and not too surprisingly, the thing failed to boot.  In
addition to being blatantly the wrong speed (66MHz), it was some really
odd junk that didn't work in much of anything that used normal SDRAM of
any speed.  I didn't expect it to work, it confirmed my expectations;
any OS that was loaded on the disk refused to boot very far before
barfing all over itself with this RAM installed.

Having enjoyed that part (yeah, I'm oddly amused), I figured running
memtest86 would be an interesting test.

Well, I'm somewhat disturbed to say that memtest86 had no problem
testing all that junk^Woddball RAM, and told me it was all perfect.
It may have been... but it certainly didn't work in that machine, and
yet it passed every diagnostic memtest86 threw at it for hours.  I
don't recall how long I left it cooking, but I know it was more than
24 hours, and I think it was for a few days before I needed that bit
of shelf space and shut down the test.

You can argue that memtest86 was correct that the memory itself was
good, but since no other OS seemed to be able to make that RAM work
in that machine, I don't think it is a very convincing argument --
that machine's memory subsystem was clearly broke, the diagnosis
easy to confirm (swap RAM, system works, swap back, system won't
boot).  Yes, I think this qualifies as a memtest86 bug, as it missed
something very basic, and maybe it's been fixed by now, but it still
proves the point: passing diagnostics only means the diagnostics
didn't find anything, it doesn't mean things are good.

(Years before that, I saw a great demonstration of this:
one of the machines I sold and supported had a very good internal
diagnostic that included a looping RAM test, BUT if you installed
a DIP (the old, traditional IC-style RAM) with pin 1 not properly
plugged into the socket, the system would pass the internal
diagnostics for as long as you wished to run them.  You see,
this machine's diagnostics tested RAM in 64k pages.  Pin #1 on a
256kbit RAM chip happened to be used to pick which of the four(1)
64k pages was selected on the RAM chip, so the diagnostics just
tested the same 64k bits of that chip four times and said "no
problems found", even though the OS or apps would crash rather soon
after booting.  Fortunately, my then-young eyes were good enough to
spot the pin bent under the socket.)
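
The anecdote above can be reproduced with a toy model. This is a sketch under stated assumptions, not a description of that machine's actual diagnostic: the chip is modelled as a 256K x 1 DRAM whose nine address pins are used twice per access (row, then column), and a disconnected top address pin is assumed to read as a constant 1 (a real floating input could behave differently). The class and function names are made up for illustration.

```python
class Dram256Kx1:
    """Toy 256K x 1 DRAM: 9 address pins, latched once for row, once for column."""

    def __init__(self, a8_connected=True):
        self.cells = bytearray(1 << 18)   # one byte per stored bit, for simplicity
        self.a8_connected = a8_connected

    def _latch(self, nine_bits):
        # Assumption: an unconnected top address pin always reads as 1.
        return nine_bits if self.a8_connected else nine_bits | 0x100

    def _cell(self, addr):
        row = self._latch((addr >> 9) & 0x1FF)
        col = self._latch(addr & 0x1FF)
        return (row << 9) | col

    def write(self, addr, bit):
        self.cells[self._cell(addr)] = bit

    def read(self, addr):
        return self.cells[self._cell(addr)]


def make_addr(page, offset):
    """Build an address whose page number lives in the top row and column bits."""
    row = ((page >> 1) << 8) | (offset >> 8)
    col = ((page & 1) << 8) | (offset & 0xFF)
    return (row << 9) | col


def per_page_test(chip):
    """Write and verify one 64k page at a time, like the diagnostic in the story."""
    for page in range(4):
        for off in range(1 << 16):
            chip.write(make_addr(page, off), off & 1)
        for off in range(1 << 16):
            if chip.read(make_addr(page, off)) != (off & 1):
                return False
    return True


def whole_chip_test(chip):
    """Fill all four pages with page-dependent data first, then verify everything."""
    for page in range(4):
        for off in range(1 << 16):
            chip.write(make_addr(page, off), (page + off) & 1)
    for page in range(4):
        for off in range(1 << 16):
            if chip.read(make_addr(page, off)) != ((page + off) & 1):
                return False
    return True


if __name__ == "__main__":
    for connected in (True, False):
        chip = Dram256Kx1(a8_connected=connected)
        print("pin connected:", connected,
              "| per-page:", per_page_test(chip),
              "| whole-chip:", whole_chip_test(chip))
```

With the pin disconnected, all four pages alias onto the same 64k cells, so a test that verifies each page right after writing it never notices; only a test whose expected data depends on the page and is checked after everything has been written exposes the aliasing.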

memtest86 is a very impressive memory diagnostic program, it does
good things and does them well, but passing memtest86, as with any
diagnostic, just means no problem FOUND.

Nick.


(1) Those thinking "hey, one pin can't select more than TWO pages
of RAM" need not try to correct me; I'm right, you don't understand
how this stuff works. :)  Hint: the chips had only 16 pins, including
power, data, address, ground... and yet had an org of 256k x 1 bit.



Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-08-01 Thread Comete

Hello,

As suggested, I finally replaced the RAID controller (Compaq Smart Array
431) with an identical device, and the firewall still crashes with the
same error.

So if it is not the hardware, could it be the software?
How and where can I submit a bug report? What information do you need?
What can I do or type at the debugger prompt when I get the error?

The firewall now seems to crash three times a week, and it's very
annoying.


Thanks for the help :)


Michal wrote:

Other servers?? I don't mean PDU, I mean PSU... the power supply in the
server. If you're sharing a power supply across 2 servers I would be shocked
:)


Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-08-01 Thread Stuart Henderson
On 2009-08-01, Comete <com...@daknet.org> wrote:
> as suggested, i finally changed the RAID controler (Compaq Smart Array
> 431) with the same device and the firewall still crashes with the same
> error.

I think the idea was probably to change it for a completely different
type of device (or for just a plain SCSI controller).

> So if it is not the hardware could it be the software ?

yes, quite possibly the cac(4) driver.

> How can i submit a bug report and where ? What informations do you need ?
> what can i do or type when i get the error with the debug prompt ?

As a starter: trace and ps (typed at the ddb> prompt).



Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Comète

Well, I have tested the RAM with memtest: no errors.

Any other ideas?

Thanks

Daniel Gracia Garallar wrote:
Oh, and maybe bad RAM; I've hit some nasty errors with these faulty
DIMMs... :/


Comète wrote:

Hi,

We are using the latest OpenBSD 4.5-stable release on an old Compaq
ProLiant ML350 as a firewall with spamd, but we get random system
crashes (once every week or two). The system always displays the
same message:


uvm_fault(0xd080d9e0, 0x0, 0, 1) -> e

kernel: page fault trap, code=0

Stopped at cac_pci_l0_intr_pending+0xb
push 0x34 (%eax)

What do you think it could be? I thought it might be a hardware
problem, but where exactly...


My dmesg is below.

Thanks for your advice!

OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009
r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) III CPU family 1266MHz (GenuineIntel 686-class) 1.27 GHz
cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE
real mem  = 267988992 (255MB)
avail mem = 250839040 (239MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries)
bios0: vendor Compaq version D11 date 01/29/2002
bios0: Compaq ProLiant ML350 G2
acpi0 at bios0: rev 0
acpi0: tables DSDT FACP APIC SPCR
acpi0: wakeup devices PBTN(S5)
acpitimer0 at acpi0: 3579545 Hz, 32 bits
acpimadt0 at acpi0 addr 0xfee0: PC-AT compat
cpu0 at mainbus0: apid 3 (boot processor)
cpu0: apic clock running at 132MHz
ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins
ioapic0: misconfigured as apic 0, remapped to apid 8
ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins
ioapic1: misconfigured as apic 0, remapped to apid 2
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 2 (PCI1)
acpicpu0 at acpi0
acpitz0 at acpi0: critical temperature 31 degC
acpibtn0 at acpi0: PBTN
bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000!
pci0 at mainbus0 bus 0: configuration mode 1 (bios)
pchb0 at pci0 dev 0 function 0 ServerWorks CNB20LE Host rev 0x06
pchb1 at pci0 dev 0 function 1 ServerWorks CNB20LE Host rev 0x06
pci1 at pchb1 bus 2
em0 at pci1 dev 1 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4
em1 at pci1 dev 2 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d
re0 at pci1 dev 3 function 0 D-Link Systems DGE-528T rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e
rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3
cac0 at pci1 dev 4 function 0 DEC Compaq SMART RAID 42xx rev 0x01: apic 2 int 6 (irq 11), Smart Array 431
scsibus0 at cac0: 1 targets
sd0 at scsibus0 targ 0 lun 0: Compaq, RAID1 vol #00,  SCSI2 0/direct fixed
sd0: 34727MB, 512 bytes/sec, 71122560 sec total
re1 at pci1 dev 5 function 0 D-Link Systems DGE-528T rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12
rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3
fxp0 at pci0 dev 1 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7
inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4
ahc0 at pci0 dev 2 function 0 Adaptec AHA-3960D U160 rev 0x01: apic 2 int 11 (irq 11)
scsibus1 at ahc0: 16 targets, initiator 7
ahc1 at pci0 dev 2 function 1 Adaptec AHA-3960D U160 rev 0x01: apic 2 int 11 (irq 11)
scsibus2 at ahc1: 16 targets, initiator 7
st0 at scsibus2 targ 6 lun 0: COMPAQ, SDT-9000, 4.20 SCSI2 1/sequential removable
fxp1 at pci0 dev 4 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 13 (irq 10), address 00:08:02:45:29:64
inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4
vga1 at pci0 dev 5 function 0 ATI Rage XL rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
Compaq Netelligent ASMC rev 0x00 at pci0 dev 6 function 0 not configured
piixpm0 at pci0 dev 15 function 0 ServerWorks CSB5 rev 0x92: polling
iic0 at piixpm0
iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 02=0303 03=0101 04=7f7f 05=0404 06=0303 07=
spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2
pciide0 at pci0 dev 15 function 1 ServerWorks CSB5 IDE rev 0x92: DMA
atapiscsi0 at pciide0 channel 0 drive 0
scsibus3 at atapiscsi0: 2 targets
cd0 at scsibus3 targ 0 lun 0: COMPAQ, CRD-8402B, 1.03 ATAPI 5/cdrom removable
cd0(pciide0:0:0): using PIO

Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Comete

No problem with the PSU and voltage limits. The PSU isn't used at its
full capacity and the other servers plugged into it work well.

Could it be a bad network interface?

Michal wrote:

Just stabbing in the dark here: test the voltage rails on your PSU. Check they
are within limits. I find unexplained crashes can be traced back to PSUs
quite often.


Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Tobias Ulmer
On Thu, Jun 25, 2009 at 05:23:40PM +0200, Comète wrote:
> Hi,
>
> we are using the last OpenBSD 4.5-stable release on an old Compaq
> Proliant ML350 as a firewall with spamd. But we encounter randomly some
> system crashes (once a week or two weeks). The system always displays
> the same message:
>
> uvm_fault(0xd080d9e0, 0x0, 0, 1) -> e
>
> kernel: page fault trap, code=0
>
> Stopped at cac_pci_l0_intr_pending+0xb
> push 0x34 (%eax)

The function name gives it away. 99% cac(4) is causing your problem.
Connect the drive to another scsi controller and if possible disable the
compaq thing in the bios.
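
The reasoning above (read the function name in the "Stopped at" line and see which driver it belongs to) can be sketched as a small helper. This is only an illustration: the function name `parse_stopped_at` is made up, and treating the text before the first underscore as the driver name is a heuristic that happens to work for `cac_pci_l0_intr_pending`, not a rule the kernel follows.

```python
import re

# Matches the symbol+offset part of a ddb "Stopped at" line as quoted in this thread.
STOPPED_RE = re.compile(
    r"Stopped at\s+(?P<symbol>[A-Za-z_]\w*)\+(?P<offset>0x[0-9a-fA-F]+)"
)


def parse_stopped_at(line):
    """Return (symbol, offset, guessed_driver), or None if the line doesn't match."""
    match = STOPPED_RE.search(line)
    if match is None:
        return None
    symbol = match.group("symbol")
    offset = int(match.group("offset"), 16)
    # Heuristic: many driver functions are prefixed with the driver name.
    guessed_driver = symbol.split("_", 1)[0]
    return symbol, offset, guessed_driver


if __name__ == "__main__":
    print(parse_stopped_at("Stopped at cac_pci_l0_intr_pending+0xb"))
```

The guessed prefix can then be checked against the dmesg, where `cac0` is the Smart Array 431 controller.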



Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Michal
Well, you can check the voltage readings in the BIOS; most will give you a
reading. I am sure there is some BSD software out there for this too; maybe
someone on the list will know. On Windows you can use SpeedFan. Even if it's
not this, it's worth knowing how to do this as a simple check on servers.
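
As a rough illustration of "within limits": the sketch below assumes the ±5% tolerance commonly quoted for the positive rails in the ATX specification. The ProLiant's own supply may specify different limits, and the readings used here are invented. On OpenBSD, `sysctl hw.sensors` lists whatever hardware sensors the kernel has found, which may or may not include voltages on this machine.

```python
# Nominal voltage and allowed fractional deviation per rail.
# Assumption: ATX-style +/-5% limits; check the actual PSU documentation.
RAILS = {
    "+3.3V": (3.3, 0.05),
    "+5V": (5.0, 0.05),
    "+12V": (12.0, 0.05),
}


def rail_ok(name, measured):
    """Return True if `measured` volts is within the assumed tolerance for `name`."""
    nominal, tolerance = RAILS[name]
    return abs(measured - nominal) <= nominal * tolerance


if __name__ == "__main__":
    # Hypothetical readings, e.g. copied by hand from a BIOS health screen.
    readings = {"+3.3V": 3.28, "+5V": 4.70, "+12V": 11.50}
    for rail, volts in readings.items():
        print(rail, volts, "OK" if rail_ok(rail, volts) else "OUT OF RANGE")
```

A reading inside the window says little about ripple or behaviour under load, so a pass here is weak evidence at best.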

-----Original Message-----
From: Comète [mailto:com...@daknet.org]
Sent: 26 June 2009 16:42
To: Michal
Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable

Oh sorry :p
How could I test the power supply unit?


Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Jussi Peltola
But even measuring the ripple with a scope won't guarantee it's OK.
Swapping out all of the hardware is sometimes the only way to find out.
Same goes for memtest86+: it can prove it's broken, but if it doesn't
find problems it doesn't guarantee there are none.

-- 
Jussi Peltola



Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Maxim Bourmistrov

Overheating?

On 26 jun 2009, at 17.50, Michal wrote:

Well, you can check the voltage readings in the BIOS; most will give you a
reading. I am sure there is some BSD software out there for this too; maybe
someone on the list will know. On Windows you can use SpeedFan. Even if this
isn't the cause, it's worth knowing how to do this as a simple check on
servers.


-Original Message-
From: Comète [mailto:com...@daknet.org]
Sent: 26 June 2009 16:42
To: Michal
Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable

Oh, sorry :p
How could I test the power supply unit?

Michal wrote:
Other servers?? I don't mean the PDU, I mean the PSU: the power supply in the
server. If you're sharing a power supply across 2 servers I would be
shocked :)

-Original Message-
From: Comète [mailto:com...@daknet.org]
Sent: 26 June 2009 13:48
To: Michal
Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable

No problem with the PSU and voltage limits. The PSU isn't used at its
full capacity and the other servers plugged into it work well.

Could it be a bad network interface?

Michal wrote:
Just a stab in the dark here: test the voltage rails on your PSU and check
they are within limits. I find unexplained crashes can be traced back to
PSUs quite often.

-Original Message-
From: owner-m...@openbsd.org [mailto:owner-m...@openbsd.org] On Behalf Of Comète
Sent: 26 June 2009 12:22
To: Misc OpenBSD
Cc: Daniel Gracia Garallar
Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable

Well, I have tested the RAM with memtest: no errors.

Any other ideas?

Thanks

Daniel Gracia Garallar wrote:

Oh and maybe bad RAM; I've hit some nasty errors with these faulty
DIMMs... :/

Comète wrote:

Hi,

we are using the latest OpenBSD 4.5-stable release on an old Compaq
ProLiant ML350 as a firewall with spamd, but we get random system
crashes (once every week or two). The system always displays the same
message:

uvm_fault(0xd080d9e0, 0x0, 0, 1) -> e

kernel: page fault trap, code=0

Stopped at cac_pci_l0_intr_pending+0xb
push 0x34 (%eax)

What do you think it could be? I suspect a hardware problem, but where
exactly?

My dmesg is attached below.

Thanks for your advice!

OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009
   r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) III CPU family 1266MHz (GenuineIntel
686-class) 1.27 GHz
cpu0:
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE

real mem  = 267988992 (255MB)
avail mem = 250839040 (239MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @
0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries)
bios0: vendor Compaq version D11 date 01/29/2002
bios0: Compaq ProLiant ML350 G2
acpi0 at bios0: rev 0
acpi0: tables DSDT FACP APIC SPCR
acpi0: wakeup devices PBTN(S5)
acpitimer0 at acpi0: 3579545 Hz, 32 bits
acpimadt0 at acpi0 addr 0xfee0: PC-AT compat
cpu0 at mainbus0: apid 3 (boot processor)
cpu0: apic clock running at 132MHz
ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins
ioapic0: misconfigured as apic 0, remapped to apid 8
ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins
ioapic1: misconfigured as apic 0, remapped to apid 2
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 2 (PCI1)
acpicpu0 at acpi0
acpitz0 at acpi0: critical temperature 31 degC
acpibtn0 at acpi0: PBTN
bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800
0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000!
pci0 at mainbus0 bus 0: configuration mode 1 (bios)
pchb0 at pci0 dev 0 function 0 ServerWorks CNB20LE Host rev 0x06
pchb1 at pci0 dev 0 function 1 ServerWorks CNB20LE Host rev 0x06
pci1 at pchb1 bus 2
em0 at pci1 dev 1 function 0 Intel PRO/1000T (82544GC) rev 0x02:
apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4
em1 at pci1 dev 2 function 0 Intel PRO/1000T (82544GC) rev 0x02:
apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d
re0 at pci1 dev 3 function 0 D-Link Systems DGE-528T rev 0x10:
RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address
00:1c:f0:6f:38:7e
rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3
cac0 at pci1 dev 4 function 0 DEC Compaq SMART RAID 42xx rev 0x01:
apic 2 int 6 (irq 11), Smart Array 431
scsibus0 at cac0: 1 targets
sd0 at scsibus0 targ 0 lun 0: Compaq, RAID1 vol #00,  SCSI2 0/direct
fixed
sd0: 34727MB, 512 bytes/sec, 71122560 sec total
re1 at pci1 dev 5 function 0 D-Link Systems DGE-528T rev 0x10:
RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address
00:1c:f0:62:eb:12
rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3
fxp0 at pci0 dev 1 function 0 Intel 8255x rev 0x08, i82559: apic 2
int 10 (irq 5), address 00:02:a5:44:33:f7
inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4
ahc0 at pci0 dev 2 function 0 Adaptec AHA-3960D U160 rev 0x01: apic
2 int 11 (irq 11)
scsibus1 at ahc0: 16 targets, initiator 7
ahc1 at pci0 dev 2 function 1 Adaptec AHA-3960D

Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Stuart Henderson
On 2009-06-26, Michal mic...@sharescope.co.uk wrote:
 Well, you can check the Volt readings in the bios, most will give you a
 reading, but I am sure there is some BSD software out there

sysctl hw.sensors works on some systems. Also see sensorsd(8).
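
If the machine does expose voltage sensors, the rail check Michal suggests
could be scripted roughly as below. This is only a sketch under stated
assumptions: the line format (name=value VDC (description)) is what
sysctl hw.sensors usually prints for voltage sensors, the tolerance figures
are the common ATX convention rather than anything from this thread, and the
function names are made up for illustration.

```python
import re

# Nominal rail voltages and allowed fractional deviation. Assumption: the
# usual ATX convention (+/-5% on positive rails, +/-10% on -12V).
RAILS = {3.3: 0.05, 5.0: 0.05, 12.0: 0.05, -12.0: 0.10}

# Assumed line format, e.g.: hw.sensors.lm0.volt2=4.95 VDC (+5V)
VOLT_RE = re.compile(
    r"^(hw\.sensors\.\S+?\.volt\d+)=(-?\d+(?:\.\d+)?) VDC(?: \((.*?)\))?"
)


def parse_voltages(sysctl_output):
    """Return (name, volts, description) for each voltage sensor line."""
    readings = []
    for line in sysctl_output.splitlines():
        m = VOLT_RE.match(line.strip())
        if m:
            readings.append((m.group(1), float(m.group(2)), m.group(3) or ""))
    return readings


def nearest_rail(volts):
    """Guess which nominal rail a reading belongs to, or None if unclear."""
    rail = min(RAILS, key=lambda nominal: abs(nominal - volts))
    # Readings such as VCore are nowhere near a standard rail; skip them.
    if abs(rail - volts) > 0.2 * abs(rail):
        return None
    return rail


def out_of_tolerance(readings):
    """Return (name, volts, rail) for readings outside the allowed band."""
    bad = []
    for name, volts, _desc in readings:
        rail = nearest_rail(volts)
        if rail is not None and abs(volts - rail) > RAILS[rail] * abs(rail):
            bad.append((name, volts, rail))
    return bad
```

Feed it the text printed by sysctl hw.sensors, for example
out_of_tolerance(parse_voltages(text)). An empty result only means the
sensors report sane values at that moment; as said earlier in the thread,
it does not prove the PSU is fine.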



Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-26 Thread Daniel Gracia Garallar
Can't you read that? The custom-compiled kernel and the cac error speak for
themselves. Dirty solution: try another disk controller. Best solution:
rule out bad hardware and, if everything is OK, contact the developers and
help find a code patch to improve the RAID adapter driver.


Regards!

Dani

Comète wrote:

Hi,

we are using the latest OpenBSD 4.5-stable release on an old Compaq
ProLiant ML350 as a firewall with spamd, but we get random system
crashes (once every week or two). The system always displays the same
message:

uvm_fault(0xd080d9e0, 0x0, 0, 1) -> e

kernel: page fault trap, code=0

Stopped at cac_pci_l0_intr_pending+0xb
push 0x34 (%eax)

What do you think it could be? I suspect a hardware problem, but where
exactly?

My dmesg is attached below.

Thanks for your advice!

OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009
r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) III CPU family 1266MHz (GenuineIntel 
686-class) 1.27 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE 


real mem  = 267988992 (255MB)
avail mem = 250839040 (239MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, 
SMBIOS rev. 2.3 @ 0xec000 (31 entries)

bios0: vendor Compaq version D11 date 01/29/2002
bios0: Compaq ProLiant ML350 G2
acpi0 at bios0: rev 0
acpi0: tables DSDT FACP APIC SPCR
acpi0: wakeup devices PBTN(S5)
acpitimer0 at acpi0: 3579545 Hz, 32 bits
acpimadt0 at acpi0 addr 0xfee0: PC-AT compat
cpu0 at mainbus0: apid 3 (boot processor)
cpu0: apic clock running at 132MHz
ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins
ioapic0: misconfigured as apic 0, remapped to apid 8
ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins
ioapic1: misconfigured as apic 0, remapped to apid 2
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 2 (PCI1)
acpicpu0 at acpi0
acpitz0 at acpi0: critical temperature 31 degC
acpibtn0 at acpi0: PBTN
bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 
0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000!

pci0 at mainbus0 bus 0: configuration mode 1 (bios)
pchb0 at pci0 dev 0 function 0 ServerWorks CNB20LE Host rev 0x06
pchb1 at pci0 dev 0 function 1 ServerWorks CNB20LE Host rev 0x06
pci1 at pchb1 bus 2
em0 at pci1 dev 1 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 
2 int 0 (irq 5), address 00:02:b3:b9:0d:a4
em1 at pci1 dev 2 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 
2 int 2 (irq 15), address 00:02:b3:b9:0d:7d
re0 at pci1 dev 3 function 0 D-Link Systems DGE-528T rev 0x10: 
RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e

rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3
cac0 at pci1 dev 4 function 0 DEC Compaq SMART RAID 42xx rev 0x01: 
apic 2 int 6 (irq 11), Smart Array 431

scsibus0 at cac0: 1 targets
sd0 at scsibus0 targ 0 lun 0: Compaq, RAID1 vol #00,  SCSI2 0/direct 
fixed

sd0: 34727MB, 512 bytes/sec, 71122560 sec total
re1 at pci1 dev 5 function 0 D-Link Systems DGE-528T rev 0x10: 
RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12

rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3
fxp0 at pci0 dev 1 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 
10 (irq 5), address 00:02:a5:44:33:f7

inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4
ahc0 at pci0 dev 2 function 0 Adaptec AHA-3960D U160 rev 0x01: apic 2 
int 11 (irq 11)

scsibus1 at ahc0: 16 targets, initiator 7
ahc1 at pci0 dev 2 function 1 Adaptec AHA-3960D U160 rev 0x01: apic 2 
int 11 (irq 11)

scsibus2 at ahc1: 16 targets, initiator 7
st0 at scsibus2 targ 6 lun 0: COMPAQ, SDT-9000, 4.20 SCSI2 
1/sequential removable
fxp1 at pci0 dev 4 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 
13 (irq 10), address 00:08:02:45:29:64

inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4
vga1 at pci0 dev 5 function 0 ATI Rage XL rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
Compaq Netelligent ASMC rev 0x00 at pci0 dev 6 function 0 not configured
piixpm0 at pci0 dev 15 function 0 ServerWorks CSB5 rev 0x92: polling
iic0 at piixpm0
iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 
09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 
28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 
3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 
51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 
81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 
a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 
02=0303 03=0101 04=7f7f 05=0404 06=0303 07=

spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2
pciide0 at pci0 dev 15 function 1 ServerWorks CSB5 IDE rev 0x92: DMA
atapiscsi0 at pciide0 channel 0 drive 0
scsibus3 at atapiscsi0: 2 

random crashes on a firewall with OpenBSD 4.5-stable

2009-06-25 Thread Comète

Hi,

we are using the latest OpenBSD 4.5-stable release on an old Compaq
ProLiant ML350 as a firewall with spamd, but we get random system
crashes (once every week or two). The system always displays the same
message:

uvm_fault(0xd080d9e0, 0x0, 0, 1) -> e

kernel: page fault trap, code=0

Stopped at cac_pci_l0_intr_pending+0xb
push 0x34 (%eax)

What do you think it could be? I suspect a hardware problem, but where
exactly?

My dmesg is attached below.

Thanks for your advice!

OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009
r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) III CPU family 1266MHz (GenuineIntel 
686-class) 1.27 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE

real mem  = 267988992 (255MB)
avail mem = 250839040 (239MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, 
SMBIOS rev. 2.3 @ 0xec000 (31 entries)

bios0: vendor Compaq version D11 date 01/29/2002
bios0: Compaq ProLiant ML350 G2
acpi0 at bios0: rev 0
acpi0: tables DSDT FACP APIC SPCR
acpi0: wakeup devices PBTN(S5)
acpitimer0 at acpi0: 3579545 Hz, 32 bits
acpimadt0 at acpi0 addr 0xfee0: PC-AT compat
cpu0 at mainbus0: apid 3 (boot processor)
cpu0: apic clock running at 132MHz
ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins
ioapic0: misconfigured as apic 0, remapped to apid 8
ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins
ioapic1: misconfigured as apic 0, remapped to apid 2
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 2 (PCI1)
acpicpu0 at acpi0
acpitz0 at acpi0: critical temperature 31 degC
acpibtn0 at acpi0: PBTN
bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 
0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000!

pci0 at mainbus0 bus 0: configuration mode 1 (bios)
pchb0 at pci0 dev 0 function 0 ServerWorks CNB20LE Host rev 0x06
pchb1 at pci0 dev 0 function 1 ServerWorks CNB20LE Host rev 0x06
pci1 at pchb1 bus 2
em0 at pci1 dev 1 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 
2 int 0 (irq 5), address 00:02:b3:b9:0d:a4
em1 at pci1 dev 2 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 
2 int 2 (irq 15), address 00:02:b3:b9:0d:7d
re0 at pci1 dev 3 function 0 D-Link Systems DGE-528T rev 0x10: 
RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e

rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3
cac0 at pci1 dev 4 function 0 DEC Compaq SMART RAID 42xx rev 0x01: 
apic 2 int 6 (irq 11), Smart Array 431

scsibus0 at cac0: 1 targets
sd0 at scsibus0 targ 0 lun 0: Compaq, RAID1 vol #00,  SCSI2 0/direct fixed
sd0: 34727MB, 512 bytes/sec, 71122560 sec total
re1 at pci1 dev 5 function 0 D-Link Systems DGE-528T rev 0x10: 
RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12

rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3
fxp0 at pci0 dev 1 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 
10 (irq 5), address 00:02:a5:44:33:f7

inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4
ahc0 at pci0 dev 2 function 0 Adaptec AHA-3960D U160 rev 0x01: apic 2 
int 11 (irq 11)

scsibus1 at ahc0: 16 targets, initiator 7
ahc1 at pci0 dev 2 function 1 Adaptec AHA-3960D U160 rev 0x01: apic 2 
int 11 (irq 11)

scsibus2 at ahc1: 16 targets, initiator 7
st0 at scsibus2 targ 6 lun 0: COMPAQ, SDT-9000, 4.20 SCSI2 
1/sequential removable
fxp1 at pci0 dev 4 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 
13 (irq 10), address 00:08:02:45:29:64

inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4
vga1 at pci0 dev 5 function 0 ATI Rage XL rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
Compaq Netelligent ASMC rev 0x00 at pci0 dev 6 function 0 not configured
piixpm0 at pci0 dev 15 function 0 ServerWorks CSB5 rev 0x92: polling
iic0 at piixpm0
iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 
09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 
28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 
3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 
51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 
81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 
a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 
02=0303 03=0101 04=7f7f 05=0404 06=0303 07=

spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2
pciide0 at pci0 dev 15 function 1 ServerWorks CSB5 IDE rev 0x92: DMA
atapiscsi0 at pciide0 channel 0 drive 0
scsibus3 at atapiscsi0: 2 targets
cd0 at scsibus3 targ 0 lun 0: COMPAQ, CRD-8402B, 1.03 ATAPI 5/cdrom 
removable

cd0(pciide0:0:0): using PIO mode 4, DMA mode 2
ohci0 at pci0 dev 15 function 2 ServerWorks OSB4/CSB5 USB rev 0x05: 
apic 8 int 10 (irq 10), version 1.0, legacy support

pchb2 at pci0 dev 15 function 3 ServerWorks CSB5 LPC rev 0x00
usb0 at ohci0: USB 

Re: random crashes on a firewall with OpenBSD 4.5-stable

2009-06-25 Thread Daniel Gracia Garallar
Oh and maybe bad RAM; I've hit some nasty errors with these faulty 
DIMMs... :/


Comète wrote:

Hi,

we are using the latest OpenBSD 4.5-stable release on an old Compaq
ProLiant ML350 as a firewall with spamd, but we get random system
crashes (once every week or two). The system always displays the same
message:

uvm_fault(0xd080d9e0, 0x0, 0, 1) -> e

kernel: page fault trap, code=0

Stopped at cac_pci_l0_intr_pending+0xb
push 0x34 (%eax)

What do you think it could be? I suspect a hardware problem, but where
exactly?

My dmesg is attached below.

Thanks for your advice!

OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009
r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) III CPU family 1266MHz (GenuineIntel 
686-class) 1.27 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE 


real mem  = 267988992 (255MB)
avail mem = 250839040 (239MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, 
SMBIOS rev. 2.3 @ 0xec000 (31 entries)

bios0: vendor Compaq version D11 date 01/29/2002
bios0: Compaq ProLiant ML350 G2
acpi0 at bios0: rev 0
acpi0: tables DSDT FACP APIC SPCR
acpi0: wakeup devices PBTN(S5)
acpitimer0 at acpi0: 3579545 Hz, 32 bits
acpimadt0 at acpi0 addr 0xfee0: PC-AT compat
cpu0 at mainbus0: apid 3 (boot processor)
cpu0: apic clock running at 132MHz
ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins
ioapic0: misconfigured as apic 0, remapped to apid 8
ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins
ioapic1: misconfigured as apic 0, remapped to apid 2
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 2 (PCI1)
acpicpu0 at acpi0
acpitz0 at acpi0: critical temperature 31 degC
acpibtn0 at acpi0: PBTN
bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 
0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000!

pci0 at mainbus0 bus 0: configuration mode 1 (bios)
pchb0 at pci0 dev 0 function 0 ServerWorks CNB20LE Host rev 0x06
pchb1 at pci0 dev 0 function 1 ServerWorks CNB20LE Host rev 0x06
pci1 at pchb1 bus 2
em0 at pci1 dev 1 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 
2 int 0 (irq 5), address 00:02:b3:b9:0d:a4
em1 at pci1 dev 2 function 0 Intel PRO/1000T (82544GC) rev 0x02: apic 
2 int 2 (irq 15), address 00:02:b3:b9:0d:7d
re0 at pci1 dev 3 function 0 D-Link Systems DGE-528T rev 0x10: 
RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e

rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3
cac0 at pci1 dev 4 function 0 DEC Compaq SMART RAID 42xx rev 0x01: 
apic 2 int 6 (irq 11), Smart Array 431

scsibus0 at cac0: 1 targets
sd0 at scsibus0 targ 0 lun 0: Compaq, RAID1 vol #00,  SCSI2 0/direct 
fixed

sd0: 34727MB, 512 bytes/sec, 71122560 sec total
re1 at pci1 dev 5 function 0 D-Link Systems DGE-528T rev 0x10: 
RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12

rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3
fxp0 at pci0 dev 1 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 
10 (irq 5), address 00:02:a5:44:33:f7

inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4
ahc0 at pci0 dev 2 function 0 Adaptec AHA-3960D U160 rev 0x01: apic 2 
int 11 (irq 11)

scsibus1 at ahc0: 16 targets, initiator 7
ahc1 at pci0 dev 2 function 1 Adaptec AHA-3960D U160 rev 0x01: apic 2 
int 11 (irq 11)

scsibus2 at ahc1: 16 targets, initiator 7
st0 at scsibus2 targ 6 lun 0: COMPAQ, SDT-9000, 4.20 SCSI2 
1/sequential removable
fxp1 at pci0 dev 4 function 0 Intel 8255x rev 0x08, i82559: apic 2 int 
13 (irq 10), address 00:08:02:45:29:64

inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4
vga1 at pci0 dev 5 function 0 ATI Rage XL rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
Compaq Netelligent ASMC rev 0x00 at pci0 dev 6 function 0 not configured
piixpm0 at pci0 dev 15 function 0 ServerWorks CSB5 rev 0x92: polling
iic0 at piixpm0
iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 
09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 
28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 
3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 
51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 
81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 
a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 
02=0303 03=0101 04=7f7f 05=0404 06=0303 07=

spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2
pciide0 at pci0 dev 15 function 1 ServerWorks CSB5 IDE rev 0x92: DMA
atapiscsi0 at pciide0 channel 0 drive 0
scsibus3 at atapiscsi0: 2 targets
cd0 at scsibus3 targ 0 lun 0: COMPAQ, CRD-8402B, 1.03 ATAPI 5/cdrom 
removable

cd0(pciide0:0:0): using PIO mode 4, DMA mode 2
ohci0 at pci0 dev 15 function 2 ServerWorks OSB4/CSB5 USB rev 0x05: 
apic 8 int 10 (irq 10), version