Hello!
This CARP master crashes anywhere from a few moments to a few minutes
after it comes up. The slave, with the same configuration and almost the
same hardware (different NICs), has been running flawlessly for hours.
The CARP config was done following the FAQ:
# cat /etc/hostname.carp0
inet my.public.ip 255.255.255.240 NONE vhid 1 carpdev re0 pass
long-key-removed -inet6 description "CARP INET"
# cat /etc/hostname.carp1
inet 172.16.10.1 255.255.254.0 172.16.11.255 vhid 1 carpdev rl0 pass
long-key-removed -inet6 description "CARP LAN"
# cat /etc/hostname.pfsync0
up syncdev xl0
#
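As a sanity check on the LAN addressing in hostname.carp1, the broadcast
address can be derived from the address and netmask. This is a minimal
sketch of that arithmetic only; the helper names ipv4() and
ipv4_broadcast() are hypothetical and are not part of any OpenBSD API.

```c
#include <stdint.h>

/* Pack four dotted-quad octets into a host-order 32-bit value. */
static uint32_t
ipv4(uint8_t a, uint8_t b, uint8_t c, uint8_t d)
{
	return ((uint32_t)a << 24) | ((uint32_t)b << 16) |
	    ((uint32_t)c << 8) | (uint32_t)d;
}

/* Broadcast address: keep the network bits, set every host bit. */
static uint32_t
ipv4_broadcast(uint32_t addr, uint32_t mask)
{
	return (addr & mask) | ~mask;
}
```

With 172.16.10.1 and 255.255.254.0 this yields 172.16.11.255, which
matches the broadcast address given in hostname.carp1.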
dmesg and crash trace follow:
OpenBSD 6.0 (GENERIC) #1: Tue Mar 14 10:00:51 CET 2017
[email protected]:/binpatchng/work-binpatch60-i386/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) 4 CPU 2.40GHz ("GenuineIntel" 686-class) 2.40 GHz
cpu0:
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,CNXT-ID,xTPR,PERF
real mem = 668745728 (637MB)
avail mem = 643280896 (613MB)
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root
bios0 at mainbus0: date 02/26/03, BIOS32 rev. 0 @ 0xf0010, SMBIOS rev.
2.3 @ 0xfbfb0 (68 entries)
bios0: vendor Intel Corp. version "RG84510A.86A.0028.P15.0302260937"
date 02/26/2003
bios0: Intel Corporation D845GEBV2
acpi0 at bios0: rev 0
acpi0: sleep states S0 S1 S4 S5
acpi0: tables DSDT FACP APIC ASF!
acpi0: wakeup devices P0P1(S4) UAR1(S4) USB0(S4) USB1(S4) USB2(S4)
USB3(S4) AC97(S4) SLPB(S4)
acpitimer0 at acpi0: 3579545 Hz, 24 bits
acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat
cpu0 at mainbus0: apid 0 (boot processor)
mtrr: Pentium Pro MTRR support, 8 var ranges, 88 fixed ranges
cpu0: apic clock running at 99MHz
ioapic0 at mainbus0: apid 1 pa 0xfec00000, version 20, 24 pins
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 1 (P0P1)
acpicpu0 at acpi0: C1(@1 halt!)
acpipwrres0 at acpi0: URP1
acpipwrres1 at acpi0: FDDP
acpipwrres2 at acpi0: LPTP
"PNP0501" at acpi0 not configured
"PNP0400" at acpi0 not configured
"PNP0303" at acpi0 not configured
"PNP0F03" at acpi0 not configured
acpibtn0 at acpi0: SLPB
bios0: ROM list: 0xc0000/0xb200! 0xcb800/0x800 0xcc000/0x1000
pci0 at mainbus0 bus 0: configuration mode 1 (bios)
pchb0 at pci0 dev 0 function 0 "Intel 82845G Host" rev 0x03
inteldrm0 at pci0 dev 2 function 0 "Intel 82845G Video" rev 0x03
drm0 at inteldrm0
intagp0 at inteldrm0
agp0 at intagp0: aperture at 0xf0000000, size 0x8000000
inteldrm0: apic 1 int 16
inteldrm0: 1024x768
wsdisplay0 at inteldrm0 mux 1: console (std, vt100 emulation)
wsdisplay0: screen 1-5 added (std, vt100 emulation)
uhci0 at pci0 dev 29 function 0 "Intel 82801DB USB" rev 0x02: apic 1 int 16
uhci1 at pci0 dev 29 function 1 "Intel 82801DB USB" rev 0x02: apic 1 int 19
uhci2 at pci0 dev 29 function 2 "Intel 82801DB USB" rev 0x02: apic 1 int 18
ehci0 at pci0 dev 29 function 7 "Intel 82801DB USB" rev 0x02: apic 1 int 23
ehci0: timed out waiting for BIOS
usb0 at ehci0: USB revision 2.0
uhub0 at usb0 "Intel EHCI root hub" rev 2.00/1.00 addr 1
ppb0 at pci0 dev 30 function 0 "Intel 82801BA Hub-to-PCI" rev 0x82
pci1 at ppb0 bus 1
re0 at pci1 dev 2 function 0 "Realtek 8169" rev 0x10: RTL8169/8110SB
(0x1000), apic 1 int 18, address 14:cc:20:04:b3:28
rgephy0 at re0 phy 7: RTL8169S/8110S/8211 PHY, rev. 3
xl0 at pci1 dev 3 function 0 "3Com 3c905C 100Base-TX" rev 0x78: apic 1
int 19, address 00:0a:5e:44:4a:a0
exphy0 at xl0 phy 24: 3Com internal media interface
rl0 at pci1 dev 4 function 0 "Realtek 8139" rev 0x10: apic 1 int 17,
address 00:50:bf:39:0d:5c
rlphy0 at rl0 phy 0: RTL internal PHY
ichpcib0 at pci0 dev 31 function 0 "Intel 82801DB LPC" rev 0x02
pciide0 at pci0 dev 31 function 1 "Intel 82801DB IDE" rev 0x02: DMA,
channel 0 configured to compatibility, channel 1 configured to compatibility
wd0 at pciide0 channel 0 drive 0: <ST320014A>
wd0: 16-sector PIO, LBA, 19092MB, 39102336 sectors
wd1 at pciide0 channel 0 drive 1: <SAMSUNG SV2042H>
wd1: 16-sector PIO, LBA, 19465MB, 39865392 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
wd1(pciide0:0:1): using PIO mode 4, Ultra-DMA mode 5
pciide0: channel 1 ignored (disabled)
ichiic0 at pci0 dev 31 function 3 "Intel 82801DB SMBus" rev 0x02: apic 1
int 17
iic0 at ichiic0
adt0 at iic0 addr 0x2e: lm85 rev 0x60
spdmem0 at iic0 addr 0x50: 512MB DDR SDRAM non-parity PC2100CL2.5
spdmem1 at iic0 addr 0x51: 128MB DDR SDRAM non-parity PC2700CL2.5
auich0 at pci0 dev 31 function 5 "Intel 82801DB AC97" rev 0x02: apic 1
int 17, ICH4 AC97
ac97: codec id 0x41445374 (Analog Devices AD1981B)
ac97: codec features headphone, 20 bit DAC, No 3D Stereo
audio0 at auich0
usb1 at uhci0: USB revision 1.0
uhub1 at usb1 "Intel UHCI root hub" rev 1.00/1.00 addr 1
usb2 at uhci1: USB revision 1.0
uhub2 at usb2 "Intel UHCI root hub" rev 1.00/1.00 addr 1
usb3 at uhci2: USB revision 1.0
uhub3 at usb3 "Intel UHCI root hub" rev 1.00/1.00 addr 1
isa0 at ichpcib0
isadma0 at isa0
com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
com0: console
pckbc0 at isa0 port 0x60/5 irq 1 irq 12
pckbd0 at pckbc0 (kbd slot)
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pms0 at pckbc0 (aux slot)
wsmouse0 at pms0 mux 0
pcppi0 at isa0 port 0x61
spkr0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
vscsi0 at root
scsibus1 at vscsi0: 256 targets
softraid0 at root
scsibus2 at softraid0: 256 targets
root on wd0a (e72d52380906a909.a) swap on wd0b dump on wd0b
Automatic boot in progress: starting file system checks.
/dev/wd0a (e72d52380906a909.a): file system is clean; not checking
/dev/wd0d (e72d52380906a909.d): file system is clean; not checking
/dev/wd0e (e72d52380906a909.e): file system is clean; not checking
/dev/wd0f (e72d52380906a909.f): file system is clean; not checking
/dev/wd0g (e72d52380906a909.g): file system is clean; not checking
setting tty flags
pf enabled
net.inet.ip.forwarding: 0 -> 1
net.inet.ip.ifq.maxlen: 256 -> 512
kern.bufcachepercent: 20 -> 40
net.inet.carp.preempt: 0 -> 1
starting network
reordering libraries: done.
starting early daemons: syslogd pflogdpanic: m_clget: request for
4294967294 byte cluster
Stopped at Debugger+0x7: leave
TID PID UID PRFLAGS PFLAGS CPU COMMAND
*28342 28342 0 0x14000 0x210 0 softnet
Debugger(d09ee194,f5891e68,d09cc1f0,f5891e68,d64e9900) at Debugger+0x7
panic(d09cc1f0,fffffffe,2,d6262300,0) at panic+0x71
m_clget(d64e9900,2,fffffffe,d03bccaf,0) at m_clget+0xb2
m_dup_pkt(d6262800,2,2,d041763a,d1b18054) at m_dup_pkt+0x7c
carp_input(d1b18054,d6262800,d1d26280,d09c8983,0) at carp_input+0x100
if_input_process(d0b43f40,f5891f68,d03bfdd0,0,0) at if_input_process+0xa9
taskq_thread(d59e3040) at taskq_thread+0x4f
http://www.openbsd.org/ddb.html describes the minimum info required in bug
reports. Insufficient info makes it difficult to find and fix bugs.
ddb> trace
Debugger(d09ee194,f5891e68,d09cc1f0,f5891e68,d64e9900) at Debugger+0x7
panic(d09cc1f0,fffffffe,2,d6262300,0) at panic+0x71
m_clget(d64e9900,2,fffffffe,d03bccaf,0) at m_clget+0xb2
m_dup_pkt(d6262800,2,2,d041763a,d1b18054) at m_dup_pkt+0x7c
carp_input(d1b18054,d6262800,d1d26280,d09c8983,0) at carp_input+0x100
if_input_process(d0b43f40,f5891f68,d03bfdd0,0,0) at if_input_process+0xa9
taskq_thread(d59e3040) at taskq_thread+0x4f
ddb> ps
TID PPID PGRP UID S FLAGS WAIT COMMAND
7635 42639 49236 0 3 0x3 biowait pflogd
42639 92579 49236 0 3 0x10008b pause sh
92579 49236 49236 0 3 0x10008b pause sh
46375 48377 48377 73 3 0x100010 ffs_fsync syslogd
48377 1 48377 0 3 0x100080 netio syslogd
49236 1 49236 0 3 0x10008b pause sh
58106 0 0 0 3 0x14200 pgzero zerothread
24052 0 0 0 3 0x14200 aiodoned aiodoned
34198 0 0 0 3 0x14200 syncer update
65415 0 0 0 3 0x14200 cleaner cleaner
78044 0 0 0 3 0x14200 reaper reaper
49447 0 0 0 3 0x14200 pgdaemon pagedaemon
22847 0 0 0 3 0x14200 bored crynlk
26140 0 0 0 3 0x14200 bored crypto
84600 0 0 0 3 0x14200 pftm pfpurge
90332 0 0 0 3 0x14200 bored sensors
99271 0 0 0 3 0x14200 usbtsk usbtask
70610 0 0 0 3 0x14200 usbatsk usbatsk
72279 0 0 0 3 0x14200 bored i915
8331 0 0 0 3 0x40014200 acpi0 acpi0
*28342 0 0 0 7 0x14210 softnet
13417 0 0 0 3 0x14200 bored systqmp
76901 0 0 0 3 0x14200 bored systq
49336 0 0 0 3 0x40014200 idle0
27510 0 0 0 3 0x14200 kmalloc kmthread
1 0 1 0 3 0x82 wait init
0 -1 0 0 3 0x10200 scheduler swapper
ddb> shwo panic
No such command
ddb> show panic
m_clget: request for 4294967294 byte cluster
ddb> show uvm
Current UVM status:
pagesize=4096 (0x1000), pagemask=0xfff, pageshift=12
157254 VM pages: 1011 active, 1021 inactive, 864 wired, 145695 free
(18228 zero)
min 10% (25) anon, 10% (25) vnode, 5% (12) vtext
pages 0 anon, 0 vnode, 0 vtext
freemin=5241, free-target=6988, inactive-target=0, wired-max=52418
faults=43687, traps=43692, intrs=1647, ctxswitch=3288 fpuswitch=11
softint=1238, syscalls=91368, kmapent=21
fault counts:
noram=0, noanon=0, noamap=0, pgwait=0, pgrele=0
ok relocks(total)=2360(2360), anget(retries)=18996(0), amapcopy=25455
neighbor anon/obj pg=1196/16231, gets(lock/unlock)=7798/2361
cases: anon=14325, anoncow=4671, obj=7225, prcopy=572, przero=16893
daemon and swap counts:
woke=0, revs=0, scans=0, obscans=0, anscans=0
busy=0, freed=0, reactivate=0, deactivate=0
pageouts=0, pending=0, nswget=0
nswapdev=1, nanon=0, nanonneeded=0 nfreeanon=0
swpages=64259, swpginuse=0, swpgonly=0 paging=0
kernel pointers:
objs(kern)=0xd0b89580
ddb> show bcstats
Current Buffer Cache status:
numbufs 1383 busymapped 3, delwri 44
kvaslots 1216 avail kva slots 1213
bufpages 5502, dirtypages 176
pendingreads 5, pendingwrites 1
ddb> boot dump
syncing disks... 32 32 31 25 25 18 4 3 done
WARNING: not updating battery clock
dumping to dev 1, offset 10
dump 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234
233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216
215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198
197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180
179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162
161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144
143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126
125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108
107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87
86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63
62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39
38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15
14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 area improper
rebooting...
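
One observation on the panic value, offered as arithmetic only and not
as a diagnosis: 4294967294 is 0xfffffffe, the same value that appears in
the trace line for m_clget, and it is what -2 looks like when read as an
unsigned 32-bit integer. That would be consistent with a negative length
ending up where an unsigned size is expected, but that is an assumption
on my part. The helper below is a hypothetical illustration, not kernel
code.

```c
#include <stdint.h>

/*
 * Reinterpret a signed 32-bit value as unsigned, as happens when a
 * negative length is passed where an unsigned size is expected.
 */
static uint32_t
as_unsigned32(int32_t v)
{
	return (uint32_t)v;
}
```

Calling as_unsigned32(-2) gives 4294967294, the byte count reported in
the panic message.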