Hi guys,
OpenBSD 6.3 GENERIC.MP running on XEN (AWS m4.large) - dmesg at the end of this
mail. Image build from https://github.com/kolargol/openbsd-aws which is fork
from https://github.com/ajacoutot/aws-openbsd - my fork have some minor
modifications.
Recently i run into problems after using syspatch(8) on clean image.
Process:
- build
- deploy (m4.large)
- syspatch
- reboot
- crash on boot
First i thought it could be one of recent errata (that I yet not apply on other
xen servers), so i recreated image, did syspatch(8) and reverted up to
errata_001 - same crash.
Then i thought that bsd get corrupted somewhere when reordering, so i copied
bsd.mp, corrected KARL, booted…crash
My last attempt was to attach crashed disk and to inspect what happen there, no
logs but i found cores on root:
ip-10-1-0-43# find . -name *.core
./root/root/ntpd.core
./root/ls.core
./root/dhclient.core
./root/df.core
./root/perl.core
./root/date.core
./usr/share/relink/kernel/GENERIC.MP/jot.core
so i checked with gdb:
ip-10-1-0-43# gdb usr/bin/perl -c root/perl.core
GNU gdb 6.3
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "amd64-unknown-openbsd6.3"...(no debugging symbols
found)
Core was generated by `perl'.
Program terminated with signal 8, Arithmetic exception.
(no debugging symbols found)
Loaded symbols for /mnt/usr/bin/perl
Reading symbols from /usr/lib/libperl.so.18.1...done.
Loaded symbols for /usr/lib/libperl.so.18.1
Reading symbols from /usr/lib/libm.so.10.1...done.
Loaded symbols for /usr/lib/libm.so.10.1
Reading symbols from /usr/lib/libc.so.92.3...done.
Loaded symbols for /usr/lib/libc.so.92.3
Reading symbols from /usr/libexec/ld.so...done.
Loaded symbols for /usr/libexec/ld.so
#0 0x000002cca3fc212b in Perl_get_hash_seed (seed_buffer=0x2cca4388d48 "") at
util.c:4761
4761 util.c: No such file or directory.
in util.c
(gdb) bt
#0 0x000002cca3fc212b in Perl_get_hash_seed (seed_buffer=0x2cca4388d48 "") at
util.c:4761
#1 0x000002cca401818d in perl_construct (my_perl=Variable "my_perl" is not
available.
) at perl.c:273
#2 0x000002ca67d00630 in main () from /mnt/usr/bin/perl
Current language: auto; currently minimal
forgive me if i missed something obvious here - but any idea what is wrong
here? What can cause Floating point exception during boot on system that
otherwise works without patching?
and as promised, dmesg witch crash:
\|/-\|/booting hd0a:/bsd:
-\|/-8616075\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|+2454544/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-+262168\|/-\|/-\|/-\|/-\+0+671744|
[643151/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|+98+712056/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\+493074|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/]=0xd38788
-\|/-\|entry point at 0x1000158
[ using 1849224 bytes of bsd ELF symbol table ]
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
Copyright (c) 1995-2018 OpenBSD. All rights reserved. https://www.OpenBSD.org
OpenBSD 6.3 (GENERIC.MP) #6: Tue Jul 24 13:40:48 CEST 2018
[email protected]:/usr/src/sys/arch/amd64/compile/GENERIC.MP
real mem = 8573157376 (8176MB)
avail mem = 8306262016 (7921MB)
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root
bios0 at mainbus0: SMBIOS rev. 2.7 @ 0xeb01f (12 entries)
bios0: vendor Xen version "4.2.amazon" date 08/24/2006
bios0: Xen HVM domU
acpi0 at bios0: rev 2
acpi0: sleep states S3 S4 S5
acpi0: tables DSDT FACP APIC HPET WAET SSDT SSDT
acpi0: wakeup devices
acpitimer0 at acpi0: 3579545 Hz, 32 bits
acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat
ioapic0 at mainbus0: apid 1 pa 0xfec00000, version 11, 48 pins
, remapped to apid 1
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: Intel(R) Xeon(R) CPU E5-2686 v4 @ 2.30GHz, 2300.34 MHz
cpu0:
FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,MMX,FXSR,SSE,SSE2,HTT,SSE3,PCLMUL,SSSE3,FMA3,CX16,PCID,SSE4.1,SSE4.2,MOVBE,POPCNT,DEADLINE,AES,XSAVE,AVX,F16C,RDRAND,HV,NXE,PAGE1GB,RDTSCP,LONG,LAHF,ABM,FSGSBASE,BMI1,AVX2,SMEP,BMI2,ERMS,INVPCID,XSAVEOPT,MELTDOWN
cpu0: 256KB 64b/line 8-way L2 cache
cpu0: smt 0, core 0, package 0
mtrr: Pentium Pro MTRR support, 8 var ranges, 88 fixed ranges
cpu0: apic clock running at 100MHz
cpu1 at mainbus0: apid 1 (application processor)
cpu1: Intel(R) Xeon(R) CPU E5-2686 v4 @ 2.30GHz, 2300.31 MHz
cpu1:
FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,MMX,FXSR,SSE,SSE2,HTT,SSE3,PCLMUL,SSSE3,FMA3,CX16,PCID,SSE4.1,SSE4.2,MOVBE,POPCNT,DEADLINE,AES,XSAVE,AVX,F16C,RDRAND,HV,NXE,PAGE1GB,RDTSCP,LONG,LAHF,ABM,FSGSBASE,BMI1,AVX2,SMEP,BMI2,ERMS,INVPCID,XSAVEOPT,MELTDOWN
cpu1: 256KB 64b/line 8-way L2 cache
cpu1: smt 1, core 0, package 0
acpihpet0 at acpi0: 62500000 Hz
acpiprt0 at acpi0: bus 0 (PCI0)
acpicpu0 at acpi0: C1(@1 halt!)
acpicpu1 at acpi0: C1(@1 halt!)
"ACPI0007" at acpi0 not configured
"ACPI0007" at acpi0 not configured
pvbus0 at mainbus0: Xen 4.2
xen0 at pvbus0: features 0x705, 329 grant table frames, event channel 4
xbf0 at xen0 backend 0 channel 6: disk
scsibus1 at xbf0: 2 targets
sd0 at scsibus1 targ 0 lun 0: <Xen, phy hda 768, 0000> SCSI3 0/direct fixed
sd0: 25600MB, 512 bytes/sector, 52428800 sectors
xnf0 at xen0 backend 0 channel 7: address 06:e9:8b:a9:83:f0
"console" at xen0: device/console/0 not configured
pci0 at mainbus0 bus 0
pchb0 at pci0 dev 0 function 0 "Intel 82441FX" rev 0x02
pcib0 at pci0 dev 1 function 0 "Intel 82371SB ISA" rev 0x00
pciide0 at pci0 dev 1 function 1 "Intel 82371SB IDE" rev 0x00: DMA, channel 0
wired to compatibility, channel 1 wired to compatibility
pciide0: channel 0 disabled (no drives)
pciide0: channel 1 disabled (no drives)
piixpm0 at pci0 dev 1 function 3 "Intel 82371AB Power" rev 0x01: SMBus disabled
vga1 at pci0 dev 2 function 0 "Cirrus Logic CL-GD5446" rev 0x00
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
xspd0 at pci0 dev 3 function 0 "XenSource Platform Device" rev 0x01
isa0 at pcib0
isadma0 at isa0
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
com0: console
pckbc0 at isa0 port 0x60/5 irq 1 irq 12
pckbd0 at pckbc0 (kbd slot)
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pms0 at pckbc0 (aux slot)
wsmouse0 at pms0 mux 0
pcppi0 at isa0 port 0x61
spkr0 at pcppi0
vscsi0 at root
scsibus2 at vscsi0: 256 targets
softraid0 at root
scsibus3 at softraid0: 256 targets
root on sd0a (e00ea5eace8ee4cd.a) swap on sd0b dump on sd0b
Automatic boot in progress: starting file system checks.
/dev/sd0a (e00ea5eace8ee4cd.a): file system is clean; not checking
/dev/sd0i (e00ea5eace8ee4cd.i): file system is clean; not checking
/dev/sd0d (e00ea5eace8ee4cd.d): file system is clean; not checking
/dev/sd0f (e00ea5eace8ee4cd.f): file system is clean; not checking
/dev/sd0e (e00ea5eace8ee4cd.e): file system is clean; not checking
setting tty flags
pf enabled
starting network
fd0 at fdc0 drive 0: density unknown
Floating point exception (core dumped)
WARNING: /etc/hostname.xnf0 is insecure, fixing permissions.
Floating point exception (core dumped)
no IP address found for egress
stdin:1: could not parse host specification
pfctl: Syntax error in config file: pf rules not loaded
Floating point exception (core dumped)
Floating point exception (core dumped)
Floating point exception (core dumped)
reordering libraries:fd1 at fdc0 drive 1: density unknown
done.
starting early daemons: syslogd pflogd(failed) ntpd.
starting RPC daemons:.
savecore: no core dump
checking quotas: done.
clearing /tmp
kern.securelevel: 0 -> 1
creating runtime link editor directory cache.
preserving editor files.
Floating point exception (core dumped)
starting network daemons: sshd smtpd.
starting local daemons: cron.
Floating point exception (core dumped)
Floating point exception (core dumped)
reorder_kernel: kernel relinking failed; see
/usr/share/relink/kernel/GENERIC.MP/relink.log
OpenBSD/amd64 (ip-10-1-0-195.eu-west-1.compute.internal) (tty00)
login:
thanks!
_
Zbyszek Żółkiewski