Good afternoon OpenBSD team,
I got my hands on legacy Sun Netra X1 (400MHz, 384M RAM ECC, 2x 80G IDE HDD).
I've installed 5.7 release with
root on softraid raid1 and enabled softdeps. I've got kernel panic while
compiling errata 003 and 004 in SSH session.
Opening 3rd session immediately ended with panic.
psycho0: uncorrectable DMA error AFAR 6656a250 (pa=0 tte=0/49c10012)
AFSR 41000
0ff40800000
- bioctl still reports healthy softraid
- the wrench LED is off on the machine -> should be healthy
- boot with diag mode doesn't report any errors
Should the machine be considered memory-constrained, so the softdeps
should stay off?
I would really appreciate any hint how to debug this further ...
Thank you very much for your time,
Jan Vlach
P.S. DDB sections marked with ^###
### WHAT I DID ###
1)
cd /usr/src/lib/libcrypto/crypto
make obj
make
make install
cd /usr/src/lib/libssl/ssl
make obj
make
make install
from:
http://ftp.openbsd.org/pub/OpenBSD/patches/5.7/common/003_openssl.patch.sig
2)
cd /usr/src/usr.sbin/smtpd
make clean
make obj
make
make install
from: http://ftp.openbsd.org/pub/OpenBSD/patches/5.7/common/004_smtpd.patch.sig
### DDB PS
ddb> ps
PID PPID PGRP UID S FLAGS WAIT COMMAND
5246 4675 31718 1000 2 0x10 sshd
* 4675 31718 31718 1000 7 0x10 sshd
5243 26563 31771 0 3 0x83 piperd as
27921 26563 31771 0 2 0x3 cc1
26563 31771 31771 0 3 0x83 wait cc
31718 2327 31718 0 3 0x92 poll sshd
20292 32327 31502 0 3 0x83 piperd as
4663 32327 31502 0 2 0x3 cc1
32327 6223 31502 0 3 0x83 wait cc
6223 32487 31502 0 3 0x8b pause make
32487 31502 31502 0 3 0x8b pause sh
31502 1655 31502 0 3 0x8b pause make
1655 16735 1655 0 3 0x8b pause ksh
16735 24352 16735 1000 3 0x8b pause ksh
24352 29509 29509 1000 3 0x90 select sshd
29509 2327 29509 0 3 0x92 poll sshd
5606 7198 5606 1000 3 0x83 poll top
31771 24863 31771 0 3 0x8b pause make
24863 23520 24863 0 3 0x8b pause ksh
12041 14579 12041 0 3 0x83 ttyin ksh
14579 27984 14579 1000 3 0x8b pause ksh
27984 31936 31936 1000 3 0x90 select sshd
31936 2327 31936 0 3 0x92 poll sshd
7198 1 7198 1000 3 0x8b pause ksh
23520 5025 23520 1000 3 0x8b pause ksh
5025 2062 2062 1000 3 0x90 select sshd
2062 2327 2062 0 3 0x92 poll sshd
24705 1 24705 0 3 0x80 poll cron
981 1 981 99 3 0x90 poll sndiod
31755 7068 7068 95 3 0x90 kqread smtpd
26407 7068 7068 95 3 0x90 kqread smtpd
30477 7068 7068 95 3 0x90 kqread smtpd
13510 7068 7068 95 3 0x90 kqread smtpd
4676 7068 7068 95 3 0x90 kqread smtpd
25400 7068 7068 103 3 0x90 kqread smtpd
7068 1 7068 0 3 0x80 kqread smtpd
2327 1 2327 0 3 0x80 select sshd
12659 5494 18102 83 3 0x90 poll ntpd
5494 18102 18102 83 3 0x90 poll ntpd
18102 1 18102 0 3 0x80 poll ntpd
10998 14096 14096 74 3 0x90 bpf pflogd
14096 1 14096 0 3 0x80 netio pflogd
6010 11552 11552 73 3 0x90 kqread syslogd
11552 1 11552 0 3 0x80 netio syslogd
30908 1 30908 77 3 0x90 poll dhclient
31053 1 31053 0 3 0x80 poll dhclient
10496 0 0 0 2 0x14200 zerothread
21761 0 0 0 3 0x14200 aiodoned aiodoned
138 0 0 0 3 0x14200 syncer update
1424 0 0 0 3 0x14200 cleaner cleaner
16186 0 0 0 3 0x14200 reaper reaper
4969 0 0 0 3 0x14200 pgdaemon pagedaemon
12560 0 0 0 3 0x14200 bored srdis
19078 0 0 0 3 0x14200 bored crypto
3784 0 0 0 3 0x14200 pftm pfpurge
12816 0 0 0 3 0x14200 usbtsk usbtask
23674 0 0 0 3 0x14200 usbatsk usbatsk
10916 0 0 0 3 0x14200 bored sensors
3519 0 0 0 3 0x14200 bored systqmp
17839 0 0 0 3 0x14200 bored systq
7509 0 0 0 3 0x40014200 idle0
24762 0 0 0 3 0x14200 kmalloc kmthread
1 0 1 0 3 0x82 wait init
0 -1 0 0 3 0x10200 scheduler swapper
### DDB TRACE
ddb> trace
psycho_ue(400005b9200, 40000f5ea00, 22d532, c, 54, 34) at psycho_ue+0x7c
intr_handler(e0017ec8, 400005b9300, 22d533, 4, 400069afac0, 40006bfd590) at int
r_handler+0xc
sparc_interrupt(0, 0, 4000c463bc8, 0, 0, 0) at sparc_interrupt+0x298
dofilewritev(0, 3, 4000c0b3080, 4000c463cd8, 0, 0) at dofilewritev+0x16c
sys_write(4000c29ab40, 4000c463db8, 4000c463df8, 0, 4010, 14b) at sys_write+0xb
0
syscall(4000c463ed0, 404, 274d37de28, 274d37de2c, 0, 0) at syscall+0x28c
softtrap(3, 26fd4e33d8, 54, 0, 0, 0) at softtrap+0x19c
### DDB TRACE
ddb> show panic
psycho0: uncorrectable DMA error AFAR 6656a250 (pa=0 tte=0/49c10012) AFSR 41000
0ff40800000
### DDB DMESG
ddb> dmesg
console is /pci@1f,0/isa@7/serial@0,3f8
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
Copyright (c) 1995-2015 OpenBSD. All rights reserved. http://www.OpenBSD.org
OpenBSD 5.7 (GENERIC) #466: Sun Mar 8 13:47:17 MDT 2015
[email protected]:/usr/src/sys/arch/sparc64/compile/GENERIC
real mem = 402653184 (384MB)
avail mem = 380633088 (363MB)
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root: Sun Netra X1 (UltraSPARC-IIe 400MHz)
cpu0 at mainbus0: SUNW,UltraSPARC-IIe (rev 1.4) @ 400 MHz
cpu0: physical 16K instruction (32 b/l), 16K data (32 b/l), 256K external (64 b
/l)
psycho0 at mainbus0: SUNW,sabre, impl 0, version 0, ign 7c0
psycho0: bus range 0-0, PCI bus 0
psycho0: dvma map 60000000-7fffffff
pci0 at psycho0
ebus0 at pci0 dev 7 function 0 "Acer Labs M1533 ISA" rev 0x00
"dma" at ebus0 addr 0-ffff ivec 0x2a not configured
rtc0 at ebus0 addr 70-71: m5819
power0 at ebus0 addr 2000-2007 ivec 0x23
lom0 at ebus0 addr 8010-8011 ivec 0x2a: LOMlite2 rev 3.8
com0 at ebus0 addr 3f8-3ff ivec 0x2b: ns16550a, 16 byte fifo
com0: console
com1 at ebus0 addr 2e8-2ef ivec 0x2b: ns16550a, 16 byte fifo
"flashprom" at ebus0 addr 0-7ffff not configured
alipm0 at pci0 dev 3 function 0 "Acer Labs M7101 Power" rev 0x00: 74KHz clock
iic0 at alipm0
"max1617" at alipm0 addr 0x18 skipped due to alipm0 bugs
spdmem0 at iic0 addr 0x56: 256MB SDRAM registered ECC PC133CL2
spdmem1 at iic0 addr 0x57: 128MB SDRAM registered ECC PC133CL2
dc0 at pci0 dev 12 function 0 "Davicom DM9102" rev 0x31: ivec 0x7c6, address 00
:03:ba:04:cf:f2
amphy0 at dc0 phy 1: DM9102 10/100 PHY, rev. 0
dc1 at pci0 dev 5 function 0 "Davicom DM9102" rev 0x31: ivec 0x7dc, address 00:
03:ba:04:cf:f3
amphy1 at dc1 phy 1: DM9102 10/100 PHY, rev. 0
ohci0 at pci0 dev 10 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x7e4, ver
sion 1.0, legacy support
pciide0 at pci0 dev 13 function 0 "Acer Labs M5229 UDMA IDE" rev 0xc3: DMA, cha
nnel 0 configured to native-PCI, channel 1 configured to native-PCI
pciide0: using ivec 0x7cc for native-PCI interrupt
wd0 at pciide0 channel 0 drive 0: <SAMSUNG SP0802N>
--db_more--
wd0: 16-sector PIO, LBA48, 76351MB, 156368016 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
wd1 at pciide0 channel 1 drive 0: <IC35L090AVV207-0>
wd1: 16-sector PIO, LBA48, 76293MB, 156250000 sectors
wd1(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
usb0 at ohci0: USB revision 1.0
uhub0 at usb0 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1
vscsi0 at root
scsibus1 at vscsi0: 256 targets
softraid0 at root
scsibus2 at softraid0: 256 targets
sd0 at scsibus2 targ 1 lun 0: <OPENBSD, SR RAID 1, 005> SCSI2 0/direct fixed
sd0: 76293MB, 512 bytes/sector, 156248544 sectors
bootpath: /pci@1f,0/ide@d,0/disk@0,0
root on sd0a (65b0748db208ec37.a) swap on sd0b dump on sd0b
syncing disks... done
sd0 detached
console is /pci@1f,0/isa@7/serial@0,3f8
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
Copyright (c) 1995-2015 OpenBSD. All rights reserved. http://www.OpenBSD.org
--db_more--
OpenBSD 5.7 (GENERIC) #466: Sun Mar 8 13:47:17 MDT 2015
[email protected]:/usr/src/sys/arch/sparc64/compile/GENERIC
real mem = 402653184 (384MB)
avail mem = 380633088 (363MB)
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root: Sun Netra X1 (UltraSPARC-IIe 400MHz)
cpu0 at mainbus0: SUNW,UltraSPARC-IIe (rev 1.4) @ 400 MHz
cpu0: physical 16K instruction (32 b/l), 16K data (32 b/l), 256K external (64 b
/l)
psycho0 at mainbus0: SUNW,sabre, impl 0, version 0, ign 7c0
psycho0: bus range 0-0, PCI bus 0
psycho0: dvma map 60000000-7fffffff
pci0 at psycho0
ebus0 at pci0 dev 7 function 0 "Acer Labs M1533 ISA" rev 0x00
"dma" at ebus0 addr 0-ffff ivec 0x2a not configured
rtc0 at ebus0 addr 70-71: m5819
power0 at ebus0 addr 2000-2007 ivec 0x23
lom0 at ebus0 addr 8010-8011 ivec 0x2a: LOMlite2 rev 3.8
com0 at ebus0 addr 3f8-3ff ivec 0x2b: ns16550a, 16 byte fifo
com0: console
com1 at ebus0 addr 2e8-2ef ivec 0x2b: ns16550a, 16 byte fifo
"flashprom" at ebus0 addr 0-7ffff not configured
--db_more--
alipm0 at pci0 dev 3 function 0 "Acer Labs M7101 Power" rev 0x00: 74KHz clock
iic0 at alipm0
"max1617" at alipm0 addr 0x18 skipped due to alipm0 bugs
spdmem0 at iic0 addr 0x56: 256MB SDRAM registered ECC PC133CL2
spdmem1 at iic0 addr 0x57: 128MB SDRAM registered ECC PC133CL2
dc0 at pci0 dev 12 function 0 "Davicom DM9102" rev 0x31: ivec 0x7c6, address 00
:03:ba:04:cf:f2
amphy0 at dc0 phy 1: DM9102 10/100 PHY, rev. 0
dc1 at pci0 dev 5 function 0 "Davicom DM9102" rev 0x31: ivec 0x7dc, address 00:
03:ba:04:cf:f3
amphy1 at dc1 phy 1: DM9102 10/100 PHY, rev. 0
ohci0 at pci0 dev 10 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x7e4, ver
sion 1.0, legacy support
pciide0 at pci0 dev 13 function 0 "Acer Labs M5229 UDMA IDE" rev 0xc3: DMA, cha
nnel 0 configured to native-PCI, channel 1 configured to native-PCI
pciide0: using ivec 0x7cc for native-PCI interrupt
wd0 at pciide0 channel 0 drive 0: <SAMSUNG SP0802N>
wd0: 16-sector PIO, LBA48, 76351MB, 156368016 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
wd1 at pciide0 channel 1 drive 0: <IC35L090AVV207-0>
wd1: 16-sector PIO, LBA48, 76293MB, 156250000 sectors
wd1(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
usb0 at ohci0: USB revision 1.0
--db_more--
uhub0 at usb0 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1
vscsi0 at root
scsibus1 at vscsi0: 256 targets
softraid0 at root
scsibus2 at softraid0: 256 targets
sd0 at scsibus2 targ 1 lun 0: <OPENBSD, SR RAID 1, 005> SCSI2 0/direct fixed
sd0: 76293MB, 512 bytes/sector, 156248544 sectors
bootpath: /pci@1f,0/ide@d,0/disk@0,0
root on sd0a (65b0748db208ec37.a) swap on sd0b dump on sd0b
panic: psycho0: uncorrectable DMA error AFAR 6656a250 (pa=0 tte=0/49c10012) AFS
R 410000ff40800000
kdb breakpoint at 156b5a4
Stopped at Debugger+0x8: nop
RUN AT LEAST 'trace' AND 'ps' AND INCLUDE OUTPUT WHEN REPORTING THIS PANIC!
DO NOT EVEN BOTHER REPORTING THIS WITHOUT INCLUDING THAT INFORMATION!
ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb> ddb>
ddb> ddb> psycho_ue(400005b9200, 40000f5ea00, 22d532, c, 54, 34) at psycho_ue+
0x7c
intr_handler(e0017ec8, 400005b9300, 22d533, 4, 400069afac0, 40006bfd590) at int
r_handler+0xc
sparc_interrupt(0, 0, 4000c463bc8, 0, 0, 0) at sparc_interrupt+0x298
dofilewritev(0, 3, 4000c0b3080, 4000c463cd8, 0, 0) at dofilewritev+0x16c
sys_write(4000c29ab40, 4000c463db8, 4000c463df8, 0, 4010, 14b) at sys_write+0xb
--db_more--
0
syscall(4000c463ed0, 404, 274d37de28, 274d37de2c, 0, 0) at syscall+0x28c
softtrap(3, 26fd4e33d8, 54, 0, 0, 0) at softtrap+0x19c
ddb> PID PPID PGRP UID S FLAGS WAIT COMMAND
5246 4675 31718 1000 2 0x10 sshd
* 4675 31718 31718 1000 7 0x10 sshd
5243 26563 31771 0 3 0x83 piperd as
27921 26563 31771 0 2 0x3 cc1
26563 31771 31771 0 3 0x83 wait cc
31718 2327 31718 0 3 0x92 poll sshd
20292 32327 31502 0 3 0x83 piperd as
4663 32327 31502 0 2 0x3 cc1
32327 6223 31502 0 3 0x83 wait cc
6223 32487 31502 0 3 0x8b pause make
32487 31502 31502 0 3 0x8b pause sh
31502 1655 31502 0 3 0x8b pause make
1655 16735 1655 0 3 0x8b pause ksh
16735 24352 16735 1000 3 0x8b pause ksh
24352 29509 29509 1000 3 0x90 select sshd
29509 2327 29509 0 3 0x92 poll sshd
5606 7198 5606 1000 3 0x83 poll top
31771 24863 31771 0 3 0x8b pause make
24863 23520 24863 0 3 0x8b pause ksh
--db_more--
12041 14579 12041 0 3 0x83 ttyin ksh
14579 27984 14579 1000 3 0x8b pause ksh
27984 31936 31936 1000 3 0x90 select sshd
31936 2327 31936 0 3 0x92 poll sshd
7198 1 7198 1000 3 0x8b pause ksh
23520 5025 23520 1000 3 0x8b pause ksh
5025 2062 2062 1000 3 0x90 select sshd
2062 2327 2062 0 3 0x92 poll sshd
24705 1 24705 0 3 0x80 poll cron
981 1 981 99 3 0x90 poll sndiod
31755 7068 7068 95 3 0x90 kqread smtpd
26407 7068 7068 95 3 0x90 kqread smtpd
30477 7068 7068 95 3 0x90 kqread smtpd
13510 7068 7068 95 3 0x90 kqread smtpd
4676 7068 7068 95 3 0x90 kqread smtpd
25400 7068 7068 103 3 0x90 kqread smtpd
7068 1 7068 0 3 0x80 kqread smtpd
2327 1 2327 0 3 0x80 select sshd
12659 5494 18102 83 3 0x90 poll ntpd
5494 18102 18102 83 3 0x90 poll ntpd
18102 1 18102 0 3 0x80 poll ntpd
10998 14096 14096 74 3 0x90 bpf pflogd
14096 1 14096 0 3 0x80 netio pflogd
6010 11552 11552 73 3 0x90 kqread syslogd
11552 1 11552 0 3 0x80 netio syslogd
30908 1 30908 77 3 0x90 poll dhclient
31053 1 31053 0 3 0x80 poll dhclient
10496 0 0 0 2 0x14200 zerothread
21761 0 0 0 3 0x14200 aiodoned aiodoned
138 0 0 0 3 0x14200 syncer update
1424 0 0 0 3 0x14200 cleaner cleaner
16186 0 0 0 3 0x14200 reaper reaper
4969 0 0 0 3 0x14200 pgdaemon pagedaemon
12560 0 0 0 3 0x14200 bored srdis
19078 0 0 0 3 0x14200 bored crypto
3784 0 0 0 3 0x14200 pftm pfpurge
12816 0 0 0 3 0x14200 usbtsk usbtask
23674 0 0 0 3 0x14200 usbatsk usbatsk
10916 0 0 0 3 0x14200 bored sensors
3519 0 0 0 3 0x14200 bored systqmp
17839 0 0 0 3 0x14200 bored systq
7509 0 0 0 3 0x40014200 idle0
24762 0 0 0 3 0x14200 kmalloc kmthread
1 0 1 0 3 0x82 wait init
0 -1 0 0 3 0x10200 scheduler swapper
### DMESG after reboot
console is /pci@1f,0/isa@7/serial@0,3f8
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
Copyright (c) 1995-2015 OpenBSD. All rights reserved. http://www.OpenBSD.org
OpenBSD 5.7 (GENERIC) #466: Sun Mar 8 13:47:17 MDT 2015
[email protected]:/usr/src/sys/arch/sparc64/compile/GENERIC
real mem = 402653184 (384MB)
avail mem = 380633088 (363MB)
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root: Sun Netra X1 (UltraSPARC-IIe 400MHz)
cpu0 at mainbus0: SUNW,UltraSPARC-IIe (rev 1.4) @ 400 MHz
cpu0: physical 16K instruction (32 b/l), 16K data (32 b/l), 256K external (64
b/l)
psycho0 at mainbus0: SUNW,sabre, impl 0, version 0, ign 7c0
psycho0: bus range 0-0, PCI bus 0
psycho0: dvma map 60000000-7fffffff
pci0 at psycho0
ebus0 at pci0 dev 7 function 0 "Acer Labs M1533 ISA" rev 0x00
"dma" at ebus0 addr 0-ffff ivec 0x2a not configured
rtc0 at ebus0 addr 70-71: m5819
power0 at ebus0 addr 2000-2007 ivec 0x23
lom0 at ebus0 addr 8010-8011 ivec 0x2a: LOMlite2 rev 3.8
com0 at ebus0 addr 3f8-3ff ivec 0x2b: ns16550a, 16 byte fifo
com0: console
com1 at ebus0 addr 2e8-2ef ivec 0x2b: ns16550a, 16 byte fifo
"flashprom" at ebus0 addr 0-7ffff not configured
alipm0 at pci0 dev 3 function 0 "Acer Labs M7101 Power" rev 0x00: 74KHz clock
iic0 at alipm0
"max1617" at alipm0 addr 0x18 skipped due to alipm0 bugs
spdmem0 at iic0 addr 0x56: 256MB SDRAM registered ECC PC133CL2
spdmem1 at iic0 addr 0x57: 128MB SDRAM registered ECC PC133CL2
dc0 at pci0 dev 12 function 0 "Davicom DM9102" rev 0x31: ivec 0x7c6, address
00:03:ba:04:cf:f2
amphy0 at dc0 phy 1: DM9102 10/100 PHY, rev. 0
dc1 at pci0 dev 5 function 0 "Davicom DM9102" rev 0x31: ivec 0x7dc, address
00:03:ba:04:cf:f3
amphy1 at dc1 phy 1: DM9102 10/100 PHY, rev. 0
ohci0 at pci0 dev 10 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x7e4,
version 1.0, legacy support
pciide0 at pci0 dev 13 function 0 "Acer Labs M5229 UDMA IDE" rev 0xc3: DMA,
channel 0 configured to native-PCI, channel 1 configured to native-PCI
pciide0: using ivec 0x7cc for native-PCI interrupt
wd0 at pciide0 channel 0 drive 0: <SAMSUNG SP0802N>
wd0: 16-sector PIO, LBA48, 76351MB, 156368016 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
wd1 at pciide0 channel 1 drive 0: <IC35L090AVV207-0>
wd1: 16-sector PIO, LBA48, 76293MB, 156250000 sectors
wd1(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
usb0 at ohci0: USB revision 1.0
uhub0 at usb0 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1
vscsi0 at root
scsibus1 at vscsi0: 256 targets
softraid0 at root
scsibus2 at softraid0: 256 targets
softraid0: sd0 was not shutdown properly
sd0 at scsibus2 targ 1 lun 0: <OPENBSD, SR RAID 1, 005> SCSI2 0/direct fixed
sd0: 76293MB, 512 bytes/sector, 156248544 sectors
bootpath: /pci@1f,0/ide@d,0/disk@0,0
root on sd0a (65b0748db208ec37.a) swap on sd0b dump on sd0b
WARNING: / was not properly unmounted