Hello All,
Synopsis: spurious crash in softnet
Category: kernel sgi
Environment:
System: OpenBSD 6.0
Details: OpenBSD 6.0-beta (GENERIC-IP22) #664: Sun Jul 10 00:31:39 MDT 2016
Architecture: SGI (MIPS64)
Machine: Challenge S R5000
Description:
The machine hangs at seemingly random intervals. This has occurred on
several recent snapshots, including the 10-Jul one. I have reproduced
the issue on multiple systems, so it does not appear to be a hardware
issue. It may not be relevant, but the machines run with an MTU of
1454 on sq1.
How-To-Repeat:
The hang recurs given enough time, but I have not been able to tie it
to any specific sequence of events. The machine typically does not stay
up longer than a day, though it occasionally has.
Fix: Unknown
sq1: receive FIFO overflow
Trap cause = 4 Frame 0xffffffff91f439b0
Trap PC 0xffffffff888b2be0 RA 0xffffffff888b2dbc fault 0xd97d3b7057b9cf7b
pool_put+0xa8
(1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358)
ra 0xffffffff888d18f0 sp 0xffffffff91f43b08,0
m_extfree+0x110
(1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358)
ra 0xffffffff888d1fa0 sp 0xffffffff91f43ba2
m_free+0x138
(1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358)
ra 0xffffffff888d20b0 sp 0xffffffff91f43bc8, 8
m_freem+0x28
(1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358)
ra 0xffffffff88961b88 sp 0xffffffff91f43bf8, 2
in_arpinput+0x88
(1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358)
ra 0xffffffff8892169c sp 0xffffffff91f43c4
ether_input+0x334
(1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358)
ra 0xffffffff8891df38 sp 0xffffffff91f432
User-level: pid 34898
stopped on non ddb fault
Stopped at pool_put+0xa8: ld v0,8(v1)
ddb> trace
pool_put+0xa8 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff888d18f0 sp 0xffffffff91f43b08, sz 160
m_extfree+0x110 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff888d1fa0 sp 0xffffffff91f43ba8, sz 32
m_free+0x138 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff888d20b0 sp 0xffffffff91f43bc8, sz 48
m_freem+0x28 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff88961b88 sp 0xffffffff91f43bf8, sz 32
in_arpinput+0x88 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff8892169c sp 0xffffffff91f43c18, sz 144
ether_input+0x334 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff8891df38 sp 0xffffffff91f43ca8, sz 112
if_input_process+0xf8 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff888a3968 sp 0xffffffff91f43d18, sz 80
taskq_thread+0xd0 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff88a797fc sp 0xffffffff91f43d68, sz 80
proc_trampoline+0x1c (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0x0 sp 0xffffffff91f43db8, sz 0
User-level: pid 34898
ddb> ps
TID PPID PGRP UID S FLAGS WAIT COMMAND
74365 1 74365 0 3 0x100083 ttyin getty
2436 1 2436 0 3 0x100098 poll cron
35314 58087 58087 619 3 0x82 kqread bandb
40008 58087 58087 619 3 0x82 kqread ssld
55415 58087 58087 619 3 0x82 kqread resolver
58087 1 58087 619 3 0x90 kqread ircd
82599 1 82599 0 3 0x80 select sshd
65889 67599 99686 83 3 0x100090 poll ntpd
67599 99686 99686 83 3 0x100090 poll ntpd
99686 1 99686 0 3 0x80 poll ntpd
97827 46866 46866 74 3 0x100090 bpf pflogd
46866 1 46866 0 3 0x80 netio pflogd
94958 52247 52247 73 2 0x100090 syslogd
52247 1 52247 0 3 0x100080 netio syslogd
8668 0 0 0 3 0x14200 pgzero zerothread
60775 0 0 0 3 0x14200 aiodoned aiodoned
87475 0 0 0 3 0x14200 syncer update
81321 0 0 0 3 0x14200 cleaner cleaner
75445 0 0 0 3 0x14200 reaper reaper
67147 0 0 0 3 0x14200 pgdaemon pagedaemon
26226 0 0 0 3 0x14200 bored crynlk
85686 0 0 0 3 0x14200 bored crypto
71123 0 0 0 3 0x14200 pftm pfpurge
*34898 0 0 0 7 0x14210 softnet
93986 0 0 0 3 0x14200 bored systqmp
52845 0 0 0 3 0x14200 bored systq
16345 0 0 0 3 0x40014200 idle0
69755 0 0 0 3 0x14200 kmalloc kmthread
1 0 1 0 3 0x82 wait init
0 -1 0 0 3 0x10200 scheduler swapper
ddb> show panic
the kernel did not panic
ddb> show registers
at 0xffffffff88b60000 sysent+0xec0
v0 0xd97d3b7057b9cf73
v1 0xd97d3b7057b9cf73
a0 0x1becdf323dc0c775
a1 0xc0000000030a2800
a2 0xc0000000030f87e0
a3 0xffffffff888d2358 m_extfree_pool
a4 0xffffffff91f43be6 end+0x92e34b6
a5 0x14
a6 0x18
a7 0x8
t0 0x4
t1 0xffffffff88c0e2f0 kernel_pmap_store
t2 0
t3 0xffffffff91f40000 end+0x92df8d0
s0 0xc0000000030f87e0
s1 0xc0000000030a2800
s2 0xffffffff88b88070 mclpools
s3 0x1
s4 0xc0000000000de078
s5 0
s6 0xc0000000030a2818
s7 0xffffffff91f43c38 end+0x92e3508
t8 0x52f2c064
t9 0xffffffff88a95188 int2_splx
k0 0xffffffff8894a114 rtable_match+0x84
k1 0xc000000002f40bc0
gp 0xffffffff88b64430 _gp
sp 0xffffffff91f43b08 end+0x92e33d8
s8 0
ra 0xffffffff888b2dbc pool_put+0x284
sr 0x1000cfa3
lo 0x61861862
hi 0
bad 0xd97d3b7057b9cf7b
cs 0x10
pc 0xffffffff888b2be0 pool_put+0xa8
pool_put+0xa8: ld v0,8(v1)
ddb> continue
panic: trap
Stopped at Debugger+0x4: jr ra
Debugger+0x8: nop
TID PID UID PRFLAGS PFLAGS CPU COMMAND
*34898 34898 0 0x14000 0x210 0 softnet
Debugger+0x4 (73e2c57b1f779808,900000001fbd9880,900000001fbd9830,ffffffff91f43830) ra 0xffffffff888b6040 sp 0xffffffff91f43868, sz 0
panic+0x100 (73e2c57b1f779808,ffffffff91f43af0,0,ffffffff88c0eb20) ra 0xffffffff88a76aec sp 0xffffffff91f43868, sz 112
itsa+0xf4 (73e2c57b1f779808,ffffffff91f43af0,0,ffffffff88c0eb20) ra 0xffffffff88a7a2fc sp 0xffffffff91f438d8, sz 176
k_general+0x114 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0x0 sp 0xffffffff91f43988, sz 0
(KERNEL TRAP)
pool_put+0xa8 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff888d18f0 sp 0xffffffff91f43b08, sz 160
m_extfree+0x110 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff888d1fa0 sp 0xffffffff91f43ba8, sz 32
m_free+0x138 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff888d20b0 sp 0xffffffff91f43bc8, sz 48
m_freem+0x28 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff88961b88 sp 0xffffffff91f43bf8, sz 32
in_arpinput+0x88 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff8892169c sp 0xffffffff91f43c18, sz 144
ether_input+0x334 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff8891df38 sp 0xffffffff91f43ca8, sz 112
if_input_process+0xf8 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff888a3968 sp 0xffffffff91f43d18, sz 80
taskq_thread+0xd0 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffffffff88a797fc sp 0xffffffff91f43d68, sz 80
proc_trampoline+0x1c (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0x0 sp 0xffffffff91f43db8, sz 0
User-level: pid 34898
http://www.openbsd.org/ddb.html describes the minimum info required in bug
reports. Insufficient info makes it difficult to find and fix bugs.
ddb> boot reboot
panic: wd33c93_scsicmd: busy
Stopped at Debugger+0x4: jr ra
Debugger+0x8: nop
Debugger+0x4 (73e2c57b1f779808,900000001fbd9880,900000001fbd9830,ffffffff91f42be0) ra 0xffffffff888b6040 sp 0xffffffff91f42c18, sz 0
panic+0x100 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff88806960 sp 0xffffffff91f42c18, sz 112
wd33c93_scsi_cmd+0x280 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff88aa9e18 sp 0xffffffff91f42c88, sz 64
scsi_xs_sync+0xb8 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff88ab3454 sp 0xffffffff91f42cc8, sz 64
sd_flush+0x8c (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff88ab54d8 sp 0xffffffff91f42d08, sz 48
sdactivate+0x140 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff888a54dc sp 0xffffffff91f42d38, sz 48
config_suspend+0x3c (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff88aae184 sp 0xffffffff91f42d68, sz 48
scsi_activate_target+0x54 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff88aae20c sp 0xffffffff91f42d98, sz 64
scsi_activate_bus+0x44 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff888a54dc sp 0xffffffff91f42dd8, sz 64
config_suspend+0x3c (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff888a5398 sp 0xffffffff91f42e18, sz 48
config_activate_children+0x78 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff888a5510 sp 0xffffffff91f42e48, sz 80
config_suspend+0x70 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra
ddb> boot reboot
System restart.
sc0,1,0: cmd=0x12 timeout after 2 sec. Resetting SCSI bus
[ using 388944 bytes of bsd ELF symbol table ]
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
Copyright (c) 1995-2016 OpenBSD. All rights reserved. http://www.OpenBSD.org
OpenBSD 6.0-beta (GENERIC-IP22) #664: Sun Jul 10 00:31:39 MDT 2016
[email protected]:/usr/src/sys/arch/sgi/compile/GENERIC-IP22
real mem = 167772160 (160MB)
rsvd mem = 802816 (1MB)
avail mem = 160169984 (152MB)
mainbus0 at root: Challenge S
cpu0 at mainbus0: MIPS R5000 CPU rev 1.0 150 MHz, R5000 based FPC rev 1.0
cpu0: cache L1-I 32KB D 32KB 2 way, L2 512KB direct
int0 at mainbus0 addr 0x1fbd9880
imc0 at mainbus0: revision 3
gio0 at imc0
hpc0 at gio0 addr 0x1fb80000: SGI HPC3 (onboard)
zs0 at hpc0 offset 0x00059830 irq 29: 85230
zstty0 at zs0 channel 1: console
zstty1 at zs0 channel 0
sq0 at hpc0 offset 0x00054000 irq 3: Seeq 80c03, address 08:00:69:0a:34:09
wdsc0 at hpc0 offset 0x00044000 irq 1: WD33C93B, 20.0 MHz, burst DMA
wdsc0: microcode revision 0x0d, fast SCSI
scsibus0 at wdsc0: 8 targets, initiator 0
sd0 at scsibus0 targ 1 lun 0: <SEAGATE, ST39103LCSUN9.0G, 034A> SCSI2
0/direct fixed serial.SEAGATE_ST39103LCSUN9.0GLS4557570000101519ZQ
sd0: 8637MB, 512 bytes/sector, 17689267 sectors
pione at hpc0 offset 0x00059800 irq 5 not configured
panel0 at hpc0 offset 0x00059850 irq 9: power button
dsclock0 at hpc0 offset 0x00060000
hpc1 at gio0 addr 0x1fb00000: SGI HPC3 (IO+ mezzanine)
hpc1: using EXP1's DMA channel
sq1 at hpc1 offset 0x00054000 irq 0: Seeq 80c03, address 08:00:69:02:64:d1
clock0 at mainbus0: int 5
vscsi0 at root
scsibus1 at vscsi0: 256 targets
softraid0 at root
scsibus2 at softraid0: 256 targets
boot device: sd0
root on sd0a (ffbd62fcf39fc195.a) swap on sd0b dump on sd0b
WARNING: / was not properly unmounted
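
One observation on the first register dump above, offered as a sketch
rather than a diagnosis. The faulting instruction is "ld v0,8(v1)", v1
holds 0xd97d3b7057b9cf73, and the reported bad address
0xd97d3b7057b9cf7b is exactly v1 + 8. That address is not 8-byte
aligned, which, as I read it, would be consistent with trap cause 4
(address error on load on MIPS). The value in v1 also does not resemble
the kernel addresses elsewhere in the dump, so the pointer that pool_put
loaded may have been corrupt. The Python below only re-checks that
arithmetic; the function name is my own and is not taken from the
kernel.

```python
# Re-check the faulting address from the ddb register dump.
# Values are copied from "show registers"; effective_address() is a
# hypothetical helper written for this note only.

MASK64 = 0xFFFFFFFFFFFFFFFF


def effective_address(base: int, offset: int) -> int:
    """Address a MIPS64 load 'ld rt,offset(base)' would access."""
    return (base + offset) & MASK64


V1 = 0xD97D3B7057B9CF73   # v1 at the time of the trap
BAD = 0xD97D3B7057B9CF7B  # "bad" register (faulting address)

addr = effective_address(V1, 8)

# True if the fault address is exactly v1 + 8.
matches = addr == BAD

# A doubleword load needs an 8-byte aligned address; a nonzero
# remainder means the access was misaligned.
misalignment = addr % 8

print(hex(addr), matches, misalignment)
```

If the check holds, the trap was raised by the load itself and not by
something later in pool_put, which narrows the question to where v1 got
its value.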