> Date: Thu, 29 Apr 2021 09:52:10 +0100 > From: Stuart Henderson <s...@spacehopper.org> > > On 2021/04/27 10:35, Alexander Bluhm wrote: > > On Mon, Apr 26, 2021 at 07:43:29PM +0200, Alexander Bluhm wrote: > > > One of my i386 machines paniced during make -j 9 build. > > > > This is perfectly reproducable. Machine crashes while building > > clang. This time with snapshot kernel. > > Same panic here building ports (base build was done with an older > kernel) on 2/4 builders. > > I'm now going to try with the "Convert allocations to km_alloc(9)" > commit reverted (i386/pmap.c:1.211->1.212, i386/pmapae.c:1.60->1.61) > > > > panic: kernel diagnostic assertion "pg->wire_count == 0" failed: file > > "/usr/src/sys/uvm/uvm_page.c", line 1265 > > Stopped at db_enter+0x4: popl %ebp > > TID PID UID PRFLAGS PFLAGS CPU COMMAND > > 100962 21754 21 0x3 0 3 c++ > > 165585 775 21 0x3 0 4 c++ > > 482715 4014 21 0x3 0 2 c++ > > 104450 78451 21 0x3 0 6 c++ > > 492054 2530 21 0x3 0 7 c++ > > 441463 23628 21 0x3 0 5 c++ > > 266171 31308 21 0x3 0 0 c++ > > *453385 97818 0 0x14000 0x200 1K pagedaemon > > db_enter() at db_enter+0x4 > > panic(d0bd507b) at panic+0xd3 > > __assert(d0c39ae9,d0bad7c6,4f1,d0c4a94c) at __assert+0x19 > > uvm_pagedeactivate(d4fd923c) at uvm_pagedeactivate+0x122 > > uvmpd_scan() at uvmpd_scan+0x294 > > uvm_pageout(d6fc34c8) at uvm_pageout+0x365 > > Exact same trace. > > panic: kernel diagnostic assertion "pg->wire_count == 0" failed: file > "/usr/src/sys/uvm/uvm_page.c", line 1265 > Stopped at db_enter+0x4: popl %ebp > TID PID UID PRFLAGS PFLAGS CPU COMMAND > 234486 25502 0 0x11 0 2 perl > 214247 55192 55 0x2 0x4000000 0 rustc.bin > 471121 59408 0 0x13 0 3 perl > * 31012 31392 0 0x14000 0x200 1K pagedaemon > > > version: OpenBSD 6.9-current (GENERIC.MP) #802: Mon Apr 26 02:54:36 > > MDT 2021\012 > > dera...@i386.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC.MP\012 > > OpenBSD 6.9-current (GENERIC.MP) #0: Wed Apr 28 21:36:47 MDT 2021 > st...@i386.ports.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC.MMP > > Built from a clean CVS checkout (no diffs) done at Thu Apr 29 01:20:58 UTC > 2021 > > > ddb{1}> show register > > ds 0x10 > > es 0x10 > > fs 0x20 > > gs 0 > > edi 0xd0bd507b acx100_txpower_maxim+0xe81a > > esi 0x100 > > ebp 0xf582662c > > ebx 0xf5826654 > > edx 0x3fd > > ecx 0 > > eax 0x1 > > eip 0xd08a8874 db_enter+0x4 > > cs 0x8 > > eflags 0x202 > > esp 0xf582662c > > ss 0x10 > > db_enter+0x4: popl %ebp > > same except for > > edi 0xd0bc972d acx100_txpower_maxim+0xc82f > ebp 0xf598b6cc > ebx 0xf598b6f4 > eip 0xd0817b04 db_enter+0x4 > esp 0xf598b6cc > > > ddb{1}> show uvmexp > > Current UVM status: > > pagesize=4096 (0x1000), pagemask=0xfff, pageshift=12 > > 832339 VM pages: 517320 active, 5058 inactive, 1 wired, 147968 free (0 > > zero) > > min 10% (25) anon, 10% (25) vnode, 5% (12) vtext > > freemin=27744, free-target=36992, inactive-target=208035, wired-max=277446 > > faults=201515090, traps=201773820, intrs=2524713, ctxswitch=15585835 > > fpuswitch=99375 > > softint=3182921, syscalls=287857563, kmapent=15 > > fault counts: > > noram=0, noanon=0, noamap=0, pgwait=0, pgrele=0 > > ok relocks(total)=269519(271199), anget(retries)=141363890(0), > > amapcopy=24784161 > > neighbor anon/obj pg=13486052/85376915, > > gets(lock/unlock)=27741219/271199 > > cases: anon=139829185, anoncow=1534705, obj=27205510, prcopy=534029, > > przero=32411649 > > daemon and swap counts: > > woke=2, revs=1, scans=101727, obscans=101727, anscans=0 > > busy=0, freed=101727, reactivate=0, deactivate=0 > > pageouts=0, pending=0, nswget=0 > > nswapdev=1 > > swpages=917207, swpginuse=0, swpgonly=0 paging=0 > > kernel pointers: > > objs(kern)=0xd0e719ac > > Current UVM status: > pagesize=4096 (0x1000), pagemask=0xfff, pageshift=12 > 564802 VM pages: 298864 active, 47935 inactive, 1 wired, 100404 free (2361 > zero) > min 10% (25) anon, 10% (25) vnode, 5% (12) vtext > freemin=18826, free-target=25101, inactive-target=141715, wired-max=188267 > faults=827797786, traps=847312744, intrs=42461394, ctxswitch=169637771 > fpuswitch=1593503 > softint=25918765, syscalls=579364703, kmapent=28 > fault counts: > noram=0, noanon=0, noamap=0, pgwait=0, pgrele=0 > ok relocks(total)=1345165(1347679), anget(retries)=367906740(0), > amapcopy=128308912 > neighbor anon/obj pg=299836900/426127687, > gets(lock/unlock)=116789120/1347679 > cases: anon=273495004, anoncow=94411736, obj=109145091, prcopy=7641515, > przero=343104381 > daemon and swap counts: > woke=3, revs=1, scans=78347, obscans=78347, anscans=0 > busy=0, freed=78347, reactivate=0, deactivate=0 > pageouts=0, pending=0, nswget=0 > nswapdev=1 > swpages=644714, swpginuse=0, swpgonly=0 paging=0 > kernel pointers: > objs(kern)=0xd0ece538 > > OpenBSD 6.9-current (GENERIC.MP) #0: Wed Apr 28 21:36:47 MDT 2021 > st...@i386.ports.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC.MP > RTC BIOS diagnostic error 80<clock_battery> > real mem = 2372272128 (2262MB) > avail mem = 2312630272 (2205MB) > random: good seed from bootblocks > mpath0 at root > scsibus0 at mpath0: 256 targets > mainbus0 at root > bios0 at mainbus0: date 12/15/15, BIOS32 rev. 0 @ 0xf4cc0, SMBIOS rev. 2.8 @ > 0x8ef68000 (39 entries) > bios0: vendor Dell Inc. version "2.5.0" date 05/03/2018 > > bios0: Dell Inc. PowerEdge R230 > acpi0 at bios0: ACPI 5.0 > acpi0: sleep states S0 S5 > acpi0: tables DSDT FACP BOOT SSDT SLIC HPET LPIT APIC MCFG WDAT SSDT DBGP > DBG2 SSDT SSDT SSDT SSDT DMAR SSDT SSDT PRAR > acpi0: wakeup devices PEG0(S0) PEGP(S0) PEG1(S0) PEGP(S0) PEG2(S0) PEGP(S0) > XHC_(S0) XDCI(S0) RP01(S0) PXSX(S0) RP02(] > acpitimer0 at acpi0: 3579545 Hz, 24 bits > > acpihpet0 at acpi0: 23999999 Hz > acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat > cpu0 at mainbus0: apid 0 (boot processor) > cpu0: Intel(R) Xeon(R) CPU E3-1270 v6 @ 3.80GHz ("GenuineIntel" 686-class) > 3.80 GHz, 06-9e-09 > cpu0: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTTN > mtrr: Pentium Pro MTRR support, 10 var ranges, 88 fixed ranges > > cpu0: apic clock running at 24MHz > cpu0: mwait min=64, max=64, C-substates=0.2.1.2.4.1, IBE > cpu1 at mainbus0: apid 2 (application processor) > cpu1: Intel(R) Xeon(R) CPU E3-1270 v6 @ 3.80GHz ("GenuineIntel" 686-class) > 3.80 GHz, 06-9e-09 > cpu1: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTTN > cpu2 at mainbus0: apid 4 (application processor) > > cpu2: Intel(R) Xeon(R) CPU E3-1270 v6 @ 3.80GHz ("GenuineIntel" 686-class) > 3.80 GHz, 06-9e-09 > cpu2: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTTN > cpu3 at mainbus0: apid 6 (application processor) > > cpu3: Intel(R) Xeon(R) CPU E3-1270 v6 @ 3.80GHz ("GenuineIntel" 686-class) > 3.80 GHz, 06-9e-09 > cpu3: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTTN > ioapic0 at mainbus0: apid 2 pa 0xfec00000, version 20, 24 pins > > acpimcfg0 at acpi0 > acpimcfg0: addr 0xe0000000, bus 0-255 > acpiprt0 at acpi0: bus 0 (PCI0) > acpiprt0: no apic found for irq 32 > acpiprt0: no apic found for irq 33 > acpiprt0: no apic found for irq 34 > acpiprt1 at acpi0: bus 1 (PEG0) > acpiprt2 at acpi0: bus 2 (PEG1) > acpiprt3 at acpi0: bus -1 (PEG2) > acpiprt4 at acpi0: bus -1 (RP01) > acpiprt5 at acpi0: bus -1 (RP02) > acpiprt6 at acpi0: bus -1 (RP03) > acpiprt7 at acpi0: bus -1 (RP04) > acpiprt8 at acpi0: bus -1 (RP05) > acpiprt9 at acpi0: bus -1 (RP06) > acpiprt10 at acpi0: bus -1 (RP07) > acpiprt11 at acpi0: bus -1 (RP08) > acpiprt12 at acpi0: bus 3 (RP09) > acpiprt13 at acpi0: bus -1 (RP10) > acpiprt14 at acpi0: bus 4 (RP11) > acpiprt15 at acpi0: bus -1 (RP12) > acpiprt16 at acpi0: bus -1 (RP13) > acpiprt17 at acpi0: bus -1 (RP14) > acpiprt18 at acpi0: bus -1 (RP15) > acpiprt19 at acpi0: bus -1 (RP16) > acpiprt20 at acpi0: bus -1 (RP17) > acpiprt21 at acpi0: bus -1 (RP18) > acpiprt22 at acpi0: bus -1 (RP19) > acpiprt23 at acpi0: bus -1 (RP20) > "PNP0A08" at acpi0 not configured > acpicmos0 at acpi0 > acpibtn0 at acpi0: SLPB > "PNP0C14" at acpi0 not configured > "PNP0C33" at acpi0 not configured > acpicpu0 at acpi0: C3(500@76 mwait.1@0x10), C1(1000@1 mwait.1), PSS > acpicpu1 at acpi0: C3(500@76 mwait.1@0x10), C1(1000@1 mwait.1), PSS > acpicpu2 at acpi0: C3(500@76 mwait.1@0x10), C1(1000@1 mwait.1), PSS > acpicpu3 at acpi0: C3(500@76 mwait.1@0x10), C1(1000@1 mwait.1), PSS > acpivideo0 at acpi0: GFX0 > acpivout0 at acpivideo0: DD1F > bios0: ROM list: 0xc0000/0x8000 0xc8000/0x3800! 0xcb800/0x1200! > ipmi at mainbus0 not configured > cpu0: Enhanced SpeedStep 3793 MHz: speeds: 3801, 3800, 3600, 3400, 3200, > 2900, 2700, 2500, 2300, 2100, 1900, 1700, 14z > memory map conflict 0xe00fd000/0x1000 > > memory map conflict 0xfe000000/0x11000 > pci0 at mainbus0 bus 0: configuration mode 1 (no bios) > 0:22:0: io address conflict 0x2e/0x2 > 0:22:1: io address conflict 0x2e/0x2 > 0:31:4: io address conflict 0x2e/0x2 > pchb0 at pci0 dev 0 function 0 "Intel Xeon E3-1200 v6/7 Host" rev 0x05 > ppb0 at pci0 dev 1 function 0 "Intel Core 6G PCIE" rev 0x05: apic 2 int 16 > pci1 at ppb0 bus 1 > ppb1 at pci0 dev 1 function 1 "Intel Core 6G PCIE" rev 0x05: apic 2 int 16 > pci2 at ppb1 bus 2 > em0 at pci2 dev 0 function 0 "Intel I350" rev 0x01: msi, address > a0:36:9f:c3:f7:60 > em1 at pci2 dev 0 function 1 "Intel I350" rev 0x01: msi, address > a0:36:9f:c3:f7:61 > xhci0 at pci0 dev 20 function 0 "Intel 100 Series xHCI" rev 0x31: msi, xHCI > 1.0 > usb0 at xhci0: USB revision 3.0 > > uhub0 at usb0 configuration 1 interface 0 "Intel xHCI root hub" rev 3.00/1.00 > addr 1 > pchtemp0 at pci0 dev 20 function 2 "Intel 100 Series Thermal" rev 0x31: can't > map registers > "Intel 100 Series MEI" rev 0x31 at pci0 dev 22 function 0 not configured > > "Intel 100 Series MEI" rev 0x31 at pci0 dev 22 function 1 not configured > ahci0 at pci0 dev 23 function 0 "Intel 100 Series AHCI" rev 0x31: msi, AHCI > 1.3.1 > ahci0: port 0: 6.0Gb/s > > scsibus1 at ahci0: 32 targets > sd0 at scsibus1 targ 0 lun 0: <ATA, Samsung SSD 860, RVT0> > naa.5002538e096121f5 > sd0: 476940MB, 512 bytes/sector, 976773168 sectors, thin > > ppb2 at pci0 dev 29 function 0 "Intel 100 Series PCIE" rev 0xf1: apic 2 int 16 > pci3 at ppb2 bus 3 > ppb3 at pci0 dev 29 function 2 "Intel 100 Series PCIE" rev 0xf1: apic 2 int 18 > pci4 at ppb3 bus 4 > ppb4 at pci4 dev 0 function 0 "Renesas SH7758 PCIE Switch" rev 0x00 > pci5 at ppb4 bus 5 > ppb5 at pci5 dev 0 function 0 "Renesas SH7758 PCIE Switch" rev 0x00 > pci6 at ppb5 bus 6 > ppb6 at pci6 dev 0 function 0 "Renesas SH7758 PCIE-PCI" rev 0x00 > pci7 at ppb6 bus 7 > vga1 at pci7 dev 0 function 0 "Matrox MGA G200eR" rev 0x01 > wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) > wsdisplay0: screen 1-5 added (80x25, vt100 emulation) > pcib0 at pci0 dev 31 function 0 "Intel C236 LPC" rev 0x31 > "Intel 100 Series PMC" rev 0x31 at pci0 dev 31 function 2 not configured > ichiic0 at pci0 dev 31 function 4 "Intel 100 Series SMBus" rev 0x31: apic 2 > int 16 > iic0 at ichiic0 > > iic0: addr 0x19 00=00 01=00 02=00 03=00 04=00 05=c1 06=1c 07=22 08=00 words > 00=00ef 01=0000 02=0000 03=0000 04=0000 01 > iic0: addr 0x1b 00=00 01=00 02=00 03=00 04=00 05=c1 06=1c 07=22 08=00 words > 00=00ef 01=0000 02=0000 03=0000 04=0000 01 > isa0 at pcib0 > > isadma0 at isa0 > com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo > com0: console > com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo > pckbc0 at isa0 port 0x60/5 irq 1 irq 12 > pcppi0 at isa0 port 0x61 > spkr0 at pcppi0 > npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16 > dt: 443 probes > uhub1 at uhub0 port 3 configuration 1 interface 0 "no manufacturer Gadget USB > HUB" rev 2.00/0.00 addr 2 > vscsi0 at root > > scsibus2 at vscsi0: 256 targets > softraid0 at root > scsibus3 at softraid0: 256 targets > root on sd0a (81ccb2b97e4e39c2.a) swap on sd0b dump on sd0b > WARNING: / was not properly unmounted
I suspect pmapae.c rev 1.61 causes this issue. Does reverting that commit "fix" the issue? It won't really fix the issue as you may still hit the "can't locate PD page" panic.