On Sat, Dec 27, 2025 at 03:41:38PM +0100, Mark Kettenis wrote: > > Date: Sat, 27 Dec 2025 14:59:49 +0100 > > From: Alexander Bluhm <[email protected]> > > > > Hi, > > > > I have a Sun Fire V215 that hangs with MP kernel but runs fine with > > single processor GENERIC kernel. I did see userland memory corruption > > with 7.8 GENERIC.MP, so I wanted to try recent IPI fixes. But with > > snapshot it hangs buring boot. No break into ddb possible. > > The IPI fix only affects sun4v and this is a sun4u machine. So it > wouldn't help. But it would be interesting to know if you still see > those memory corruption issues. A few other changes went in as well.
With 7.8 GENERIC.MP I did see gcc internal compiler errors. I guess it may be bad RAM or some kernel issues. Current GENERIC.MP hangs shortly after kernel boot. 7.8 and current GENERIC survive make build. So I suspect a MP kernel bug and not bad RAM. > Theo has a sparc64-specific diff in his tree. I can't rule out that > that causes issues on sun4u. It might be worth building your own > GENERIC.MP kernel and see if that fares better. If I remember correctly, I also tried a self built GENERIC.MP a few days ago. bluhm > > Rebooting with command: boot disk /bsd.snap -d > > Boot device: /pci@1e,600000/pci@0/pci@a/pci@0/pci@8/scsi@1/disk@0,0 File > > and args: /bsd.snap -d > > OpenBSD IEEE 1275 Bootblock 2.1 > > ..>> OpenBSD BOOT 1.30 > > Can't read disk label. > > Can't open disk label package > > Booting /pci@1e,600000/pci@0/pci@a/pci@0/pci@8/scsi@1/disk@0,0:a/bsd.snap > > 10542576@0x1000000+528@0x1a0ddf0+154196@0x1c00000+4040108@0x1c25a54 > > symbols @ 0xfe978400 544212+165+698760+491505 start=0x1000000 > > [ using 1735680 bytes of bsd ELF symbol table ] > > console is /ebus@1f,464000/serial@2,80 > > Copyright (c) 1982, 1986, 1989, 1991, 1993 > > The Regents of the University of California. All rights reserved. > > Copyright (c) 1995-2025 OpenBSD. All rights reserved. > > https://www.OpenBSD.org > > > > OpenBSD 7.8-current (GENERIC.MP) #47: Fri Dec 26 19:05:29 MST 2025 > > [email protected]:/usr/src/sys/arch/sparc64/compile/GENERIC.MP > > real mem = 4294967296 (4096MB) > > avail mem = 4200751104 (4006MB) > > random: good seed from bootblocks > > mpath0 at root > > scsibus0 at mpath0: 256 targets > > Stopped at db_enter+0x8: nop > > ddb{0}> write db_console 1 > > db_console 0 = 1 > > ddb{0}> c > > mainbus0 at root: Sun Fire V215 > > cpu0 at mainbus0: SUNW,UltraSPARC-IIIi (rev 3.4) @ 1504 MHz > > cpu0: physical 32K instruction (32 b/l), 64K data (32 b/l), 1024K external > > (64 b/l) > > cpu1 at mainbus0: SUNW,UltraSPARC-IIIi (rev 3.4) @ 1504 MHz > > cpu1: physical 32K instruction (32 b/l), 64K data (32 b/l), 1024K external > > (64 b/l) > > "memory-controller" at mainbus0 not configured > > "memory-controller" at mainbus0 not configured > > pyro0 at mainbus0: "Fire", rev 3, ign 780, bus A 2 to 13 > > pyro0: dvma map c0000000-ffffffff > > pci0 at pyro0 > > ppb0 at pci0 dev 0 function 0 "PLX PEX 8532" rev 0xbc > > pci1 at ppb0 bus 3 > > ppb1 at pci1 dev 1 function 0 "PLX PEX 8532" rev 0xbc > > pci2 at ppb1 bus 4 > > ppb2 at pci2 dev 0 function 0 "Acer Labs M5249" rev 0x00 > > pci3 at ppb2 bus 5 > > ohci0 at pci3 dev 28 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x780, > > version 1.0, legacy support > > ohci1 at pci3 dev 28 function 1 "Acer Labs M5237 USB" rev 0x03: ivec 0x780, > > version 1.0, legacy support > > ehci0 at pci3 dev 28 function 3 "Acer Labs M5239 USB2" rev 0x01: ivec 0x781 > > usb0 at ehci0: USB revision 2.0 > > uhub0 at usb0 configuration 1 interface 0 "Acer Labs EHCI root hub" rev > > 2.00/1.00 addr 1 > > ebus0 at pci3 dev 30 function 0 "Acer Labs M1575 ISA" rev 0x00 > > rtc0 at ebus0 addr 70-73: m5823 > > pciide0 at pci3 dev 31 function 0 "Acer Labs M5229 IDE" rev 0xc8: DMA, > > channel 0 configured to native-PCI, channel 1 configured to native-PCI > > pciide0: using ivec 0x784 for native-PCI interrupt > > atapiscsi0 at pciide0 channel 0 drive 0 > > scsibus1 at atapiscsi0: 2 targets > > cd0 at scsibus1 targ 0 lun 0: <TEAC, DV-28SL, 1.0A> removable > > cd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2 > > pciide0: channel 1 disabled (no drives) > > usb1 at ohci0: USB revision 1.0 > > uhub1 at usb1 configuration 1 interface 0 "Acer Labs OHCI root hub" rev > > 1.00/1.00 addr 1 > > usb2 at ohci1: USB revision 1.0 > > uhub2 at usb2 configuration 1 interface 0 "Acer Labs OHCI root hub" rev > > 1.00/1.00 addr 1 > > ppb3 at pci1 dev 2 function 0 "PLX PEX 8532" rev 0xbc: msi > > pci4 at ppb3 bus 6 > > ppb4 at pci1 dev 8 function 0 "PLX PEX 8532" rev 0xbc: msi > > pci5 at ppb4 bus 7 > > ix0 at pci5 dev 0 function 0 "Intel 82599" rev 0x01, msix, 2 queues, > > address 90:e2:ba:d4:41:98 > > ix1 at pci5 dev 0 function 1 "Intel 82599" rev 0x01, msix, 2 queues, > > address 90:e2:ba:d4:41:99 > > ppb5 at pci1 dev 9 function 0 "PLX PEX 8532" rev 0xbc > > pci6 at ppb5 bus 8 > > ppb6 at pci6 dev 0 function 0 "ServerWorks PCIE-PCIX" rev 0xb5 > > pci7 at ppb6 bus 9 > > bge0 at pci7 dev 4 function 0 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 > > (0x9003): ivec 0x795, address 00:14:4f:c3:df:7a > > brgphy0 at bge0 phy 1: BCM5714, rev. 0 > > bge1 at pci7 dev 4 function 1 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 > > (0x9003): ivec 0x796, address 00:14:4f:c3:df:7b > > brgphy1 at bge1 phy 1: BCM5714, rev. 0 > > ppb7 at pci7 dev 8 function 0 "ServerWorks HT-1000 PCIX" rev 0xb4 > > pci8 at ppb7 bus 10 > > ppb8 at pci1 dev 10 function 0 "PLX PEX 8532" 0xbc > > pci9 at ppb8 bus 11 > > ppb9 at pci9 dev 0 function 0 "ServerWorks PCIE-PCIX" rev 0xb5 > > pci10 at ppb9 bus 12 > > bge2 at pci10 dev 4 function 0 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 > > (0x9003): ivec 0x796, address 00:14:4f:c3:df:7c > > brgphy2 at bge2 phy 1: BCM5714, rev. 0 > > bge3 at pci10 dev 4 function 1 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 > > (0x9003): ivec 0x797, address 00:14:4f:c3:df:7d > > brgphy3 at bge3 phy 1: BCM5714, rev. 0 > > ppb10 at pci10 dev 8 function 0 "ServerWorks HT-1000 PCIX" rev 0xb4 > > pci11 at ppb10 bus 13 > > mpi0 at pci11 dev 1 function 0 "Symbios Logic SAS1064" rev 0x02: msi > > mpi0: SAS1064, firmware 1.8.4.0 > > scsibus2 at mpi0: 63 targets > > sym0 at scsibus2 targ 0 lun 0: <FUJITSU, MAY2073RC, 0103> > > naa.500000e019dd9c70 > > sd0 at scsibus0 targ 0 lun 0: <FUJITSU, MAY2073RC, 0103> > > naa.500000e019dd9c70 > > sd0: 70136MB, 512 bytes/sector, 143638992 sectors > > sym1 at scsibus2 targ 1 lun 0: <SEAGATE, ST2000NX0273, E005> > > naa.5000c500f6411963 > > sd1 at scsibus0 targ 1 lun 0: <SEAGATE, ST2000NX0273, E005> > > naa.5000c500f6411963 > > sd1: 1907729MB, 512 bytes/sector, 3907029168 sectors > > pyro1 at mainbus0: "Fire", rev 3, ign 7c0, bus B 2 to 255 > > pyro1: dvma map c0000000-ffffffff > > pci12 at pyro1 > > ebus1 at mainbus0 > > "flashprom" at ebus1 addr 0-1fffff not configured > > com0 at ebus1 addr 80-87 ivec 0x7c8: ns16550a, 16 byte fifo > > com0: console > > com1 at ebus1 addr 40-47 ivec 0x7c9: ns16550a, 16 byte fifo > > "rmc-comm" at ebus1 addr 0-7 ivec 0x7ca not configured > > "gpio" at ebus1 addr c0-c0 not configured > > led0 at ebus1 addr 0-80: rev 0x5a > > power0 at ebus1 addr 40-c1 ivec 0x7c3 > > "i2c" at mainbus0 not configured > > vscsi0 at root > > scsibus3 at vscsi0: 256 targets > > softraid0 at root > > scsibus4 at softraid0: 256 targets > > bootpath: /pci@1e,600000/pci@0,0/pci@a,0/pci@0,0/pci@8,0/scsi@1,0/disk@0,0 > > root on sd0a (15e09306f0835f11.a) swap on sd0b dump on sd0b > > Automatic boot in progress: starting file system checks. > > /dev/sd0a (15e09306f0835f11.a): file system is clean; not checking > > /dev/sd0e (15e09306f0835f11.e): file system is clean; not checking > > /dev/sd0f (15e09306f0835f11.f): file system is clean; not checking > > /dev/sd0g (15e09306f0835f11.g): file system is clean; not checking > > /dev/sd0h (15e09306f0835f11.h): file system is clean; not checking > > /dev/sd0i (15e09306f0835f11.i): file system is clean; not checking > > /dev/sd0j (15e09306f0835f11.j): file system is clean; not checking > > /dev/sd0k (15e09306f0835f11.k): file system is clean; not checking > > > > SC Alert: SC Request to send Break to host. > > sc> break -c > > Are you sure you want to send a break to the system [y/n]? y > > Enter #. to return to ALOM. > > > > SC Alert: SC Request to send Break to host. > > > > bluhm > > > >
