Hello OpenBSD Bugs Team, OpenBSD 5.0 (sparc64) will panic during boot.
System: Sun Fire v245 OpenBootPROM: update from 4.22.x to 4.30.4 (2009) A.L.O.M update from 1.6.1 to 1.6.10 {1} ok printenv Variable Name Value Default Value asr-policy normal normal keyboard-layout German test-args diag-passes 1 1 local-mac-address? true true fcode-debug? false false scsi-initiator-id 7 7 oem-logo No default oem-logo? false false oem-banner System: web23sc.domain.tld No default oem-banner? false false ansi-terminal? true true screen-#columns 80 80 screen-#rows 34 34 ttyb-rts-dtr-off false false ttyb-ignore-cd true true ttya-rts-dtr-off false false ttya-ignore-cd true true ttyb-mode 9600,8,n,1,- 9600,8,n,1,- ttya-mode 9600,8,n,1,- 9600,8,n,1,- output-device ttya screen input-device ttya keyboard auto-boot-on-error? true true error-reset-recovery sync sync load-base 16384 16384 auto-boot? true true network-boot-arguments boot-command boot boot diag-file diag-device net net boot-file boot-device disk net cdrom disk net use-nvramrc? false false nvramrc ." ChassisSerialNumber 0 ... security-mode none No default security-password No default security-#badlogins 0 No default verbosity normal normal diag-trigger error-reset power-on-res ... error-reset power-on-res ... service-mode? false false diag-script normal normal diag-level max max diag-switch? false false {1} ok boot cdrom SC Alert: Host System has Reset Probing system devices Probing memory Probing I/O buses Sun Fire V245, No Keyboard Copyright 2009 Sun Microsystems, Inc. All rights reserved. OpenBoot 4.30.4, 8192 MB memory installed, Serial #00000000. Ethernet address aa:bb:cc:dd:ee:ff, Host ID: 00000000. Rebooting with command: boot cdrom Boot device: /pci@1e,600000/pci@0/pci@1/pci@0/ide@1f/cdrom@0,0:f File and args: OpenBSD IEEE 1275 Bootblock 1.3 ..>> OpenBSD BOOT 1.4 Trying bsd... Booting /pci@1e,600000/pci@0/pci@1/pci@0/ide@1f/cdrom@0,0:f/bsd 3834776@0x1000000+7272@0x13a8398+3252416@0x1800000+941888@0x1b1a0c0 symbols @ 0xfefc2240 65 start=0x1000000 console is /ebus@1f,464000/serial@2,80 Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. Copyright (c) 1995-2011 OpenBSD. All rights reserved. http://www.OpenBSD.org OpenBSD 5.0 (RAMDISK) #31: Wed Aug 17 10:53:59 MDT 2011 dera...@sparc64.openbsd.org:/usr/src/sys/arch/sparc64/compile/RAMDISK real mem = 8589934592 (8192MB) avail mem = 8447467520 (8056MB) mainbus0 at root: Sun Fire V245 "SUNW,UltraSPARC-IIIi" at mainbus0 not configured cpu0 at mainbus0: SUNW,UltraSPARC-IIIi (rev 3.4) @ 1504 MHz cpu0: physical 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l) "memory-controller" at mainbus0 not configured "memory-controller" at mainbus0 not configured pyro0 at mainbus0: "Fire", rev 3, ign 780, bus A 2 to 13 pyro0: dvma map c0000000-ffffffff pci0 at pyro0 ppb0 at pci0 dev 0 function 0 "PLX PEX 8532" rev 0xba pci1 at ppb0 bus 3 ppb1 at pci1 dev 1 function 0 "PLX PEX 8532" rev 0xba pci2 at ppb1 bus 4 ppb2 at pci2 dev 0 function 0 "Acer Labs M5249 PCI-PCI" rev 0x00 pci3 at ppb2 bus 5 ohci0 at pci3 dev 28 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x780, version 1.0, legacy support ohci1 at pci3 dev 28 function 1 "Acer Labs M5237 USB" rev 0x03: ivec 0x780, version 1.0, legacy support ehci0 at pci3 dev 28 function 3 "Acer Labs M5239 USB2" rev 0x01: ivec 0x781 usb0 at ehci0: USB revision 2.0 uhub0 at usb0 "Acer Labs EHCI root hub" rev 2.00/1.00 addr 1 ebus0 at pci3 dev 30 function 0 "Acer Labs M1575 ISA" rev 0x00 rtc0 at ebus0 addr 70-73: m5823 pciide0 at pci3 dev 31 function 0 "Acer Labs M5229 UDMA IDE" rev 0xc8: DMA, channel 0 configured to native-PCI, channel 1 configured to native-PCI pciide0: using ivec 0x784 for native-PCI interrupt atapiscsi0 at pciide0 channel 0 drive 0 scsibus0 at atapiscsi0: 2 targets cd0 at scsibus0 targ 0 lun 0: <MATSHITA, DVD-RAM UJ-845S, D100> ATAPI 5/cdrom removable cd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2 pciide0: channel 1 disabled (no drives) usb1 at ohci0: USB revision 1.0 uhub1 at usb1 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1 usb2 at ohci1: USB revision 1.0 uhub2 at usb2 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1 ppb3 at pci1 dev 2 function 0 "PLX PEX 8532" rev 0xba: msi pci4 at ppb3 bus 6 ppb4 at pci1 dev 8 function 0 "PLX PEX 8532" rev 0xba: msi pci5 at ppb4 bus 7 ppb5 at pci1 dev 9 function 0 "PLX PEX 8532" rev 0xba pci6 at ppb5 bus 8 ppb6 at pci6 dev 0 function 0 "ServerWorks PCIE-PCIX" rev 0xb5 pci7 at ppb6 bus 9 bge0 at pci7 dev 4 function 0 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x795, address aa:bb:cc:dd:ee:ff brgphy0 at bge0 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 bge1 at pci7 dev 4 function 1 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x796, address aa:bb:cc:dd:ee:ff brgphy1 at bge1 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 ppb7 at pci7 dev 8 function 0 "ServerWorks HT-1000 PCIX" rev 0xb4 pci8 at ppb7 bus 10 ppb8 at pci1 dev 10 function 0 "PLX PEX 8532" rev 0xba pci9 at ppb8 bus 11 ppb9 at pci9 dev 0 function 0 "ServerWorks PCIE-PCIX" rev 0xb5 pci10 at ppb9 bus 12 bge2 at pci10 dev 4 function 0 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x796, address aa:bb:cc:dd:ee:ff brgphy2 at bge2 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 bge3 at pci10 dev 4 function 1 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x797, address aa:bb:cc:dd:ee:ff brgphy3 at bge3 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 ppb10 at pci10 dev 8 function 0 "ServerWorks HT-1000 PCIX" rev 0xb4 pci11 at ppb10 bus 13 mpi0 at pci11 dev 1 function 0 "Symbios Logic SAS1064" rev 0x02: msi scsibus1 at mpi0: 63 targets sd0 at scsibus1 targ 0 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd0: 70007MB, 512 bytes/sector, 143374738 sectors sd1 at scsibus1 targ 1 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd1: 70007MB, 512 bytes/sector, 143374738 sectors sd2 at scsibus1 targ 2 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd2: 70007MB, 512 bytes/sector, 143374738 sectors sd3 at scsibus1 targ 3 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd3: 70007MB, 512 bytes/sector, 143374738 sectors pyro1 at mainbus0: "Fire", rev 3, ign 7c0, bus B 2 to 4 pyro1: dvma map c0000000-ffffffff pci12 at pyro1 ppb11 at pci12 dev 0 function 0 "Intel 41210 PCIE-PCIX" rev 0x09 pci13 at ppb11 bus 3 ppb12 at pci12 dev 0 function 2 "Intel 41210 PCIE-PCIX" rev 0x09 pci14 at ppb12 bus 4 ebus1 at mainbus0: ign 7c0 "flashprom" at ebus1 addr 0-1fffff not configured com0 at ebus1 addr 80-87 ivec 0x8: ns16550a, 16 byte fifo com0: console com1 at ebus1 addr 40-47 ivec 0x9: ns16550a, 16 byte fifo "rmc-comm" at ebus1 addr 0-7 ivec 0xa not configured "gpio" at ebus1 addr c0-c0 not configured "env-monitor" at ebus1 addr 0-80 not configured "power" at ebus1 addr 40-c1 ivec 0x3 not configured "i2c" at mainbus0 not configured data error type 32 sfsr=80800c sfva=400793e4041 afsr=10080005000000 afva=7f8100010c0 tf=0xe0017aa8 panic: data fault: pc=101b490 addr=400793e4041 sfsr=80800c<TM,PRIV,W> halted Program terminated {1} ok it seems well to describe http://marc.info/?l=openbsd-sparc&m=133016022323288&w=2 On Tue, Mar 13, 2012 at 08:15:33PM +0100, Mark Kettenis wrote: > > Date: Mon, 12 Mar 2012 22:39:31 +0100 (CET) > > From: Mark Kettenis <mark.kette...@xs4all.nl> > > > > > Date: Sat, 25 Feb 2012 09:55:57 +0100 > > > From: Paul de Weerd <we...@weirdnet.nl> > > > > > > I recently got a v215 from a friend and have installed OpenBSD on it. > > > Occassionally, it will panic during boot. This happened during > > > install and I see it now during regular reboots. I can pretty much > > > reproduce this at will with a couple of reboots. > > > > > > Could this be faulty hardware ? To reset the ALOM password, I > > > installed Solaris 10 (took an eternity) and that never showed any > > > problems, but I guess that doesn't prove much. > > > > > > First the panic and then full dmesg (from a succesful boot) are > > > included below. > > > > I doubt this is faulty hardware. I've seen similar reports for a > > v445, which has the same crappy Acer Labs pciide(4) controller. I > > fear that the wdc.c changes made in April 2011 introduced this > > behaviour. > > So thanks to Paul giving me access to the machine in question I've > been able to figure out what's going wrong here. > > The data error always happens when running wdcintr() for channel 1. > Now on these machines we have the following line in dmesg > > ... > pciide0: channel 1 disabled (no drives) > ... > > indicating that there is no actual hardware connected to channel 1. > As a result of this we skip further initialization of the channel. > Therefore it shouldn't be a terrible surprise that the chip doesn't > like it when we try to read registers associated with this channel. > On crappy PC hardware this won't be noticed, but on sparc64 this > results in an unrecoverable fault. > > The solution is easy. We shouldn't be calling wdcintr() for a channel > that isn't properly initialized. > > ok? > > > Index: pciide.c > =================================================================== > RCS file: /cvs/src/sys/dev/pci/pciide.c,v > retrieving revision 1.337 > diff -u -p -r1.337 pciide.c > --- pciide.c 15 Jan 2012 15:16:23 -0000 1.337 > +++ pciide.c 13 Mar 2012 18:54:50 -0000 > @@ -1838,6 +1838,9 @@ pciide_pci_intr(void *arg) > if (cp->compat) > continue; > > + if (cp->hw_ok == 0) > + continue; > + > if (pciide_intr_flag(cp) == 0) > continue; > http://www.openbsd.org/cgi-bin/cvsweb/src/sys/dev/pci/pciide.c would be the patch in openbsd 5.1 current? Revision 1.338: download - view: text, markup, annotated - select for diffs Tue Mar 13 21:32:42 2012 UTC (4 weeks, 4 days ago) by kettenis Branches: MAIN CVS tags: HEAD Diff to: previous 1.337: preferred, coloured Changes since revision 1.337: +4 -1 lines Don't call wdcintr() for channels that aren't properly initialized or don't have any drives on them. Fixes a panic seen on sparc64 machines with that pesky Acer Labs chipset. ok deraadt@, krw@, henning@ System OpenBootPROM {1} ok setenv boot-command boot -c boot-command = boot -c {1} ok {1} ok reset-all SC Alert: Host System has Reset Probing system devices Probing memory Probing I/O buses Sun Fire V245, No Keyboard Copyright 2009 Sun Microsystems, Inc. All rights reserved. OpenBoot 4.30.4, 8192 MB memory installed, Serial #00000000. Ethernet address aa:bb:cc:dd:ee:ff, Host ID: 00000000. Boot device: /pci@1e,600000/pci@0/pci@1/pci@0/ide@1f/cdrom@0,0:f File and args: -c OpenBSD IEEE 1275 Bootblock 1.3 ..>> OpenBSD BOOT 1.4 Trying bsd... Booting /pci@1e,600000/pci@0/pci@1/pci@0/ide@1f/cdrom@0,0:f/bsd 3834776@0x1000000+7272@0x13a8398+3252416@0x1800000+941888@0x1b1a0c0 symbols @ 0xfefc2240 65 start=0x1000000 console is /ebus@1f,464000/serial@2,80 Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. Copyright (c) 1995-2011 OpenBSD. All rights reserved. http://www.OpenBSD.org OpenBSD 5.0 (RAMDISK) #31: Wed Aug 17 10:53:59 MDT 2011 dera...@sparc64.openbsd.org:/usr/src/sys/arch/sparc64/compile/RAMDISK real mem = 8589934592 (8192MB) avail mem = 8447467520 (8056MB) User Kernel Config UKC> disable pciide 89 pciide* disabled 90 pciide* disabled UKC> quit Continuing... mainbus0 at root: Sun Fire V245 "SUNW,UltraSPARC-IIIi" at mainbus0 not configured cpu0 at mainbus0: SUNW,UltraSPARC-IIIi (rev 3.4) @ 1504 MHz cpu0: physical 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l) "memory-controller" at mainbus0 not configured "memory-controller" at mainbus0 not configured pyro0 at mainbus0: "Fire", rev 3, ign 780, bus A 2 to 13 pyro0: dvma map c0000000-ffffffff pci0 at pyro0 ppb0 at pci0 dev 0 function 0 "PLX PEX 8532" rev 0xba pci1 at ppb0 bus 3 ppb1 at pci1 dev 1 function 0 "PLX PEX 8532" rev 0xba pci2 at ppb1 bus 4 ppb2 at pci2 dev 0 function 0 "Acer Labs M5249 PCI-PCI" rev 0x00 pci3 at ppb2 bus 5 ohci0 at pci3 dev 28 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x780, version 1.0, legacy support ohci1 at pci3 dev 28 function 1 "Acer Labs M5237 USB" rev 0x03: ivec 0x780, version 1.0, legacy support ehci0 at pci3 dev 28 function 3 "Acer Labs M5239 USB2" rev 0x01: ivec 0x781 usb0 at ehci0: USB revision 2.0 uhub0 at usb0 "Acer Labs EHCI root hub" rev 2.00/1.00 addr 1 ebus0 at pci3 dev 30 function 0 "Acer Labs M1575 ISA" rev 0x00 rtc0 at ebus0 addr 70-73: m5823 "Acer Labs M5229 UDMA IDE" rev 0xc8 at pci3 dev 31 function 0 not configured usb1 at ohci0: USB revision 1.0 uhub1 at usb1 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1 usb2 at ohci1: USB revision 1.0 uhub2 at usb2 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1 ppb3 at pci1 dev 2 function 0 "PLX PEX 8532" rev 0xba: msi pci4 at ppb3 bus 6 ppb4 at pci1 dev 8 function 0 "PLX PEX 8532" rev 0xba: msi pci5 at ppb4 bus 7 ppb5 at pci1 dev 9 function 0 "PLX PEX 8532" rev 0xba pci6 at ppb5 bus 8 ppb6 at pci6 dev 0 function 0 "ServerWorks PCIE-PCIX" rev 0xb5 pci7 at ppb6 bus 9 bge0 at pci7 dev 4 function 0 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x795, address aa:bb:cc:dd:ee:ff brgphy0 at bge0 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 bge1 at pci7 dev 4 function 1 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x796, address aa:bb:cc:dd:ee:ff brgphy1 at bge1 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 ppb7 at pci7 dev 8 function 0 "ServerWorks HT-1000 PCIX" rev 0xb4 pci8 at ppb7 bus 10 ppb8 at pci1 dev 10 function 0 "PLX PEX 8532" rev 0xba pci9 at ppb8 bus 11 ppb9 at pci9 dev 0 function 0 "ServerWorks PCIE-PCIX" rev 0xb5 pci10 at ppb9 bus 12 bge2 at pci10 dev 4 function 0 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x796, address aa:bb:cc:dd:ee:ff brgphy2 at bge2 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 bge3 at pci10 dev 4 function 1 "Broadcom BCM5714" rev 0xa3, BCM5715 A3 (0x9003): ivec 0x797, address aa:bb:cc:dd:ee:ff brgphy3 at bge3 phy 1: BCM5714 10/100/1000baseT/SX PHY, rev. 0 ppb10 at pci10 dev 8 function 0 "ServerWorks HT-1000 PCIX" rev 0xb4 pci11 at ppb10 bus 13 mpi0 at pci11 dev 1 function 0 "Symbios Logic SAS1064" rev 0x02: msi scsibus0 at mpi0: 63 targets sd0 at scsibus0 targ 0 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd0: 70007MB, 512 bytes/sector, 143374738 sectors sd1 at scsibus0 targ 1 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd1: 70007MB, 512 bytes/sector, 143374738 sectors sd2 at scsibus0 targ 2 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd2: 70007MB, 512 bytes/sector, 143374738 sectors sd3 at scsibus0 targ 3 lun 0: <SEAGATE, ST973401LSUN72G, 0556> SCSI3 0/direct fixed sd3: 70007MB, 512 bytes/sector, 143374738 sectors pyro1 at mainbus0: "Fire", rev 3, ign 7c0, bus B 2 to 4 pyro1: dvma map c0000000-ffffffff pci12 at pyro1 ppb11 at pci12 dev 0 function 0 "Intel 41210 PCIE-PCIX" rev 0x09 pci13 at ppb11 bus 3 ppb12 at pci12 dev 0 function 2 "Intel 41210 PCIE-PCIX" rev 0x09 pci14 at ppb12 bus 4 ebus1 at mainbus0: ign 7c0 "flashprom" at ebus1 addr 0-1fffff not configured com0 at ebus1 addr 80-87 ivec 0x8: ns16550a, 16 byte fifo com0: console com1 at ebus1 addr 40-47 ivec 0x9: ns16550a, 16 byte fifo "rmc-comm" at ebus1 addr 0-7 ivec 0xa not configured "gpio" at ebus1 addr c0-c0 not configured "env-monitor" at ebus1 addr 0-80 not configured "power" at ebus1 addr 40-c1 ivec 0x3 not configured "i2c" at mainbus0 not configured uhub3 at uhub2 port 1 "vendor 0x03eb Standard USB Hub" rev 1.10/3.00 addr 2 softraid0 at root scsibus1 at softraid0: 256 targets bootpath: /pci@1e,600000/pci@0,0/pci@1,0/pci@0,0/ide@1f,0/cdrom@0,0:f root on rd0a swap on rd0b dump on rd0b erase ^?, werase ^W, kill ^U, intr ^C, status ^T Welcome to the OpenBSD/sparc64 5.0 installation program. (I)nstall, (U)pgrade or (S)hell? best regards [demime 1.01d removed an attachment of type application/pkcs7-signature which had a name of smime.p7s]