Public bug reported:
boot failures with 5.4.0-7-generic on OPAL power box:
I was running ADT tests and the machine hung/rebooted. I was unable to
log in. After I rebooted the machine with the ipmi tool the machine
crashed with the following kernel output:
[ 51.081421774,5] SkiBoot skiboot-5.4.8-5787ad3 starting...
[ 51.081426316,5] initial console log level: memory 7, driver 5
[ 51.081429224,6] CPU: P8 generation processor(max 8 threads/core)
[ 51.081432044,7] CPU: Boot CPU PIR is 0x0470 PVR is 0x004d0200
[ 51.081435009,7] CPU: Initial max PIR set to 0x1fff
[ 51.082535316,5] OPAL table: 0x300bfc40 .. 0x300c0110, branch table:
0x30002000
[ 51.082543101,5] FDT: Parsing fdt @0xff00000
[ 51.087692296,5] XSCOM: chip 0x0 at 0x3fc0000000000 [P8 DD2.0]
[ 51.087702232,5] XSCOM: chip 0x8 at 0x3fc4000000000 [P8 DD2.0]
[ 51.087709775,6] XSTOP: XSCOM addr = 0x2010c82, FIR bit = 31
[ 51.087713185,6] MFSI 0:0: Initialized
[ 51.087715462,6] MFSI 0:2: Initialized
[ 51.087717669,6] MFSI 0:1: Initialized
[ 51.087720203,6] MFSI 8:0: Initialized
[ 51.087722365,6] MFSI 8:2: Initialized
[ 51.087724518,6] MFSI 8:1: Initialized
[ 51.088044434,5] LPC: LPC[000]: Initialized, access via XSCOM @0xb0020
[ 51.088162270,5] LPC: LPC: Default bus on chip 0x0
[ 51.088303476,6] MEM: parsing reserved memory from node
/ibm,hostboot/reserved-memory
[ 51.088313438,7] HOMER: Init chip 0
[ 51.088316406,7] PBA BAR0 : 0x00000007fd800000
[ 51.088319108,7] PBA MASK0: 0x0000000000300000
[ 51.088321761,7] HOMER Image at 0x7fd800000 size 4MB
[ 51.088325579,7] PBA BAR2 : 0x40000007fda00000
[ 51.088328358,7] PBA MASK2: 0x0000000000000000
[ 51.088330928,7] SLW Image at 0x7fda00000 size 1MB
[ 51.088334409,7] PBA BAR3 : 0x00000007ff800000
[ 51.088337060,7] PBA MASK3: 0x0000000000700000
[ 51.088339732,7] OCC Common Area at 0x7ff800000 size 8MB
[ 51.088342594,7] HOMER: Init chip 8
[ 51.088345257,7] PBA BAR0 : 0x00000007fdc00000
[ 51.088347872,7] PBA MASK0: 0x0000000000300000
[ 51.088350519,7] HOMER Image at 0x7fdc00000 size 4MB
[ 51.088354173,7] PBA BAR2 : 0x40000007fde00000
[ 51.088356860,7] PBA MASK2: 0x0000000000000000
[ 51.088359365,7] SLW Image at 0x7fde00000 size 1MB
[ 51.088362788,7] PBA BAR3 : 0x00000007ff800000
[ 51.088365419,7] PBA MASK3: 0x0000000000700000
[ 51.088367946,7] OCC Common Area at 0x7ff800000 size 8MB
[ 51.088387526,7] CPU idle state device tree init
[ 51.088391002,4] SLW: HB-provided idle states property found
[ 51.088567406,7] AST: PNOR LPC offset: 0x0c000000
[ 51.088650577,5] PLAT: Using virtual UART
[ 51.088977615,7] UART: Using LPC IRQ 4
[ 51.203625382,5] PLAT: Detected Firestone platform
[ 51.219765305,5] PLAT: Detected BMC platform AMI
[ 51.239417466,5] CENTAUR: Found centaur for chip 0x0 channel 4
[ 51.239524825,5] CENTAUR: FSI host: 0x0 cMFSI0 port 7
[ 51.241283553,5] CENTAUR: Found centaur for chip 0x0 channel 5
[ 51.241759761,5] CENTAUR: FSI host: 0x0 cMFSI0 port 6
[ 51.242362656,5] PSI[0x000]: Found PSI bridge [active=0]
[ 51.242690427,5] PSI[0x008]: Found PSI bridge [active=0]
[ 51.245117930,5] CPU: All 128 processors called in...
[ 2.472212005,5] FLASH: Found system flash: Macronix MXxxL51235F id:0
[ 2.472354468,5] BT: Interface initialized, IO 0x00e4
[ 3.421491873,5] NVRAM: Size is 576 KB
[ 4.095942958,5] STB: secure mode off
[ 4.096004331,5] STB: trusted mode off
[ 4.096965839,5] CAPI: Preloading ucode 200ea
[ 4.097023615,5] FLASH: Queueing preload of 2/200ea
[ 4.097202595,5] FLASH: Queueing preload of 0/0
[ 4.097723471,5] FLASH: Queueing preload of 1/0
[ 4.097739635,7] FFS: Partition map size: 0x1000
[ 4.101069429,7] FLASH: CAPP partition has ECC
[ 4.117588444,5] STB: sb_verify skipped resource 2, secure_mode=0
[ 4.117607170,5] Chip 0 Found PBCQ0 at /xscom@3fc0000000000/pbcq@2012000
[ 4.117610665,7] PHB3[0:0]: X[PE]=0x02012000 X[PCI]=0x09012000
X[SPCI]=0x09013c00
[ 4.117690635,7] PHB3[0:0] REGS = 0x0003fffe40000000 [4k]
[ 4.124862367,7] PHB3[0:0] PCIBAR = 0x0003fffe40000000
[ 4.144741905,7] PHB3[0:0] MMIO0 = 0x0000200000000000 [0x0000010000000000]
[ 4.147663099,7] PHB3[0:0] MMIO1 = 0x00003fe000000000 [0x0000000080000000]
[ 4.151015049,7] PHB3[0:0] BAREN = 0xf800000000000000
[ 4.151018735,7] PHB3[0:0] NEWBAREN = 0xf800000000000000
[ 4.152491015,7] PHB3[0:0] IRSNC = 0x0100000000000000
[ 4.177266431,5] STB: tb_measure skipped resource 2, trusted_mode=0
[ 4.177266472,7] PHB3[0:0] IRSNM = 0xff00000000000000
[ 4.177269336,7] PHB3[0:0] LSI = 0xff00000000000000
[ 4.177278668,5] Chip 0 Found PBCQ1 at /xscom@3fc0000000000/pbcq@2012400
[ 4.177282022,7] PHB3[0:1]: X[PE]=0x02012400 X[PCI]=0x09012400
X[SPCI]=0x09013c40
[ 4.178715842,7] PHB3[0:1] REGS = 0x0003fffe40100000 [4k]
[ 4.183043807,7] PHB3[0:1] PCIBAR = 0x0003fffe40100000
[ 4.190163295,5] Chip 8 Found PBCQ0 at /xscom@3fc4000000000/pbcq@2012000
[ 4.208231423,5] Chip 8 Found PBCQ1 at /xscom@3fc4000000000/pbcq@2012400
[ 7.170627939,5] Chip 8 Found PBCQ2 at /xscom@3fc4000000000/pbcq@2012800
[ 8.269331117,3] PHB#0000: Base location code not found !
[ 13.422844377,5] STB: sb_verify skipped resource 0, secure_mode=0
[ 13.422853191,7] BT: seq 0x05 netfn 0x06 cmd 0x06: Message sent to host
[ 13.423112031,5] STB: tb_measure skipped resource 0, trusted_mode=0
[ 13.425274455,3] FLASH: No ROOTFS partition
[ 13.435729110,3] PHB#0001: Base location code not found !
[ 13.497563875,3] PHB#0020: Base location code not found !
[ 14.047321740,3] PHB#0021: Base location code not found !
[ 14.109002459,3] PHB#0022: Base location code not found !
[ 14.170907665,5] PCI: Resetting PHBs...
[ 15.273761743,5] PCI: Probing slots...
[ 16.432898479,5] PHB#0000:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff
SLOT=Slot5
[ 16.434393023,5] PHB#0001:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff
SLOT=Slot4
[ 16.434910029,5] PHB#0020:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff
SLOT=Slot2
[ 16.435845882,5] PHB#0021:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..15
SLOT=Backplane PLX
[ 16.438571433,5] PHB#0021:01:00.0 [SWUP] 10b5 8725 R:ca C:060400 B:02..15
LOC_CODE=Backplane PLX
[ 16.440061205,5] PHB#0021:02:01.0 [SWDN] 10b5 8725 R:ca C:060400 B:03..07
SLOT=Slot3
[ 16.445628911,5] PHB#0021:02:08.0 [SWDN] 10b5 8725 R:ca C:060400 B:08..0c
[ 16.447124810,5] PHB#0021:02:09.0 [SWDN] 10b5 8725 R:ca C:060400 B:0d..0d
SLOT=Backplane USB
[ 16.449944597,5] PHB#0021:0d:00.0 [EP ] 104c 8241 R:02 C:0c0330 (
usb-xhci) LOC_CODE=Backplane USB
[ 16.451469746,5] PHB#0021:02:0a.0 [SWDN] 10b5 8725 R:ca C:060400 B:0e..0e
SLOT=Backplane SATA
[ 16.485306975,5] PHB#0021:0e:00.0 [LGCY] 1b4b 9235 R:11 C:010601 (
sata) LOC_CODE=Backplane SATA
[ 16.490275695,5] PHB#0021:02:0b.0 [SWDN] 10b5 8725 R:ca C:060400 B:0f..10
SLOT=Backplane BMC
[ 16.491741344,5] PHB#0021:0f:00.0 [ETOX] 1a03 1150 R:03 C:060400 B:10..10
LOC_CODE=Backplane BMC
[ 16.493424358,5] PHB#0021:10:00.0 [PCID] 1a03 2000 R:30 C:030000 (
vga) LOC_CODE=Backplane BMC
[ 16.496259662,5] PHB#0021:02:0c.0 [SWDN] 10b5 8725 R:ca C:060400 B:11..15
Petitboot (v1.4.4-e414dbe) 8335-GTA 0000000000000000
──────────────────────────────────────────────────────────────────────────────
Ubuntu, with Linux 5.4.0-7-generic (recovery mode)
Ubuntu, with Linux 5.4.0-7-generic
Ubuntu
[Network: enP4p1s0f0 / 98:be:94:01:1f:a4]
netboot enP4p1s0f0 (pxelinux.0)
[Network: enP4p1s0f2 / 98:be:94:01:1f:a6]
netboot enP4p1s0f2 (pxelinux.0)
[Network: enP4p1s0f1 / 98:be:94:01:1f:a5]
netboot enP4p1s0f1 (pxelinux.0)
[Network: enP4p1s0f3 / 98:be:94:01:1f:a7]
netboot enP4p1s0f3 (pxelinux.0)
System information
System configuration
System status log
Language
Rescan devices
Retrieve config from URL
*Exit to shell
──────────────────────────────────────────────────────────────────────────────
Enter=accept, e=edit, n=new, x=exit, l=language, g=log, h=help
The system is going down NOW!ig from tftp://10.245.71.3/ppc64el/pxelinux.cfg/01-
Sent SIGTERM to all processes
Sent SIGKILL to all processes
cpu 0x78: Vector: 300 (Data Access) at [c0000007ff3b3a00]
pc: c0000000004a5a0c: xhci_irq+0x44c/0x18a0
lr: c0000000004a5604: xhci_irq+0x44/0x18a0
sp: c0000007ff3b3c80
msr: 9000000000009033
dar: b0
dsisr: 40000000
current = 0xc000000001300280
paca = 0xc00000000fe96800 softe: 0 irq_happened: 0x01
pid = 0, comm = swapper/120
enter ? for help
[c0000007ff3b3c80] c0000000004a6bcc xhci_irq+0x160c/0x18a0 (unreliable)
[c0000007ff3b3e00] c000000000096bd8 handle_irq_event_percpu+0x58/0x170
[c0000007ff3b3eb0] c000000000096d5c handle_irq_event+0x6c/0x9c
[c0000007ff3b3ee0] c00000000009ac18 handle_fasteoi_irq+0xc8/0x184
[c0000007ff3b3f10] c0000000000961a0 generic_handle_irq+0x34/0x54
[c0000007ff3b3f30] c00000000000df28 __do_irq+0xb4/0xd0
[c0000007ff3b3f90] c000000000019d58 call_do_irq+0x14/0x24
[c00000000133ba80] c00000000000dfd4 do_IRQ+0x90/0xcc
[c00000000133bad0] c0000000000021a8 hardware_interrupt_common+0x128/0x180
--- Exception: 501 (Hardware Interrupt) at c00000000000d8f0
arch_local_irq_restore+0x70/0x80
[c00000000133bdc0] 0000000000000001 (unreliable)
[c00000000133bde0] c0000000004db3e0 cpuidle_enter_state+0x1c8/0x238
[c00000000133be30] c00000000008c39c cpu_startup_entry+0x250/0x2ec
[c00000000133bee0] c00000000000b4d8 rest_init+0x9c/0xb0
[c00000000133bf00] c0000000007b3bf8 start_kernel+0x510/0x518
[c00000000133bf90] c000000000008c60 start_here_common+0x20/0x440
78:mon>
Was able to reboot back into a previous kernel w/o any issues.
** Affects: linux (Ubuntu)
Importance: High
Status: New
** Changed in: linux (Ubuntu)
Importance: Undecided => High
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1855143
Title:
5.4.0-7 kernel crash on boot on power box
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1855143/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs