Re-installed the kernel, it's booting fine now. I wonder if I had some
kind of corruption from a previous test crash. Can't reproduce this now.
Marking it as Invalid.

** Changed in: linux (Ubuntu)
       Status: New => Invalid

** Changed in: linux (Ubuntu)
     Assignee: (unassigned) => Colin Ian King (colin-king)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1855143

Title:
  5.4.0-7 kernel crash on boot on power box

Status in linux package in Ubuntu:
  Invalid

Bug description:
  boot failures with 5.4.0-7-generic on OPAL power box:

  I was running ADT tests and the machine hung/rebooted. I was unable to
  log in. After I rebooted the machine with the ipmi tool the machine
  crashed with the following kernel output:

  [   51.081421774,5] SkiBoot skiboot-5.4.8-5787ad3 starting...
  [   51.081426316,5] initial console log level: memory 7, driver 5
  [   51.081429224,6] CPU: P8 generation processor(max 8 threads/core)
  [   51.081432044,7] CPU: Boot CPU PIR is 0x0470 PVR is 0x004d0200
  [   51.081435009,7] CPU: Initial max PIR set to 0x1fff
  [   51.082535316,5] OPAL table: 0x300bfc40 .. 0x300c0110, branch table: 
0x30002000
  [   51.082543101,5] FDT: Parsing fdt @0xff00000
  [   51.087692296,5] XSCOM: chip 0x0 at 0x3fc0000000000 [P8 DD2.0]
  [   51.087702232,5] XSCOM: chip 0x8 at 0x3fc4000000000 [P8 DD2.0]
  [   51.087709775,6] XSTOP: XSCOM addr = 0x2010c82, FIR bit = 31
  [   51.087713185,6] MFSI 0:0: Initialized
  [   51.087715462,6] MFSI 0:2: Initialized
  [   51.087717669,6] MFSI 0:1: Initialized
  [   51.087720203,6] MFSI 8:0: Initialized
  [   51.087722365,6] MFSI 8:2: Initialized
  [   51.087724518,6] MFSI 8:1: Initialized
  [   51.088044434,5] LPC: LPC[000]: Initialized, access via XSCOM @0xb0020
  [   51.088162270,5] LPC: LPC: Default bus on chip 0x0
  [   51.088303476,6] MEM: parsing reserved memory from node 
/ibm,hostboot/reserved-memory
  [   51.088313438,7] HOMER: Init chip 0
  [   51.088316406,7]   PBA BAR0 : 0x00000007fd800000
  [   51.088319108,7]   PBA MASK0: 0x0000000000300000
  [   51.088321761,7]   HOMER Image at 0x7fd800000 size 4MB
  [   51.088325579,7]   PBA BAR2 : 0x40000007fda00000
  [   51.088328358,7]   PBA MASK2: 0x0000000000000000
  [   51.088330928,7]   SLW Image at 0x7fda00000 size 1MB
  [   51.088334409,7]   PBA BAR3 : 0x00000007ff800000
  [   51.088337060,7]   PBA MASK3: 0x0000000000700000
  [   51.088339732,7]   OCC Common Area at 0x7ff800000 size 8MB
  [   51.088342594,7] HOMER: Init chip 8
  [   51.088345257,7]   PBA BAR0 : 0x00000007fdc00000
  [   51.088347872,7]   PBA MASK0: 0x0000000000300000
  [   51.088350519,7]   HOMER Image at 0x7fdc00000 size 4MB
  [   51.088354173,7]   PBA BAR2 : 0x40000007fde00000
  [   51.088356860,7]   PBA MASK2: 0x0000000000000000
  [   51.088359365,7]   SLW Image at 0x7fde00000 size 1MB
  [   51.088362788,7]   PBA BAR3 : 0x00000007ff800000
  [   51.088365419,7]   PBA MASK3: 0x0000000000700000
  [   51.088367946,7]   OCC Common Area at 0x7ff800000 size 8MB
  [   51.088387526,7] CPU idle state device tree init
  [   51.088391002,4] SLW: HB-provided idle states property found
  [   51.088567406,7] AST: PNOR LPC offset: 0x0c000000
  [   51.088650577,5] PLAT: Using virtual UART
  [   51.088977615,7] UART: Using LPC IRQ 4
  [   51.203625382,5] PLAT: Detected Firestone platform
  [   51.219765305,5] PLAT: Detected BMC platform AMI
  [   51.239417466,5] CENTAUR: Found centaur for chip 0x0 channel 4
  [   51.239524825,5] CENTAUR:   FSI host: 0x0 cMFSI0 port 7
  [   51.241283553,5] CENTAUR: Found centaur for chip 0x0 channel 5
  [   51.241759761,5] CENTAUR:   FSI host: 0x0 cMFSI0 port 6
  [   51.242362656,5] PSI[0x000]: Found PSI bridge [active=0]
  [   51.242690427,5] PSI[0x008]: Found PSI bridge [active=0]
  [   51.245117930,5] CPU: All 128 processors called in...
  [    2.472212005,5] FLASH: Found system flash: Macronix MXxxL51235F id:0
  [    2.472354468,5] BT: Interface initialized, IO 0x00e4
  [    3.421491873,5] NVRAM: Size is 576 KB
  [    4.095942958,5] STB: secure mode off
  [    4.096004331,5] STB: trusted mode off
  [    4.096965839,5] CAPI: Preloading ucode 200ea
  [    4.097023615,5] FLASH: Queueing preload of 2/200ea
  [    4.097202595,5] FLASH: Queueing preload of 0/0
  [    4.097723471,5] FLASH: Queueing preload of 1/0
  [    4.097739635,7] FFS: Partition map size: 0x1000
  [    4.101069429,7] FLASH: CAPP partition has ECC
  [    4.117588444,5] STB: sb_verify skipped resource 2, secure_mode=0
  [    4.117607170,5] Chip 0 Found PBCQ0 at /xscom@3fc0000000000/pbcq@2012000
  [    4.117610665,7] PHB3[0:0]: X[PE]=0x02012000 X[PCI]=0x09012000 
X[SPCI]=0x09013c00
  [    4.117690635,7] PHB3[0:0] REGS     = 0x0003fffe40000000 [4k]
  [    4.124862367,7] PHB3[0:0] PCIBAR   = 0x0003fffe40000000
  [    4.144741905,7] PHB3[0:0] MMIO0    = 0x0000200000000000 
[0x0000010000000000]
  [    4.147663099,7] PHB3[0:0] MMIO1    = 0x00003fe000000000 
[0x0000000080000000]
  [    4.151015049,7] PHB3[0:0] BAREN    = 0xf800000000000000
  [    4.151018735,7] PHB3[0:0] NEWBAREN = 0xf800000000000000
  [    4.152491015,7] PHB3[0:0] IRSNC    = 0x0100000000000000
  [    4.177266431,5] STB: tb_measure skipped resource 2, trusted_mode=0
  [    4.177266472,7] PHB3[0:0] IRSNM    = 0xff00000000000000
  [    4.177269336,7] PHB3[0:0] LSI      = 0xff00000000000000
  [    4.177278668,5] Chip 0 Found PBCQ1 at /xscom@3fc0000000000/pbcq@2012400
  [    4.177282022,7] PHB3[0:1]: X[PE]=0x02012400 X[PCI]=0x09012400 
X[SPCI]=0x09013c40
  [    4.178715842,7] PHB3[0:1] REGS     = 0x0003fffe40100000 [4k]
  [    4.183043807,7] PHB3[0:1] PCIBAR   = 0x0003fffe40100000
  [    4.190163295,5] Chip 8 Found PBCQ0 at /xscom@3fc4000000000/pbcq@2012000
  [    4.208231423,5] Chip 8 Found PBCQ1 at /xscom@3fc4000000000/pbcq@2012400
  [    7.170627939,5] Chip 8 Found PBCQ2 at /xscom@3fc4000000000/pbcq@2012800
  [    8.269331117,3] PHB#0000: Base location code not found !
  [   13.422844377,5] STB: sb_verify skipped resource 0, secure_mode=0
  [   13.422853191,7] BT: seq 0x05 netfn 0x06 cmd 0x06: Message sent to host
  [   13.423112031,5] STB: tb_measure skipped resource 0, trusted_mode=0
  [   13.425274455,3] FLASH: No ROOTFS partition
  [   13.435729110,3] PHB#0001: Base location code not found !
  [   13.497563875,3] PHB#0020: Base location code not found !
  [   14.047321740,3] PHB#0021: Base location code not found !
  [   14.109002459,3] PHB#0022: Base location code not found !
  [   14.170907665,5] PCI: Resetting PHBs...
  [   15.273761743,5] PCI: Probing slots...
  [   16.432898479,5] PHB#0000:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff 
SLOT=Slot5 
  [   16.434393023,5] PHB#0001:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff 
SLOT=Slot4 
  [   16.434910029,5] PHB#0020:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff 
SLOT=Slot2 
  [   16.435845882,5] PHB#0021:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..15 
SLOT=Backplane PLX 
  [   16.438571433,5] PHB#0021:01:00.0 [SWUP] 10b5 8725 R:ca C:060400 B:02..15 
LOC_CODE=Backplane PLX
  [   16.440061205,5] PHB#0021:02:01.0 [SWDN] 10b5 8725 R:ca C:060400 B:03..07 
SLOT=Slot3 
  [   16.445628911,5] PHB#0021:02:08.0 [SWDN] 10b5 8725 R:ca C:060400 B:08..0c 
  [   16.447124810,5] PHB#0021:02:09.0 [SWDN] 10b5 8725 R:ca C:060400 B:0d..0d 
SLOT=Backplane USB 
  [   16.449944597,5] PHB#0021:0d:00.0 [EP  ] 104c 8241 R:02 C:0c0330 (      
usb-xhci) LOC_CODE=Backplane USB
  [   16.451469746,5] PHB#0021:02:0a.0 [SWDN] 10b5 8725 R:ca C:060400 B:0e..0e 
SLOT=Backplane SATA 
  [   16.485306975,5] PHB#0021:0e:00.0 [LGCY] 1b4b 9235 R:11 C:010601 (         
 sata) LOC_CODE=Backplane SATA
  [   16.490275695,5] PHB#0021:02:0b.0 [SWDN] 10b5 8725 R:ca C:060400 B:0f..10 
SLOT=Backplane BMC 
  [   16.491741344,5] PHB#0021:0f:00.0 [ETOX] 1a03 1150 R:03 C:060400 B:10..10 
LOC_CODE=Backplane BMC
  [   16.493424358,5] PHB#0021:10:00.0 [PCID] 1a03 2000 R:30 C:030000 (         
  vga) LOC_CODE=Backplane BMC
  [   16.496259662,5] PHB#0021:02:0c.0 [SWDN] 10b5 8725 R:ca C:060400 B:11..15 
   Petitboot (v1.4.4-e414dbe)                           8335-GTA 
0000000000000000
   
──────────────────────────────────────────────────────────────────────────────
      Ubuntu, with Linux 5.4.0-7-generic (recovery mode)
      Ubuntu, with Linux 5.4.0-7-generic
      Ubuntu
    [Network: enP4p1s0f0 / 98:be:94:01:1f:a4]
      netboot enP4p1s0f0 (pxelinux.0)
    [Network: enP4p1s0f2 / 98:be:94:01:1f:a6]
      netboot enP4p1s0f2 (pxelinux.0)
    [Network: enP4p1s0f1 / 98:be:94:01:1f:a5]
      netboot enP4p1s0f1 (pxelinux.0)
    [Network: enP4p1s0f3 / 98:be:94:01:1f:a7]
      netboot enP4p1s0f3 (pxelinux.0)

    System information
    System configuration
    System status log
    Language
    Rescan devices
    Retrieve config from URL
   *Exit to shell                                        
   
──────────────────────────────────────────────────────────────────────────────
   Enter=accept, e=edit, n=new, x=exit, l=language, g=log, h=help
  The system is going down NOW!ig from 
tftp://10.245.71.3/ppc64el/pxelinux.cfg/01-
  Sent SIGTERM to all processes
  Sent SIGKILL to all processes
  cpu 0x78: Vector: 300 (Data Access) at [c0000007ff3b3a00]
      pc: c0000000004a5a0c: xhci_irq+0x44c/0x18a0
      lr: c0000000004a5604: xhci_irq+0x44/0x18a0
      sp: c0000007ff3b3c80
     msr: 9000000000009033
     dar: b0
   dsisr: 40000000
    current = 0xc000000001300280
    paca    = 0xc00000000fe96800         softe: 0        irq_happened: 0x01
      pid   = 0, comm = swapper/120
  enter ? for help
  [c0000007ff3b3c80] c0000000004a6bcc xhci_irq+0x160c/0x18a0 (unreliable)
  [c0000007ff3b3e00] c000000000096bd8 handle_irq_event_percpu+0x58/0x170
  [c0000007ff3b3eb0] c000000000096d5c handle_irq_event+0x6c/0x9c
  [c0000007ff3b3ee0] c00000000009ac18 handle_fasteoi_irq+0xc8/0x184
  [c0000007ff3b3f10] c0000000000961a0 generic_handle_irq+0x34/0x54
  [c0000007ff3b3f30] c00000000000df28 __do_irq+0xb4/0xd0
  [c0000007ff3b3f90] c000000000019d58 call_do_irq+0x14/0x24
  [c00000000133ba80] c00000000000dfd4 do_IRQ+0x90/0xcc
  [c00000000133bad0] c0000000000021a8 hardware_interrupt_common+0x128/0x180
  --- Exception: 501 (Hardware Interrupt) at c00000000000d8f0 
arch_local_irq_restore+0x70/0x80
  [c00000000133bdc0] 0000000000000001 (unreliable)
  [c00000000133bde0] c0000000004db3e0 cpuidle_enter_state+0x1c8/0x238
  [c00000000133be30] c00000000008c39c cpu_startup_entry+0x250/0x2ec
  [c00000000133bee0] c00000000000b4d8 rest_init+0x9c/0xb0
  [c00000000133bf00] c0000000007b3bf8 start_kernel+0x510/0x518
  [c00000000133bf90] c000000000008c60 start_here_common+0x20/0x440
  78:mon> 

  Was able to reboot back into a previous kernel w/o any issues.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1855143/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to