[Bug 1354459] Re: kernel crash power 8 bare metal
[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

** Changed in: linux (Ubuntu)
       Status: Incomplete => Expired

-- 
You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1354459

Title:
  kernel crash power 8 bare metal

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1354459/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1354459] Re: kernel crash power 8 bare metal
This looks very similar to bug 1425699. Is this still an issue with the latest kernel?

** Changed in: linux (Ubuntu)
       Status: Expired => Incomplete
[Bug 1354459] Re: kernel crash power 8 bare metal
[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

** Changed in: linux (Ubuntu)
       Status: Incomplete => Expired
[Bug 1354459] Re: kernel crash power 8 bare metal
Please retest with the neihu 140919-1 or later image, which uses kernel 3.13.0-36.63+hwe3.

** Tags added: bdw-bug
[Bug 1354459] Re: kernel crash power 8 bare metal
Also, can you check the firmware/boot level on the SAS adapters? Thanks!

iprconfig -1 detail information for each adapter.
[Bug 1354459] Re: kernel crash power 8 bare metal
** Attachment added: another full kernel boot with crash
   https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1354459/+attachment/4175626/+files/power-nv-crash-log.txt
[Bug 1354459] Re: kernel crash power 8 bare metal
We took an EEH error:

[ 44.793204] pnv_pci_dump_phb_diag_data: Unrecognized ioType 33554432
[ 44.793267] EEH: Frozen PE#5 detected on PHB#3
[ 44.793318] CPU: 40 PID: 209 Comm: kworker/40:0 Not tainted 3.13.0-27-generic #50-Ubuntu
[ 44.793396] Workqueue: events .work_for_cpu_fn
[ 44.793458] Call Trace:
[ 44.793487] [c00fe6edb540] [c0016af0] .show_stack+0x170/0x290 (unreliable)
[ 44.793575] [c00fe6edb630] [c0966fc0] .dump_stack+0x88/0xb4
[ 44.793651] [c00fe6edb6b0] [c00364b0] .eeh_dev_check_failure+0x430/0x480
[ 44.793737] [c00fe6edb760] [c0036584] .eeh_check_failure+0x84/0xe0
[ 44.793827] [c00fe6edb7f0] [deea33e0] .ipr_mask_and_clear_interrupts+0x190/0x1d0 [ipr]
[ 44.793928] [c00fe6edb8a0] [deeaa394] .ipr_probe_ioa+0xc24/0x1370 [ipr]
[ 44.794017] [c00fe6edb9d0] [deeb25c4] .ipr_probe+0x44/0x4c0 [ipr]
[ 44.794093] [c00fe6edbac0] [c0516cfc] .local_pci_probe+0x4c/0xe0
[ 44.794167] [c00fe6edbb40] [c00bae68] .work_for_cpu_fn+0x38/0x60
[ 44.794242] [c00fe6edbbc0] [c00bf628] .process_one_work+0x1a8/0x4d0
[ 44.794327] [c00fe6edbc60] [c00c04fc] .worker_thread+0x38c/0x4a0
[ 44.794401] [c00fe6edbd30] [c00c98a0] .kthread+0x110/0x130
[ 44.794476] [c00fe6edbe30] [c000a460] .ret_from_kernel_thread+0x5c/0x7c
[ 44.794572] EEH: Detected PCI bus error on PHB#3-PE#5
[ 44.794632] EEH: This PCI device has failed 1 times in the last hour
[ 44.794693] EEH: Notify device drivers to shutdown
[ 44.794749] Unable to handle kernel paging request for data at address 0x0008
[ 44.794821] Faulting instruction address: 0xdeea205c
[ 44.794883] Oops: Kernel access of bad area, sig: 11 [#1]
[ 44.794931] SMP NR_CPUS=2048 NUMA PowerNV
[ 44.794982] Modules linked in: ipr(+)
[ 44.795046] CPU: 9 PID: 810 Comm: eehd Not tainted 3.13.0-27-generic #50-Ubuntu
[ 44.795120] task: c00fdf7066f0 ti: c00fe33a4000 task.ti: c00fe33a4000
[ 44.795192] NIP: deea205c LR: deea2a14 CTR: c064f720
[ 44.795264] REGS: c00fe33a75b0 TRAP: 0300 Not tainted (3.13.0-27-generic)
[ 44.795336] MSR: 90019033 SF,HV,EE,ME,IR,DR,RI,LE CR: 229d0028 XER: 2000
[ 44.795641] CFAR: c0009318 DAR: 0008 DSISR: 4000 SOFTE: 0
GPR00: deea2a14 c00fe33a7830 deec4c58 c00fda20cc60
GPR04: deebc178 0100 90019033
GPR08: 0001 c064f720
GPR12: deeb4978 cfe41f80 c00c9790 c01fd8401600
GPR16:
GPR20: c0bf9528
GPR24: c0bf9500 deebc178 0100
GPR28: c00fda20c538 c00fda20c538
[ 44.797447] NIP [deea205c] .ipr_get_free_ipr_cmnd+0x2c/0x90 [ipr]
[ 44.797567] LR [deea2a14] ._ipr_initiate_ioa_reset+0xe4/0x130 [ipr]
[ 44.797683] Call Trace:
[ 44.797736] [c00fe33a78b0] [deea2a14] ._ipr_initiate_ioa_reset+0xe4/0x130 [ipr]
[ 44.797900] [c00fe33a7960] [deeab458] .ipr_pci_error_detected+0x1c8/0x230 [ipr]
[ 44.798063] [c00fe33a7a00] [c00396bc] .eeh_report_error+0xac/0x120
[ 44.798222] [c00fe33a7a90] [c003840c] .eeh_pe_dev_traverse+0x9c/0x170
[ 44.798382] [c00fe33a7b30] [c0039ce8] .eeh_handle_normal_event+0x128/0x3d0
[ 44.798542] [c00fe33a7bc0] [c0039fd8] .eeh_handle_event+0x48/0x2f0
[ 44.798702] [c00fe33a7c70] [c003a39c] .eeh_event_handler+0x11c/0x1d0
[ 44.798862] [c00fe33a7d30] [c00c98a0] .kthread+0x110/0x130
[ 44.799000] [c00fe33a7e30] [c000a460] .ret_from_kernel_thread+0x5c/0x7c
[ 44.799158] Instruction dump:
[ 44.799225] 6042 7c0802a6 fbe1fff8 f8010010 f821ff81 7c7f1b78 4808 e8410028
[ 44.799453] 7fe3fb78 e9230729 7fa91840 41de0058 e8e90008 e8c9 3d10 3d400020
[ 44.799678] ---[ end trace 7439fee11bbab045 ]---

This is usually a sign of bad hardware. EEH was not ported for 3.13 but should work on 3.16.
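For anyone triaging similar logs, here is a minimal Python sketch that pulls the frozen PE and the faulting function out of a log like the one above. The regular expressions are assumptions based only on the line formats seen in this log, and the function name `summarize_eeh` is hypothetical; other kernel versions may print these lines differently.

```python
import re

# Assumed format, copied from this log: "EEH: Frozen PE#5 detected on PHB#3"
FROZEN_PE = re.compile(r"EEH: Frozen PE#(\w+) detected on PHB#(\w+)")

# Assumed format, copied from this log:
# "NIP [deea205c] .ipr_get_free_ipr_cmnd+0x2c/0x90 [ipr]"
NIP_LINE = re.compile(
    r"NIP \[[0-9a-f]+\] \.?([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(\w+)\])?"
)


def summarize_eeh(log_text):
    """Return frozen (PHB, PE) pairs and the faulting (function, module)."""
    # Each frozen-PE line yields one (phb, pe) tuple, both kept as strings.
    frozen = [(m.group(2), m.group(1)) for m in FROZEN_PE.finditer(log_text)]
    # The first "NIP [addr] symbol" line names the faulting function.
    nip = NIP_LINE.search(log_text)
    faulting = (nip.group(1), nip.group(2)) if nip else None
    return {"frozen": frozen, "faulting": faulting}
```

Feeding it the text of a saved dmesg or console log (for example the attached power-nv-crash-log.txt) gives a quick view of which PHB/PE froze and which driver function the oops landed in.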
[Bug 1354459] Re: kernel crash power 8 bare metal
I spoke to Gavin about this:

1. EEH has endian issues in 3.13; it should work in 3.16. (EEH is our I/O error recovery mechanism.)

2. There was an issue in the IPR driver with early EEH errors, fixed in 3.15 with commit 6270e5932a01 "[SCSI] ipr: Handle early EEH".

Would it be possible to update to the Utopic kernel?
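As a rough aid, the two version thresholds mentioned above can be checked against a kernel release string with a short Python sketch. The helper names and dictionary keys are hypothetical, and the check only compares upstream major.minor numbers: it cannot tell whether a distribution kernel carries a backport of the fix.

```python
import re

# Thresholds taken from the comment above (upstream versions).
IPR_EARLY_EEH_FIX = (3, 15)  # commit 6270e5932a01 "[SCSI] ipr: Handle early EEH"
EEH_EXPECTED_OK = (3, 16)    # EEH endian issues in 3.13, should work in 3.16


def kernel_major_minor(release):
    """Parse '3.13.0-27-generic' into (3, 13)."""
    m = re.match(r"(\d+)\.(\d+)", release)
    if m is None:
        raise ValueError("unrecognized kernel release: %r" % release)
    return int(m.group(1)), int(m.group(2))


def check_kernel(release):
    """Compare a release string against the two thresholds."""
    version = kernel_major_minor(release)
    return {
        "ipr_early_eeh_fix": version >= IPR_EARLY_EEH_FIX,
        "eeh_expected_to_work": version >= EEH_EXPECTED_OK,
    }
```

On the affected machine this could be called as `check_kernel(platform.release())` after `import platform`; for the 3.13.0-27-generic kernel in the trace, both entries come back False.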
[Bug 1354459] Re: kernel crash power 8 bare metal
Do you have a list of steps to reproduce this bug?

Did this issue start happening after an update/upgrade? Was there a kernel version where you were not having this particular problem? This will help determine if the problem you are seeing is the result of the introduction of a regression, and when this regression was introduced. If this is a regression, we can perform a kernel bisect to identify the commit that introduced the problem.

** Changed in: linux (Ubuntu)
   Importance: Undecided => High

** Tags added: kernel-da-key ppc64el

** Tags added: trusty