[Group.of.nepali.translators] [Bug 1645826] Re: Crash@pcibios_set_pcie_reset_state+0x118/0x280 in capiredp01 with latest level - 160823-GA3-FlashGT
** Changed in: linux (Ubuntu Xenial) Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of नेपाली भाषा समायोजकहरुको समूह, which is subscribed to Xenial. Matching subscriptions: Ubuntu 16.04 Bugs https://bugs.launchpad.net/bugs/1645826 Title: Crash@pcibios_set_pcie_reset_state+0x118/0x280 in capiredp01 with latest level - 160823-GA3-FlashGT Status in linux package in Ubuntu: Fix Released Status in linux source package in Xenial: Fix Released Bug description: == Comment: #26 - Andrew Donnellan - 2016-11-24 19:55:52 == Ubuntu kernel team, please apply the following fixup to the Xenial kernel tree. -- From 631804b1548b035cada4b2c14ab708310a8aa607 Mon Sep 17 00:00:00 2001 From: Gavin ShanDate: Mon, 12 Sep 2016 10:50:16 +1000 Subject: [PATCH] powerpc/eeh: Remove EEH_PE_PRI_BUS in full hotplug recovery commit 59ae8c6d5b45 ("powerpc/eeh: Fix invalid cached PE primary bus") was wrongly backporting upstream commit a3aa256b7258: It should clear the PE's flag (EEH_PE_PRI_BUS) in full hotplug instead of partial hotplug scenario. This fixes the issue by clearing EEH_PE_PRI_BUS in full hotplug scenario only. Fixes: 59ae8c6d5b45 ("powerpc/eeh: Fix invalid cached PE primary bus") Signed-off-by: Gavin Shan --- arch/powerpc/kernel/eeh_driver.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/arch/powerpc/kernel/eeh_driver.c b/arch/powerpc/kernel/eeh_driver.c index c453b53..829ab8e 100644 --- a/arch/powerpc/kernel/eeh_driver.c +++ b/arch/powerpc/kernel/eeh_driver.c @@ -630,13 +630,13 @@ static int eeh_reset_device(struct eeh_pe *pe, struct pci_bus *bus) * rebuilt when adding PCI devices. */ eeh_pe_traverse(pe, eeh_pe_detach_dev, NULL); + eeh_pe_state_clear(pe, EEH_PE_PRI_BUS); pcibios_add_pci_devices(bus); } else if (frozen_bus && removed) { pr_info("EEH: Sleep 5s ahead of partial hotplug\n"); ssleep(5); eeh_pe_traverse(pe, eeh_pe_detach_dev, NULL); - eeh_pe_state_clear(pe, EEH_PE_PRI_BUS); pcibios_add_pci_devices(frozen_bus); } eeh_pe_state_clear(pe, EEH_PE_KEEP); -- 2.1.0 Historical context: State: Open by: ukrishn on 08 September 2016 18:15:32 Seems like this is easily recreatable. Mike Vageline just hit the issue by doing couple of PERST on a FlashGT card. Here is the note from him - I had downloaded 0908, then perst, modprob'd to verify 0908, then rmmod, then perst to factory, modprob'd, verified it was 0903, rmmod, then perst again to user... xmon p8tul12-lp1 login: [ 647.501340] Fatal Hypervisor Maintenance interrupt [Recovered] [ 647.501348] EEH: Fenced PHB#2 detected, location: N/A [ 647.501528] Error detail: Malfunction Alert [ 647.501590]HMER: 8040 [ 647.501637]Unknown Core check stop. [ 647.502584] Fatal Hypervisor Maintenance interrupt [Recovered] [ 647.502588] Error detail: Malfunction Alert [ 647.502590]HMER: 8040 [ 647.502591]Unknown Core check stop. [ 665.369299] PCI: Memory resource 0 not set for host bridge /pciex@3fffe4040/pci@0/device@0 (domain 5) [ 676.293638] Back level AFU, please upgrade. AFU version 160903N0 interface version 0x [ 676.293842] cxlflash 0005:00:00.0: cxlflash_probe: call to init_afu failed rc=-22! [ 704.863543] Unable to handle kernel paging request for data at address 0x0110 [ 704.863673] Faulting instruction address: 0xc0083e08 cpu 0x2: Vector: 300 (Data Access) at [c00f01cbf7d0] pc: c0083e08: pnv_eeh_reset+0x68/0x170 lr: c0083df8: pnv_eeh_reset+0x58/0x170 sp: c00f01cbfa50 msr: 90009033 dar: 110 dsisr: 4000 current = 0xc00f014bc8e0 paca= 0xc7b41300 softe: 0irq_happened: 0x01 pid = 10688, comm = sh enter ? for help [c00f01cbfad0] c0038bb8 pcibios_set_pcie_reset_state+0x118/0x280 [c00f01cbfb50] c05e9450 pci_set_pcie_reset_state+0x30/0x50 [c00f01cbfb80] d7c9f7bc cxl_pci_reset+0x5c/0xc0 [cxl] [c00f01cbfbf0] d7c992a4 reset_adapter_store+0x84/0x120 [cxl] [c00f01cbfc80] c06d2378 dev_attr_store+0x68/0xa0 [c00f01cbfcc0] c0398290 sysfs_kf_write+0x80/0xb0 [c00f01cbfd00] c03971a8 kernfs_fop_write+0x188/0x200 [c00f01cbfd50] c02e1a6c __vfs_write+0x6c/0xe0 [c00f01cbfd90] c02e27a0 vfs_write+0xc0/0x230 [c00f01cbfde0] c02e37dc SyS_write+0x6c/0x110 [c00f01cbfe30] c0009204 system_call+0x38/0xb4 --- Exception: c01 (System
[Group.of.nepali.translators] [Bug 1645826] Re: Crash@pcibios_set_pcie_reset_state+0x118/0x280 in capiredp01 with latest level - 160823-GA3-FlashGT
https://lists.ubuntu.com/archives/kernel-team/2016-November/081180.html ** Also affects: linux (Ubuntu Xenial) Importance: Undecided Status: New ** Changed in: linux (Ubuntu Xenial) Status: New => In Progress ** Changed in: linux (Ubuntu Xenial) Assignee: (unassigned) => Tim Gardner (timg-tpi) ** Changed in: linux (Ubuntu) Status: New => Fix Released ** Changed in: linux (Ubuntu) Assignee: Taco Screen team (taco-screen-team) => (unassigned) -- You received this bug notification because you are a member of नेपाली भाषा समायोजकहरुको समूह, which is subscribed to Xenial. Matching subscriptions: Ubuntu 16.04 Bugs https://bugs.launchpad.net/bugs/1645826 Title: Crash@pcibios_set_pcie_reset_state+0x118/0x280 in capiredp01 with latest level - 160823-GA3-FlashGT Status in linux package in Ubuntu: Fix Released Status in linux source package in Xenial: In Progress Bug description: == Comment: #26 - Andrew Donnellan - 2016-11-24 19:55:52 == Ubuntu kernel team, please apply the following fixup to the Xenial kernel tree. -- From 631804b1548b035cada4b2c14ab708310a8aa607 Mon Sep 17 00:00:00 2001 From: Gavin ShanDate: Mon, 12 Sep 2016 10:50:16 +1000 Subject: [PATCH] powerpc/eeh: Remove EEH_PE_PRI_BUS in full hotplug recovery commit 59ae8c6d5b45 ("powerpc/eeh: Fix invalid cached PE primary bus") was wrongly backporting upstream commit a3aa256b7258: It should clear the PE's flag (EEH_PE_PRI_BUS) in full hotplug instead of partial hotplug scenario. This fixes the issue by clearing EEH_PE_PRI_BUS in full hotplug scenario only. Fixes: 59ae8c6d5b45 ("powerpc/eeh: Fix invalid cached PE primary bus") Signed-off-by: Gavin Shan --- arch/powerpc/kernel/eeh_driver.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/arch/powerpc/kernel/eeh_driver.c b/arch/powerpc/kernel/eeh_driver.c index c453b53..829ab8e 100644 --- a/arch/powerpc/kernel/eeh_driver.c +++ b/arch/powerpc/kernel/eeh_driver.c @@ -630,13 +630,13 @@ static int eeh_reset_device(struct eeh_pe *pe, struct pci_bus *bus) * rebuilt when adding PCI devices. */ eeh_pe_traverse(pe, eeh_pe_detach_dev, NULL); + eeh_pe_state_clear(pe, EEH_PE_PRI_BUS); pcibios_add_pci_devices(bus); } else if (frozen_bus && removed) { pr_info("EEH: Sleep 5s ahead of partial hotplug\n"); ssleep(5); eeh_pe_traverse(pe, eeh_pe_detach_dev, NULL); - eeh_pe_state_clear(pe, EEH_PE_PRI_BUS); pcibios_add_pci_devices(frozen_bus); } eeh_pe_state_clear(pe, EEH_PE_KEEP); -- 2.1.0 Historical context: State: Open by: ukrishn on 08 September 2016 18:15:32 Seems like this is easily recreatable. Mike Vageline just hit the issue by doing couple of PERST on a FlashGT card. Here is the note from him - I had downloaded 0908, then perst, modprob'd to verify 0908, then rmmod, then perst to factory, modprob'd, verified it was 0903, rmmod, then perst again to user... xmon p8tul12-lp1 login: [ 647.501340] Fatal Hypervisor Maintenance interrupt [Recovered] [ 647.501348] EEH: Fenced PHB#2 detected, location: N/A [ 647.501528] Error detail: Malfunction Alert [ 647.501590]HMER: 8040 [ 647.501637]Unknown Core check stop. [ 647.502584] Fatal Hypervisor Maintenance interrupt [Recovered] [ 647.502588] Error detail: Malfunction Alert [ 647.502590]HMER: 8040 [ 647.502591]Unknown Core check stop. [ 665.369299] PCI: Memory resource 0 not set for host bridge /pciex@3fffe4040/pci@0/device@0 (domain 5) [ 676.293638] Back level AFU, please upgrade. AFU version 160903N0 interface version 0x [ 676.293842] cxlflash 0005:00:00.0: cxlflash_probe: call to init_afu failed rc=-22! [ 704.863543] Unable to handle kernel paging request for data at address 0x0110 [ 704.863673] Faulting instruction address: 0xc0083e08 cpu 0x2: Vector: 300 (Data Access) at [c00f01cbf7d0] pc: c0083e08: pnv_eeh_reset+0x68/0x170 lr: c0083df8: pnv_eeh_reset+0x58/0x170 sp: c00f01cbfa50 msr: 90009033 dar: 110 dsisr: 4000 current = 0xc00f014bc8e0 paca= 0xc7b41300 softe: 0irq_happened: 0x01 pid = 10688, comm = sh enter ? for help [c00f01cbfad0] c0038bb8 pcibios_set_pcie_reset_state+0x118/0x280 [c00f01cbfb50] c05e9450 pci_set_pcie_reset_state+0x30/0x50 [c00f01cbfb80] d7c9f7bc cxl_pci_reset+0x5c/0xc0 [cxl] [c00f01cbfbf0] d7c992a4 reset_adapter_store+0x84/0x120 [cxl] [c00f01cbfc80] c06d2378 dev_attr_store+0x68/0xa0