[Group.of.nepali.translators] [Bug 1645826] Re: Crash@pcibios_set_pcie_reset_state+0x118/0x280 in capiredp01 with latest level - 160823-GA3-FlashGT

2017-01-11 Thread Luis Henriques
** Changed in: linux (Ubuntu Xenial)
   Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of नेपाली
भाषा समायोजकहरुको समूह, which is subscribed to Xenial.
Matching subscriptions: Ubuntu 16.04 Bugs
https://bugs.launchpad.net/bugs/1645826

Title:
  Crash@pcibios_set_pcie_reset_state+0x118/0x280 in capiredp01 with
  latest level - 160823-GA3-FlashGT

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Xenial:
  Fix Released

Bug description:
  == Comment: #26 - Andrew Donnellan - 2016-11-24 19:55:52 ==
  Ubuntu kernel team, please apply the following fixup to the Xenial
  kernel tree.

  --

  From 631804b1548b035cada4b2c14ab708310a8aa607 Mon Sep 17 00:00:00 2001
  From: Gavin Shan 
  Date: Mon, 12 Sep 2016 10:50:16 +1000
  Subject: [PATCH] powerpc/eeh: Remove EEH_PE_PRI_BUS in full hotplug recovery

  commit 59ae8c6d5b45 ("powerpc/eeh: Fix invalid cached PE primary
  bus") was wrongly backporting upstream commit a3aa256b7258: It
  should clear the PE's flag (EEH_PE_PRI_BUS) in full hotplug instead
  of partial hotplug scenario.

  This fixes the issue by clearing EEH_PE_PRI_BUS in full hotplug
  scenario only.

  Fixes: 59ae8c6d5b45 ("powerpc/eeh: Fix invalid cached PE primary bus")
  Signed-off-by: Gavin Shan 
  ---
   arch/powerpc/kernel/eeh_driver.c | 2 +-
   1 file changed, 1 insertion(+), 1 deletion(-)

  diff --git a/arch/powerpc/kernel/eeh_driver.c b/arch/powerpc/kernel/eeh_driver.c
  index c453b53..829ab8e 100644
  --- a/arch/powerpc/kernel/eeh_driver.c
  +++ b/arch/powerpc/kernel/eeh_driver.c
  @@ -630,13 +630,13 @@ static int eeh_reset_device(struct eeh_pe *pe, struct pci_bus *bus)
   		 * rebuilt when adding PCI devices.
   		 */
   		eeh_pe_traverse(pe, eeh_pe_detach_dev, NULL);
  +		eeh_pe_state_clear(pe, EEH_PE_PRI_BUS);
   		pcibios_add_pci_devices(bus);
   	} else if (frozen_bus && removed) {
   		pr_info("EEH: Sleep 5s ahead of partial hotplug\n");
   		ssleep(5);
   
   		eeh_pe_traverse(pe, eeh_pe_detach_dev, NULL);
  -		eeh_pe_state_clear(pe, EEH_PE_PRI_BUS);
   		pcibios_add_pci_devices(frozen_bus);
   	}
   	eeh_pe_state_clear(pe, EEH_PE_KEEP);
  -- 
  2.1.0
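
The effect of moving that one call can be modelled outside the kernel. The following is a minimal sketch, not kernel code: `toy_pe`, `toy_recovery`, `toy_pe_state_clear`, `toy_reset_device`, `toy_check` and the two `TOY_PE_*` bit values are hypothetical names and placeholders invented for illustration, and only the branch structure visible in the hunk above is mirrored. It shows the post-patch behaviour described in the commit message: the primary-bus flag is cleared on the full hotplug path and left alone on the partial hotplug path.

```c
#include <assert.h>

/* Placeholder bit values for illustration only; NOT the kernel's definitions. */
#define TOY_PE_KEEP    (1u << 0)
#define TOY_PE_PRI_BUS (1u << 1)

/* Hypothetical stand-in for the two branches seen in the patch hunk. */
enum toy_recovery { TOY_FULL_HOTPLUG, TOY_PARTIAL_HOTPLUG };

/* Hypothetical, heavily reduced stand-in for struct eeh_pe. */
struct toy_pe {
	unsigned int state;
};

/* Clear the given flag bits, in the spirit of eeh_pe_state_clear(). */
void toy_pe_state_clear(struct toy_pe *pe, unsigned int flags)
{
	pe->state &= ~flags;
}

/* Models the flag handling after the fixup is applied. */
void toy_reset_device(struct toy_pe *pe, enum toy_recovery kind)
{
	if (kind == TOY_FULL_HOTPLUG) {
		/* Full hotplug: drop the cached primary bus flag. */
		toy_pe_state_clear(pe, TOY_PE_PRI_BUS);
	} else if (kind == TOY_PARTIAL_HOTPLUG) {
		/* Partial hotplug: the primary bus flag is left untouched. */
	}
	/* Both paths clear the KEEP flag, as in the last context line. */
	toy_pe_state_clear(pe, TOY_PE_KEEP);
}

/* Exercises both paths and checks the resulting flag state. */
void toy_check(void)
{
	struct toy_pe full = { TOY_PE_KEEP | TOY_PE_PRI_BUS };
	struct toy_pe partial = { TOY_PE_KEEP | TOY_PE_PRI_BUS };

	toy_reset_device(&full, TOY_FULL_HOTPLUG);
	toy_reset_device(&partial, TOY_PARTIAL_HOTPLUG);

	assert(full.state == 0);
	assert(partial.state == TOY_PE_PRI_BUS);
}
```

Calling `toy_check()` from a `main` function exercises both paths. Before the fixup, the Xenial backport had the clear in the other branch, which in this model would leave `TOY_PE_PRI_BUS` set after a full hotplug.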

  
  Historical context:
   State: Open by: ukrishn on 08 September 2016 18:15:32 

  Seems like this is easily recreatable. Mike Vageline just hit the
  issue by doing a couple of PERSTs on a FlashGT card.

  Here is the note from him -
  I had downloaded 0908, then perst, modprob'd to verify 0908, then
  rmmod, then perst to factory, modprob'd, verified it was 0903, rmmod,
  then perst again to user... xmon

  p8tul12-lp1 login: [  647.501340] Fatal Hypervisor Maintenance interrupt [Recovered]
  [  647.501348] EEH: Fenced PHB#2 detected, location: N/A
  [  647.501528]  Error detail: Malfunction Alert
  [  647.501590] HMER: 8040
  [  647.501637] Unknown Core check stop.
  [  647.502584] Fatal Hypervisor Maintenance interrupt [Recovered]
  [  647.502588]  Error detail: Malfunction Alert
  [  647.502590] HMER: 8040
  [  647.502591] Unknown Core check stop.
  [  665.369299] PCI: Memory resource 0 not set for host bridge /pciex@3fffe4040/pci@0/device@0 (domain 5)
  [  676.293638] Back level AFU, please upgrade. AFU version 160903N0 interface version 0x
  [  676.293842] cxlflash 0005:00:00.0: cxlflash_probe: call to init_afu failed rc=-22!
  [  704.863543] Unable to handle kernel paging request for data at address 0x0110
  [  704.863673] Faulting instruction address: 0xc0083e08
  cpu 0x2: Vector: 300 (Data Access) at [c00f01cbf7d0]
      pc: c0083e08: pnv_eeh_reset+0x68/0x170
      lr: c0083df8: pnv_eeh_reset+0x58/0x170
      sp: c00f01cbfa50
     msr: 90009033
     dar: 110
   dsisr: 4000
    current = 0xc00f014bc8e0
    paca    = 0xc7b41300   softe: 0   irq_happened: 0x01
      pid   = 10688, comm = sh
  enter ? for help
  [c00f01cbfad0] c0038bb8 pcibios_set_pcie_reset_state+0x118/0x280
  [c00f01cbfb50] c05e9450 pci_set_pcie_reset_state+0x30/0x50
  [c00f01cbfb80] d7c9f7bc cxl_pci_reset+0x5c/0xc0 [cxl]
  [c00f01cbfbf0] d7c992a4 reset_adapter_store+0x84/0x120 [cxl]
  [c00f01cbfc80] c06d2378 dev_attr_store+0x68/0xa0
  [c00f01cbfcc0] c0398290 sysfs_kf_write+0x80/0xb0
  [c00f01cbfd00] c03971a8 kernfs_fop_write+0x188/0x200
  [c00f01cbfd50] c02e1a6c __vfs_write+0x6c/0xe0
  [c00f01cbfd90] c02e27a0 vfs_write+0xc0/0x230
  [c00f01cbfde0] c02e37dc SyS_write+0x6c/0x110
  [c00f01cbfe30] c0009204 system_call+0x38/0xb4
  --- Exception: c01 (System 

[Group.of.nepali.translators] [Bug 1645826] Re: Crash@pcibios_set_pcie_reset_state+0x118/0x280 in capiredp01 with latest level - 160823-GA3-FlashGT

2016-11-29 Thread Tim Gardner
https://lists.ubuntu.com/archives/kernel-team/2016-November/081180.html

** Also affects: linux (Ubuntu Xenial)
   Importance: Undecided
   Status: New

** Changed in: linux (Ubuntu Xenial)
   Status: New => In Progress

** Changed in: linux (Ubuntu Xenial)
 Assignee: (unassigned) => Tim Gardner (timg-tpi)

** Changed in: linux (Ubuntu)
   Status: New => Fix Released

** Changed in: linux (Ubuntu)
 Assignee: Taco Screen team (taco-screen-team) => (unassigned)

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Xenial:
  In Progress
