On Wed, Oct 21, 2015 at 11:44:26AM +1100, Gavin Shan wrote: >On Tue, Oct 20, 2015 at 05:03:00PM +0800, Wei Yang wrote: >>On PHB_IODA2, we enable SRIOV devices by mapping IOV BAR with M64 BARs. If > ^^^^^^^^^ > >s/PHB_IODA2/PHB3 or s/PHB_IODA2/IODA2 PHB > >>a SRIOV device's IOV BAR is not 64bit-prefetchable, this is not assigned >>from 64bit prefetchable window, which means M64 BAR can't work on it. >> >>The reason is PCI bridges support only 2 windows and the kernel code > ^^^^^^^^^ > >It would be more accurate: "2 memory windows". >
Thanks, will change in next version. >>programs bridges in the way that one window is 32bit-nonprefetchable and >>the other one is 64bit-prefetchable. So if devices' IOV BAR is 64bit and >>non-prefetchable, it will be mapped into 32bit space and therefore M64 >>cannot be used for it. >> >>This patch makes this explicit and truncate IOV resource in this case to > ^^^^^^^^ >>save MMIO space. >> >>Signed-off-by: Wei Yang <weiy...@linux.vnet.ibm.com> >>Reviewed-by: Gavin Shan <gws...@linux.vnet.ibm.com> >>Acked-by: Alexey Kardashevskiy <a...@ozlabs.ru> >>--- >> arch/powerpc/platforms/powernv/pci-ioda.c | 34 >> ++++++++++++++++--------------- >> 1 file changed, 18 insertions(+), 16 deletions(-) >> >>diff --git a/arch/powerpc/platforms/powernv/pci-ioda.c >>b/arch/powerpc/platforms/powernv/pci-ioda.c >>index 85cbc96..f042fed 100644 >>--- a/arch/powerpc/platforms/powernv/pci-ioda.c >>+++ b/arch/powerpc/platforms/powernv/pci-ioda.c >>@@ -908,9 +908,6 @@ static int pnv_pci_vf_resource_shift(struct pci_dev *dev, >>int offset) >> if (!res->flags || !res->parent) >> continue; >> >>- if (!pnv_pci_is_mem_pref_64(res->flags)) >>- continue; >>- >> /* >> * The actual IOV BAR range is determined by the start address >> * and the actual size for num_vfs VFs BAR. This check is to >>@@ -939,9 +936,6 @@ static int pnv_pci_vf_resource_shift(struct pci_dev *dev, >>int offset) >> if (!res->flags || !res->parent) >> continue; >> >>- if (!pnv_pci_is_mem_pref_64(res->flags)) >>- continue; >>- >> size = pci_iov_resource_size(dev, i + PCI_IOV_RESOURCES); >> res2 = *res; >> res->start += size * offset; >>@@ -1221,9 +1215,6 @@ static int pnv_pci_vf_assign_m64(struct pci_dev *pdev, >>u16 num_vfs) >> if (!res->flags || !res->parent) >> continue; >> >>- if (!pnv_pci_is_mem_pref_64(res->flags)) >>- continue; >>- >> for (j = 0; j < vf_groups; j++) { >> do { >> win = >> find_next_zero_bit(&phb->ioda.m64_bar_alloc, >>@@ -1510,6 +1501,12 @@ int pnv_pci_sriov_enable(struct pci_dev *pdev, u16 >>num_vfs) >> pdn = pci_get_pdn(pdev); >> >> if (phb->type == PNV_PHB_IODA2) { >>+ if (!pdn->vfs_expanded) { >>+ dev_info(&pdev->dev, "don't support this SRIOV device" >>+ " with non 64bit-prefetchable IOV BAR\n"); >>+ return -ENOSPC; >>+ } >>+ >> /* Calculate available PE for required VFs */ >> mutex_lock(&phb->ioda.pe_alloc_mutex); >> pdn->offset = bitmap_find_next_zero_area( >>@@ -2775,9 +2772,10 @@ static void pnv_pci_ioda_fixup_iov_resources(struct >>pci_dev *pdev) >> if (!res->flags || res->parent) >> continue; >> if (!pnv_pci_is_mem_pref_64(res->flags)) { >>- dev_warn(&pdev->dev, " non M64 VF BAR%d: %pR\n", >>+ dev_warn(&pdev->dev, "Don't support SR-IOV with" >>+ " non M64 VF BAR%d: %pR. \n", >> i, res); >>- continue; >>+ goto truncate_iov; >> } >> >> size = pci_iov_resource_size(pdev, i + PCI_IOV_RESOURCES); >>@@ -2796,11 +2794,6 @@ static void pnv_pci_ioda_fixup_iov_resources(struct >>pci_dev *pdev) >> res = &pdev->resource[i + PCI_IOV_RESOURCES]; >> if (!res->flags || res->parent) >> continue; >>- if (!pnv_pci_is_mem_pref_64(res->flags)) { >>- dev_warn(&pdev->dev, "Skipping expanding VF BAR%d: >>%pR\n", >>- i, res); >>- continue; >>- } >> >> dev_dbg(&pdev->dev, " Fixing VF BAR%d: %pR to\n", i, res); >> size = pci_iov_resource_size(pdev, i + PCI_IOV_RESOURCES); >>@@ -2810,6 +2803,15 @@ static void pnv_pci_ioda_fixup_iov_resources(struct >>pci_dev *pdev) >> i, res, mul); >> } >> pdn->vfs_expanded = mul; >>+ >>+ return; >>+ >>+truncate_iov: >>+ /* To save MMIO space, IOV BAR is truncated. */ >>+ for (i = 0; i < PCI_SRIOV_NUM_BARS; i++) { >>+ res = &pdev->resource[i + PCI_IOV_RESOURCES]; >>+ res->end = res->start - 1; >>+ } > >res->flags isn't cleared out, the IOV BAR will be counted in the optional list >in resource sizing stage. However, the size has became zero. It's obvious not >necessary to do that. I doubt this piece of code has been really testified on >real PCI adapter or one with emulated M32 IOV BAR? > Below is the log during my test: [ 0.436209] pci 0001:08:00.0: reg 0x10: [mem 0x3fe080820000-0x3fe08082ffff 64bit] [ 0.436230] pci 0001:08:00.0: reg 0x18: [mem 0x3fe080830000-0x3fe08083ffff 64bit] [ 0.436268] pci 0001:08:00.0: reg 0x30: [mem 0x00000000-0x0001ffff pref] [ 0.436333] pci 0001:08:00.0: PME# supported from D0 D3hot D3cold [ 0.436402] pci 0001:08:00.0: reg 0x16c: [mem 0x00000000-0x0000ffff 64bit] [ 0.436404] pci 0001:08:00.0: VF(n) BAR0 space: [mem 0x00000000-0x001fffff 64bit] (contains BAR0 for 32 VFs) [ 0.436515] pci 0001:08:00.0: reg 0x174: [mem 0x00000000-0x0000ffff 64bit] [ 0.436517] pci 0001:08:00.0: VF(n) BAR2 space: [mem 0x00000000-0x001fffff 64bit] (contains BAR2 for 32 VFs) IOV BAR not truncated: [root@tian-lp1 ~]# grep 0001:08:00.0 /proc/iomem 3fe080800000-3fe08081ffff : 0001:08:00.0 3fe080820000-3fe08082ffff : 0001:08:00.0 3fe080830000-3fe08083ffff : 0001:08:00.0 3fe080840000-3fe080a3ffff : 0001:08:00.0 3fe080a40000-3fe080c3ffff : 0001:08:00.0 IOV BAR truncated: [ywywyang@tian-lp1 ~]$ grep 0001:08:00.0 /proc/iomem 3fe080800000-3fe08081ffff : 0001:08:00.0 3fe080820000-3fe08082ffff : 0001:08:00.0 3fe080830000-3fe08083ffff : 0001:08:00.0 We could see, after applying this patch, the 2 IOV BAR will not be allocated in iomem range. While I think your proposal is reasonable. By clearing the flag, IOV BAR will not be involved in the sizing/assign stage. Will change this in next version. >> } >> #endif /* CONFIG_PCI_IOV */ >> >>-- >>2.5.0 >> -- Richard Yang Help you, Help me _______________________________________________ Linuxppc-dev mailing list Linuxppc-dev@lists.ozlabs.org https://lists.ozlabs.org/listinfo/linuxppc-dev