Re: [PATCH V6 0/4] mm/kvm/vfio/ppc64: Migrate compound pages out of CMA region

2019-01-10 Thread David Gibson
On Wed, Jan 09, 2019 at 02:11:25PM +0530, Aneesh Kumar K.V wrote:
> Andrew Morton  writes:
> 
> > On Tue,  8 Jan 2019 10:21:06 +0530 "Aneesh Kumar K.V" 
> >  wrote:
> >
> >> ppc64 use CMA area for the allocation of guest page table (hash page 
> >> table). We won't
> >> be able to start guest if we fail to allocate hash page table. We have 
> >> observed
> >> hash table allocation failure because we failed to migrate pages out of 
> >> CMA region
> >> because they were pinned. This happen when we are using VFIO. VFIO on 
> >> ppc64 pins
> >> the entire guest RAM. If the guest RAM pages get allocated out of CMA 
> >> region, we
> >> won't be able to migrate those pages. The pages are also pinned for the 
> >> lifetime of the
> >> guest.
> >> 
> >> Currently we support migration of non-compound pages. With THP and with 
> >> the addition of
> >>  hugetlb migration we can end up allocating compound pages from CMA 
> >> region. This
> >> patch series add support for migrating compound pages. The first path adds 
> >> the helper
> >> get_user_pages_cma_migrate() which pin the page making sure we migrate 
> >> them out of
> >> CMA region before incrementing the reference count. 
> >
> > Does this code do anything for architectures other than powerpc?  If
> > not, should we be adding the ifdefs to avoid burdening other
> > architectures with unused code?
> 
> Any architecture enabling CMA may need this. I will move most of this below
> CONFIG_CMA.

In theory it could affect any architecture using CMA.  I suspect it's
much less likely to bite in practice on architectures other than ppc.
IIUC the main use of CMA there is to allocate things like framebuffers
or other large contiguous blocks used for hardware devices.  That's
usually going to happen rarely and during boot up.  What makes ppc
different is that we need a substantial CMA allocation every time we
start a (POWER8) guest for the HPT.  It's the fact that running guests
on a system both means we need the CMA unfragment and (with vfio added
in) can cause CMA fragmentation which makes this particularly
problematic.

-- 
David Gibson| I'll have my music baroque, and my code
david AT gibson.dropbear.id.au  | minimalist, thank you.  NOT _the_ _other_
| _way_ _around_!
http://www.ozlabs.org/~dgibson


signature.asc
Description: PGP signature


Re: [PATCH V6 0/4] mm/kvm/vfio/ppc64: Migrate compound pages out of CMA region

2019-01-09 Thread Aneesh Kumar K.V
Andrew Morton  writes:

> On Tue,  8 Jan 2019 10:21:06 +0530 "Aneesh Kumar K.V" 
>  wrote:
>
>> ppc64 use CMA area for the allocation of guest page table (hash page table). 
>> We won't
>> be able to start guest if we fail to allocate hash page table. We have 
>> observed
>> hash table allocation failure because we failed to migrate pages out of CMA 
>> region
>> because they were pinned. This happen when we are using VFIO. VFIO on ppc64 
>> pins
>> the entire guest RAM. If the guest RAM pages get allocated out of CMA 
>> region, we
>> won't be able to migrate those pages. The pages are also pinned for the 
>> lifetime of the
>> guest.
>> 
>> Currently we support migration of non-compound pages. With THP and with the 
>> addition of
>>  hugetlb migration we can end up allocating compound pages from CMA region. 
>> This
>> patch series add support for migrating compound pages. The first path adds 
>> the helper
>> get_user_pages_cma_migrate() which pin the page making sure we migrate them 
>> out of
>> CMA region before incrementing the reference count. 
>
> Does this code do anything for architectures other than powerpc?  If
> not, should we be adding the ifdefs to avoid burdening other
> architectures with unused code?

Any architecture enabling CMA may need this. I will move most of this below
CONFIG_CMA.

-aneesh



Re: [PATCH V6 0/4] mm/kvm/vfio/ppc64: Migrate compound pages out of CMA region

2019-01-08 Thread Andrew Morton
On Tue,  8 Jan 2019 10:21:06 +0530 "Aneesh Kumar K.V" 
 wrote:

> ppc64 use CMA area for the allocation of guest page table (hash page table). 
> We won't
> be able to start guest if we fail to allocate hash page table. We have 
> observed
> hash table allocation failure because we failed to migrate pages out of CMA 
> region
> because they were pinned. This happen when we are using VFIO. VFIO on ppc64 
> pins
> the entire guest RAM. If the guest RAM pages get allocated out of CMA region, 
> we
> won't be able to migrate those pages. The pages are also pinned for the 
> lifetime of the
> guest.
> 
> Currently we support migration of non-compound pages. With THP and with the 
> addition of
>  hugetlb migration we can end up allocating compound pages from CMA region. 
> This
> patch series add support for migrating compound pages. The first path adds 
> the helper
> get_user_pages_cma_migrate() which pin the page making sure we migrate them 
> out of
> CMA region before incrementing the reference count. 

Does this code do anything for architectures other than powerpc?  If
not, should we be adding the ifdefs to avoid burdening other
architectures with unused code?



[PATCH V6 0/4] mm/kvm/vfio/ppc64: Migrate compound pages out of CMA region

2019-01-07 Thread Aneesh Kumar K.V
ppc64 use CMA area for the allocation of guest page table (hash page table). We 
won't
be able to start guest if we fail to allocate hash page table. We have observed
hash table allocation failure because we failed to migrate pages out of CMA 
region
because they were pinned. This happen when we are using VFIO. VFIO on ppc64 pins
the entire guest RAM. If the guest RAM pages get allocated out of CMA region, we
won't be able to migrate those pages. The pages are also pinned for the 
lifetime of the
guest.

Currently we support migration of non-compound pages. With THP and with the 
addition of
 hugetlb migration we can end up allocating compound pages from CMA region. This
patch series add support for migrating compound pages. The first path adds the 
helper
get_user_pages_cma_migrate() which pin the page making sure we migrate them out 
of
CMA region before incrementing the reference count. 

Changes from V5:
* Add PF_MEMALLOC_NOCMA
* remote __GFP_THISNODE when allocating target page for migration

Changes from V4:
* use __GFP_NOWARN when allocating pages to avoid page allocation failure 
warnings.

Changes from V3:
* Move the hugetlb check before transhuge check
* Use compound head page when isolating hugetlb page


Aneesh Kumar K.V (4):
  mm/cma: Add PF flag to force non cma alloc
  mm: Add get_user_pages_cma_migrate
  powerpc/mm/iommu: Allow migration of cma allocated pages during
mm_iommu_get
  powerpc/mm/iommu: Allow large IOMMU page size only for hugetlb backing

 arch/powerpc/mm/mmu_context_iommu.c | 144 ---
 include/linux/hugetlb.h |   2 +
 include/linux/migrate.h |   3 +
 include/linux/sched.h   |   1 +
 include/linux/sched/mm.h|  36 +--
 mm/hugetlb.c|   4 +-
 mm/migrate.c| 149 
 7 files changed, 227 insertions(+), 112 deletions(-)

-- 
2.20.1