On Thu, Apr 05, 2018 at 10:59:27AM +0200, Michal Hocko wrote:
> On Tue 03-04-18 13:54:11, Kirill A. Shutemov wrote:
> > On Tue, Apr 03, 2018 at 10:34:51AM +0200, Michal Hocko wrote:
> > > On Tue 03-04-18 08:24:06, Naoya Horiguchi wrote:
> > > > On Tue, Apr 03, 2018 at 09:59:28AM +0200, Michal Hocko wrote:
> > > > > On Tue 03-04-18 13:46:28, Naoya Horiguchi wrote:
> > > > > > My testing for the latest kernel supporting thp migration found out 
> > > > > > an
> > > > > > infinite loop in offlining the memory block that is filled with 
> > > > > > shmem
> > > > > > thps.  We can get out of the loop with a signal, but kernel should
> > > > > > return with failure in this case.
> > > > > >
> > > > > > What happens in the loop is that scan_movable_pages() repeats 
> > > > > > returning
> > > > > > the same pfn without any progress. That's because page migration 
> > > > > > always
> > > > > > fails for shmem thps.
> > > > >
> > > > > Why does it fail? Shmem pages should be movable without any issues.
> > > > 
> > > > .. because try_to_unmap_one() explicitly skips unmapping for migration.
> > > > 
> > > >   #ifdef CONFIG_ARCH_ENABLE_THP_MIGRATION
> > > >                   /* PMD-mapped THP migration entry */
> > > >                   if (!pvmw.pte && (flags & TTU_MIGRATION)) {
> > > >                           VM_BUG_ON_PAGE(PageHuge(page) || 
> > > > !PageTransCompound(page), page);
> > > >   
> > > >                           if (!PageAnon(page))
> > > >                                   continue;
> > > >   
> > > >                           set_pmd_migration_entry(&pvmw, page);
> > > >                           continue;
> > > >                   }
> > > >   #endif
> > > > 
> > > > When I implemented this code, I felt hard to work on both of anon thp
> > > > and shmem thp at one time, so I separated the proposal into smaller 
> > > > steps.
> > > > Shmem uses pagecache so we need some non-trivial effort (including 
> > > > testing)
> > > > to extend thp migration for shmem. But I think it's a reasonable next 
> > > > step.
> > > 
> > > OK, I see. I have forgot about this part. Please be explicit about that
> > > in the changelog. Also the proper fix is to not use movable zone for
> > > shmem page THP rather than hack around it in the hotplug specific code
> > > IMHO.
> > 
> > No. We should just split the page before running
> > try_to_unmap(TTU_MIGRATION) on the page.
> 
> Something like this or it is completely broken. I completely forgot the
> whole page_vma_mapped_walk business.

No, this wouldn't work. We need to split page, not pmd to make migration
work.

> 
> diff --git a/mm/rmap.c b/mm/rmap.c
> index 9eaa6354fe70..cbbfbcb08b83 100644
> --- a/mm/rmap.c
> +++ b/mm/rmap.c
> @@ -1356,6 +1356,7 @@ static bool try_to_unmap_one(struct page *page, struct 
> vm_area_struct *vma,
>               return true;
>  
>       if (flags & TTU_SPLIT_HUGE_PMD) {
> +split:
>               split_huge_pmd_address(vma, address,
>                               flags & TTU_SPLIT_FREEZE, page);
>       }
> @@ -1375,7 +1376,7 @@ static bool try_to_unmap_one(struct page *page, struct 
> vm_area_struct *vma,
>                       VM_BUG_ON_PAGE(PageHuge(page) || 
> !PageTransCompound(page), page);
>  
>                       if (!PageAnon(page))
> -                             continue;
> +                             goto split;
>  
>                       set_pmd_migration_entry(&pvmw, page);
>                       continue;
> -- 
> Michal Hocko
> SUSE Labs

-- 
 Kirill A. Shutemov

Reply via email to