On 8/27/21 5:25 PM, Jason Gunthorpe wrote:
> On Fri, Aug 27, 2021 at 03:58:13PM +0100, Joao Martins wrote:
> 
>>  #if defined(CONFIG_ARCH_HAS_PTE_DEVMAP) && 
>> defined(CONFIG_TRANSPARENT_HUGEPAGE)
>>  static int __gup_device_huge(unsigned long pfn, unsigned long addr,
>>                           unsigned long end, unsigned int flags,
>>                           struct page **pages, int *nr)
>>  {
>> -    int nr_start = *nr;
>> +    int refs, nr_start = *nr;
>>      struct dev_pagemap *pgmap = NULL;
>>      int ret = 1;
>>  
>>      do {
>> -            struct page *page = pfn_to_page(pfn);
>> +            struct page *head, *page = pfn_to_page(pfn);
>> +            unsigned long next = addr + PAGE_SIZE;
>>  
>>              pgmap = get_dev_pagemap(pfn, pgmap);
>>              if (unlikely(!pgmap)) {
>> @@ -2252,16 +2265,25 @@ static int __gup_device_huge(unsigned long pfn, 
>> unsigned long addr,
>>                      ret = 0;
>>                      break;
>>              }
>> -            SetPageReferenced(page);
>> -            pages[*nr] = page;
>> -            if (unlikely(!try_grab_page(page, flags))) {
>> -                    undo_dev_pagemap(nr, nr_start, flags, pages);
>> +
>> +            head = compound_head(page);
>> +            /* @end is assumed to be limited at most one compound page */
>> +            if (PageHead(head))
>> +                    next = end;
>> +            refs = record_subpages(page, addr, next, pages + *nr);
>> +
>> +            SetPageReferenced(head);
>> +            if (unlikely(!try_grab_compound_head(head, refs, flags))) {
>> +                    if (PageHead(head))
>> +                            ClearPageReferenced(head);
>> +                    else
>> +                            undo_dev_pagemap(nr, nr_start, flags, pages);
>>                      ret = 0;
>>                      break;
> 
> Why is this special cased for devmap?
> 
> Shouldn't everything processing pud/pmds/etc use the same basic loop
> that is similar in idea to the 'for_each_compound_head' scheme in
> unpin_user_pages_dirty_lock()?
> 
> Doesn't that work for all the special page type cases here?

We are iterating over PFNs to create an array of base pages (regardless of page 
table
type), rather than iterating over an array of pages to work on. Given that all 
these gup
functions already give you the boundary (end of pmd or end of pud, etc) then we 
just need
to grab the ref to pgmap and head page and save the tails. But sadly we need to 
handle the
base page case which is why there's this outer loop exists sadly. If it was 
just head
pages we wouldn't need the outer loop (and hence no for_each_compound_head, 
like the
hugetlb variant).

But maybe I am being dense and you just mean to replace the outer loop with
for_each_compound_range(). I am a little stuck on the part that I anyways need 
to record
back the tail pages when iterating over the (single) head page. So I feel that 
it wouldn't
bring that much improvement, unless I missed your point.

        Joao

Reply via email to