On 9/12/25 08:28, Mika Penttilä wrote:

> On 9/12/25 08:04, Balbir Singh wrote:
>
>> On 9/11/25 21:52, Mika Penttilä wrote:
>>> sending again for the v5 thread..
>>>
>>> On 9/8/25 03:04, Balbir Singh wrote:
>>>
>>>> MIGRATE_VMA_SELECT_COMPOUND will be used to select THP pages during
>>>> migrate_vma_setup() and MIGRATE_PFN_COMPOUND will make migrating
>>>> device pages as compound pages during device pfn migration.
>>>>
>>>> migrate_device code paths go through the collect, setup
>>>> and finalize phases of migration.
>>>>
>>>> The entries in src and dst arrays passed to these functions still
>>>> remain at a PAGE_SIZE granularity. When a compound page is passed,
>>>> the first entry has the PFN along with MIGRATE_PFN_COMPOUND
>>>> and other flags set (MIGRATE_PFN_MIGRATE, MIGRATE_PFN_VALID), the
>>>> remaining entries (HPAGE_PMD_NR - 1) are filled with 0's. This
>>>> representation allows for the compound page to be split into smaller
>>>> page sizes.
>>>>
>>>> migrate_vma_collect_hole(), migrate_vma_collect_pmd() are now THP
>>>> page aware. Two new helper functions migrate_vma_collect_huge_pmd()
>>>> and migrate_vma_insert_huge_pmd_page() have been added.
>>>>
>>>> migrate_vma_collect_huge_pmd() can collect THP pages, but if for
>>>> some reason this fails, there is fallback support to split the folio
>>>> and migrate it.
>>>>
>>>> migrate_vma_insert_huge_pmd_page() closely follows the logic of
>>>> migrate_vma_insert_page()
>>>>
>>>> Support for splitting pages as needed for migration will follow in
>>>> later patches in this series.
>>>>
>>>> Cc: Andrew Morton <a...@linux-foundation.org>
>>>> Cc: David Hildenbrand <da...@redhat.com>
>>>> Cc: Zi Yan <z...@nvidia.com>
>>>> Cc: Joshua Hahn <joshua.hah...@gmail.com>
>>>> Cc: Rakie Kim <rakie....@sk.com>
>>>> Cc: Byungchul Park <byungc...@sk.com>
>>>> Cc: Gregory Price <gou...@gourry.net>
>>>> Cc: Ying Huang <ying.hu...@linux.alibaba.com>
>>>> Cc: Alistair Popple <apop...@nvidia.com>
>>>> Cc: Oscar Salvador <osalva...@suse.de>
>>>> Cc: Lorenzo Stoakes <lorenzo.stoa...@oracle.com>
>>>> Cc: Baolin Wang <baolin.w...@linux.alibaba.com>
>>>> Cc: "Liam R. Howlett" <liam.howl...@oracle.com>
>>>> Cc: Nico Pache <npa...@redhat.com>
>>>> Cc: Ryan Roberts <ryan.robe...@arm.com>
>>>> Cc: Dev Jain <dev.j...@arm.com>
>>>> Cc: Barry Song <bao...@kernel.org>
>>>> Cc: Lyude Paul <ly...@redhat.com>
>>>> Cc: Danilo Krummrich <d...@kernel.org>
>>>> Cc: David Airlie <airl...@gmail.com>
>>>> Cc: Simona Vetter <sim...@ffwll.ch>
>>>> Cc: Ralph Campbell <rcampb...@nvidia.com>
>>>> Cc: Mika Penttilä <mpent...@redhat.com>
>>>> Cc: Matthew Brost <matthew.br...@intel.com>
>>>> Cc: Francois Dugast <francois.dug...@intel.com>
>>>>
>>>> Signed-off-by: Balbir Singh <balb...@nvidia.com>
>>>> ---
>>>>  include/linux/migrate.h |   2 +
>>>>  mm/migrate_device.c     | 456 ++++++++++++++++++++++++++++++++++------
>>>>  2 files changed, 395 insertions(+), 63 deletions(-)
>>>>
>>>> diff --git a/include/linux/migrate.h b/include/linux/migrate.h
>>>> index 1f0ac122c3bf..41b4cc05a450 100644
>>>> --- a/include/linux/migrate.h
>>>> +++ b/include/linux/migrate.h
>>>> @@ -125,6 +125,7 @@ static inline int migrate_misplaced_folio(struct folio 
>>>> *folio, int node)
>>>>  #define MIGRATE_PFN_VALID (1UL << 0)
>>>>  #define MIGRATE_PFN_MIGRATE       (1UL << 1)
>>>>  #define MIGRATE_PFN_WRITE (1UL << 3)
>>>> +#define MIGRATE_PFN_COMPOUND      (1UL << 4)
>>>>  #define MIGRATE_PFN_SHIFT 6
>>>>  
>>>>  static inline struct page *migrate_pfn_to_page(unsigned long mpfn)
>>>> @@ -143,6 +144,7 @@ enum migrate_vma_direction {
>>>>    MIGRATE_VMA_SELECT_SYSTEM = 1 << 0,
>>>>    MIGRATE_VMA_SELECT_DEVICE_PRIVATE = 1 << 1,
>>>>    MIGRATE_VMA_SELECT_DEVICE_COHERENT = 1 << 2,
>>>> +  MIGRATE_VMA_SELECT_COMPOUND = 1 << 3,
>>>>  };
>>>>  
>>>>  struct migrate_vma {
>>>> diff --git a/mm/migrate_device.c b/mm/migrate_device.c
>>>> index f45ef182287d..1dfcf4799ea5 100644
>>>> --- a/mm/migrate_device.c
>>>> +++ b/mm/migrate_device.c
>>>> @@ -14,6 +14,7 @@
>>>>  #include <linux/pagewalk.h>
>>>>  #include <linux/rmap.h>
>>>>  #include <linux/swapops.h>
>>>> +#include <linux/pgalloc.h>
>>>>  #include <asm/tlbflush.h>
>>>>  #include "internal.h"
>>>>  
>>>> @@ -44,6 +45,23 @@ static int migrate_vma_collect_hole(unsigned long start,
>>>>    if (!vma_is_anonymous(walk->vma))
>>>>            return migrate_vma_collect_skip(start, end, walk);
>>>>  
>>>> +  if (thp_migration_supported() &&
>>>> +          (migrate->flags & MIGRATE_VMA_SELECT_COMPOUND) &&
>>>> +          (IS_ALIGNED(start, HPAGE_PMD_SIZE) &&
>>>> +           IS_ALIGNED(end, HPAGE_PMD_SIZE))) {
>>>> +          migrate->src[migrate->npages] = MIGRATE_PFN_MIGRATE |
>>>> +                                          MIGRATE_PFN_COMPOUND;
>>>> +          migrate->dst[migrate->npages] = 0;
>>>> +          migrate->npages++;
>>>> +          migrate->cpages++;
>>>> +
>>>> +          /*
>>>> +           * Collect the remaining entries as holes, in case we
>>>> +           * need to split later
>>>> +           */
>>>> +          return migrate_vma_collect_skip(start + PAGE_SIZE, end, walk);
>>>> +  }
>>>> +
>>> seems you have to split_huge_pmd() for the huge zero page here in case
>>> of !thp_migration_supported() afaics
>>>
>> Not really, if pfn is 0, we do a vm_insert_page (please see if (!page) line 
>> 1107) and
>> folio  handling in migrate_vma_finalize line 1284
> Ok actually seems it is handled by migrate_vma_insert_page() which does
>
>         if (!pmd_none(*pmdp)) {
>                 if (pmd_trans_huge(*pmdp)) {
>                         if (!is_huge_zero_pmd(*pmdp))
>                                 goto abort;
>                         folio_get(pmd_folio(*pmdp));
>                         split_huge_pmd(vma, pmdp, addr);   <----- here
>                 } else if (pmd_leaf(*pmdp))
>                         goto abort;
>         }
>
While at it, think the folio_get(pmd_folio(*pmdp)); is wrong for here,
we split the pmd for huge zero page.

>> Thanks,
>> Balbir
>>
> --Mika
>

Reply via email to