On 05/05/2026 11:59, Sumit Garg wrote:
> On Mon, May 04, 2026 at 07:53:54PM +0200, Casey Connolly wrote:
>> Currently set_one_region() implicitly assumes that we want to map a
>> region and aggressively splits blocks into tables to do this, but when
>> called with PTE_TYPE_FAULT to unmap a currently mapped region it may
>> try to unnecessarily split blocks which doesn't make sense if the entire
>> block should actually be unmapped. In the end it then has to walk every
>> single page and create a bunch of empty tables.
>>
>> Introduce a check for this kind of behaviour and optimise with a fast
>> path, if we're unmapping a region >= the size of this entry then we can
>> just unmap the entire PTE and whatever it contains.
>>
>> This fixes some bogus empty tables being left behind when carving out
>> reserved memory regions on Qualcomm, and should improve the performance
>> of the break-before-make in mmu_change_region_attr().
>>
>> Signed-off-by: Casey Connolly <[email protected]>
>> ---
>>  arch/arm/cpu/armv8/cache_v8.c | 16 ++++++++++++++++
>>  1 file changed, 16 insertions(+)
> 
> Nice optimization, FWIW:
> 
> Acked-by: Sumit Garg <[email protected]>
> Tested-by: Sumit Garg <[email protected]>

Could you drop this on the v2 instead? Sorry for the confusion this
version had a silly bug.
> 
> -Sumit
> 
>>
>> diff --git a/arch/arm/cpu/armv8/cache_v8.c b/arch/arm/cpu/armv8/cache_v8.c
>> index 39479df7b21f..b1a1fd571341 100644
>> --- a/arch/arm/cpu/armv8/cache_v8.c
>> +++ b/arch/arm/cpu/armv8/cache_v8.c
>> @@ -939,8 +939,20 @@ static u64 set_one_region(u64 start, u64 size, u64 
>> attrs, bool flag, int level)
>>      int levelshift = level2shift(level);
>>      u64 levelsize = 1ULL << levelshift;
>>      u64 *pte = find_pte(start, level);
>>  
>> +    /*
>> +     * If we're trying to unmap a region then check if it's already 
>> unmapped or if it's bigger
>> +     * then the PTE we're looking at right now, in the first case we can do 
>> nothing and in the
>> +     * second case we just need to unmap this page/block.
>> +     * Otherwise we will needlessly create new tables until we have 
>> traversed every single page
>> +     * in the region.
>> +     */
>> +    if (attrs == PTE_TYPE_FAULT && (pte_type(pte) == PTE_TYPE_FAULT || size 
>> >= levelsize)) {
>> +            *pte &= ~PMD_ATTRMASK;
>> +            return levelsize;
>> +    }
>> +
>>      /* Can we can just modify the current level block PTE? */
>>      if (is_aligned(start, size, levelsize)) {
>>              if (flag) {
>>                      *pte &= ~PMD_ATTRMASK;
>> @@ -1081,8 +1093,12 @@ void mmu_change_region_attr(phys_addr_t addr, size_t 
>> siz, u64 attrs)
>>      flush_dcache_range(gd->arch.tlb_addr,
>>                         gd->arch.tlb_addr + gd->arch.tlb_size);
>>      __asm_invalidate_tlb_all();
>>  
>> +    /* If we were unmapping a region then we're done! */
>> +    if (attrs == PTE_TYPE_FAULT)
>> +            return;
>> +
>>      mmu_change_region_attr_nobreak(addr, siz, attrs);
>>  }
>>  
>>  int pgprot_set_attrs(phys_addr_t addr, size_t size, enum pgprot_attrs perm)
>> -- 
>> 2.53.0
>>

-- 
// Casey (she/her)

Reply via email to