[PATCH v2] powerpc/mm: Avoid calling arch_enter/leave_lazy_mmu() in set_ptes
    Michael Ellerman 
    mpe at ellerman.id.au
       
    Thu Nov  2 22:39:44 AEDT 2023
    
    
  
Matthew Wilcox <willy at infradead.org> writes:
> On Tue, Oct 24, 2023 at 08:06:04PM +0530, Aneesh Kumar K.V wrote:
>>  		ptep++;
>> -		pte = __pte(pte_val(pte) + (1UL << PTE_RPN_SHIFT));
>>  		addr += PAGE_SIZE;
>> +		/*
>> +		 * increment the pfn.
>> +		 */
>> +		pte = pfn_pte(pte_pfn(pte) + 1, pte_pgprot((pte)));
>
> when i looked at this, it generated shit code.  did you check?
I didn't look ...
<goes and looks>
It's not super clear cut. There's some difference because pfn_pte()
contains two extra VM_BUG_ONs.
But with DEBUG_VM *off* the version using pfn_pte() generates *better*
code, or at least less code, ~160 instructions vs ~200.
For some reason the version using PTE_RPN_SHIFT seems to be byte
swapping the pte an extra two times, each of which generates ~8
instructions. But I can't see why.
I tried a few other things and couldn't come up with anything that
generated better code. But I'll keep poking at it tomorrow.
cheers
    
    
More information about the Linuxppc-dev
mailing list