[PATCH] powerpc/mm: Add cond_resched() while removing hpte mappings

Vaibhav Jain vaibhav at linux.ibm.com
Tue Apr 6 14:30:47 AEST 2021


Hi Mpe,

Thanks for looking into this patch.

Michael Ellerman <mpe at ellerman.id.au> writes:

> Vaibhav Jain <vaibhav at linux.ibm.com> writes:
>> While removing large number of mappings from hash page tables for
>> large memory systems as soft-lockup is reported because of the time
>> spent inside htap_remove_mapping() like one below:
>>
>>  watchdog: BUG: soft lockup - CPU#8 stuck for 23s!
>>  <snip>
>>  NIP plpar_hcall+0x38/0x58
>>  LR  pSeries_lpar_hpte_invalidate+0x68/0xb0
>>  Call Trace:
>>   0x1fffffffffff000 (unreliable)
>>   pSeries_lpar_hpte_removebolted+0x9c/0x230
>>   hash__remove_section_mapping+0xec/0x1c0
>>   remove_section_mapping+0x28/0x3c
>>   arch_remove_memory+0xfc/0x150
>>   devm_memremap_pages_release+0x180/0x2f0
>>   devm_action_release+0x30/0x50
>>   release_nodes+0x28c/0x300
>>   device_release_driver_internal+0x16c/0x280
>>   unbind_store+0x124/0x170
>>   drv_attr_store+0x44/0x60
>>   sysfs_kf_write+0x64/0x90
>>   kernfs_fop_write+0x1b0/0x290
>>   __vfs_write+0x3c/0x70
>>   vfs_write+0xd4/0x270
>>   ksys_write+0xdc/0x130
>>   system_call+0x5c/0x70
>>
>> Fix this by adding a cond_resched() to the loop in
>> htap_remove_mapping() that issues hcall to remove hpte mapping. This
>> should prevent the soft-lockup from being reported.
>
> Can/should we also/instead be using H_BLOCK_REMOVE?
>
> cheers

Current mmp_ops implementation seems to use H_BULK_REMOVE for hugepages
so for removing mappings for regular pages I am looking into adding a
new mmu_op that can take a range to be unmapped and 
I did try implmenting a new mmu_op for this which can reduce the number
of hash_pte lookups needed to invalidate this range. But that would need
some more work so as a stop gap I have sent out a v2 with Christophe's
suggestion to add a cond_resched() every HZ interval.

-- 
Cheers
~ Vaibhav


More information about the Linuxppc-dev mailing list