[PATCH] powerpc/pseries: Track LMB nid instead of using device tree
Nathan Fontenot
nfont at linux.vnet.ibm.com
Tue Sep 25 00:33:06 AEST 2018
On 09/19/2018 11:38 PM, Michael Ellerman wrote:
> Nathan Fontenot <nfont at linux.vnet.ibm.com> writes:
>
>> When removing memory we need to remove the memory from the node
>> it was added to instead of looking up the node it should be in
>> in the device tree.
>>
>> During testing we have seen scenarios where the affinity for an
>> LMB changes due to a partition migration or PRRN event. In these
>> cases the node the LMB exists in may not match the node the device
>> tree indicates it belongs in. This can lead to a system crash
>> when trying to DLPAR remove the LMB after a migration or PRRN
>> event. The current code looks up the node in the device tree to
>> remove the LMB from. The crash occurs when we try to offline this
>> node and it does not have any data, i.e. node_data[nid] == NULL.
>
> This isn't building for 32-bit etc:
>
> arch/powerpc/mm/drmem.c: In function 'init_drmem_v1_lmbs':
> arch/powerpc/mm/drmem.c:371:14: error: implicit declaration of function 'memory_add_physaddr_to_nid' [-Werror=implicit-function-declaration]
> lmb->nid = memory_add_physaddr_to_nid(lmb->base_addr);
> ^
> cc1: all warnings being treated as errors
> scripts/Makefile.build:317: recipe for target 'arch/powerpc/mm/drmem.o' failed
>
> See the failed checks here:
> https://patchwork.ozlabs.org/patch/969150/
>
>
> Probably drmem.c should only be compiled for 64-bit NUMA etc.
Looks like the root cause is that memory hotplug relies on sparsemem, which
is not supported on 32-bit.

This patch is also going to need a refresh to apply cleanly due to other
patches that have gone in. I'll re-submit after looking into the build
break issues further.
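
As a rough illustration only (this is not the patch, and not necessarily what
the re-submission will look like), the idea of recording the nid when the LMB
is set up and guarding the lookup at compile time could be sketched as a
standalone program. Everything prefixed sketch_/SKETCH_ is hypothetical:
SKETCH_MEMORY_HOTPLUG stands in for a kernel config option,
sketch_physaddr_to_nid() stands in for memory_add_physaddr_to_nid() with a
made-up address-to-node mapping, and struct sketch_lmb is a cut-down stand-in
for the real LMB descriptor.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical switch standing in for a kernel config option. */
#define SKETCH_MEMORY_HOTPLUG 1

/* Hypothetical "no node recorded" marker. */
#define SKETCH_NO_NODE (-1)

/* Cut-down stand-in for the LMB descriptor; only the fields used here. */
struct sketch_lmb {
	uint64_t base_addr;
	int nid;	/* node the LMB was added to, recorded at setup time */
};

#if SKETCH_MEMORY_HOTPLUG
/*
 * Stand-in for memory_add_physaddr_to_nid(). The mapping is invented
 * for this sketch: one node per 256 MiB of physical address space.
 */
static int sketch_physaddr_to_nid(uint64_t addr)
{
	return (int)(addr >> 28);
}

/* Record the node once, when the LMB is initialised or added. */
static void sketch_lmb_set_nid(struct sketch_lmb *lmb)
{
	assert(lmb != NULL);
	lmb->nid = sketch_physaddr_to_nid(lmb->base_addr);
}
#else
/* Without hotplug support there is no lookup to call; store a marker. */
static void sketch_lmb_set_nid(struct sketch_lmb *lmb)
{
	assert(lmb != NULL);
	lmb->nid = SKETCH_NO_NODE;
}
#endif

/*
 * On removal, use the recorded node instead of asking the device tree,
 * which may have changed after a migration or PRRN event.
 */
static int sketch_lmb_remove_nid(const struct sketch_lmb *lmb)
{
	assert(lmb != NULL);
	return lmb->nid;
}
```

With SKETCH_MEMORY_HOTPLUG set to 0 the lookup helper is never referenced, so
a build without it would not hit the implicit declaration error above.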
-Nathan
>
> cheers
>