[PATCH] openvswitch: make for_each_node loops work with sparse numa systems
nacc at linux.vnet.ibm.com
Wed Jul 22 08:04:30 AEST 2015
On 21.07.2015 [11:30:58 -0500], Chris J Arges wrote:
> On Tue, Jul 21, 2015 at 09:24:18AM -0700, Nishanth Aravamudan wrote:
> > On 21.07.2015 [10:32:34 -0500], Chris J Arges wrote:
> > > Some architectures like POWER can have a NUMA node_possible_map that
> > > contains sparse entries. This causes memory corruption with openvswitch
> > > since it allocates flow_cache with a multiple of num_possible_nodes() and
> > Couldn't this also be fixed by just allocationg with a multiple of
> > nr_node_ids (which seems to have been the original intent all along)?
> > You could then make your stats array be sparse or not.
> Yea originally this is what I did, but I thought it would be wasting memory.
> > > assumes the node variable returned by for_each_node will index into
> > > flow->stats[node].
> > >
> > > For example, if node_possible_map is 0x30003, this patch will map node to
> > > node_cnt as follows:
> > > 0,1,16,17 => 0,1,2,3
> > >
> > > The crash was noticed after 3af229f2 was applied as it changed the
> > > node_possible_map to match node_online_map on boot.
> > > Fixes: 3af229f2071f5b5cb31664be6109561fbe19c861
> > My concern with this version of the fix is that you're relying on,
> > implicitly, the order of for_each_node's iteration corresponding to the
> > entries in stats 1:1. But what about node hotplug? It seems better to
> > have the enumeration of the stats array match the topology accurately,
> > rather, or to maintain some sort of internal map in the OVS code between
> > the NUMA node and the entry in the stats array?
> > I'm willing to be convinced otherwise, though :)
> > -Nish
> The method I described should work for hotplug since it's using possible map
> which AFAIK is static rather than the online map.
Oh you're right, I'm sorry!
More information about the Linuxppc-dev