[PATCH 1/1] powerpc/pseries: Enable RAS hotplug events late

Balbir Singh bsingharora at gmail.com
Mon Feb 12 16:46:41 AEDT 2018


On Mon, Feb 12, 2018 at 11:19 AM, Sam Bobroff <sam.bobroff at au1.ibm.com> wrote:
> Currently if the kernel receives a memory hot-unplug event early
> enough, it may get stuck in an infinite loop in
> dissolve_free_huge_pages(). This appears as a stall just after:
>
> pseries-hotplug-mem: Attempting to hot-remove XX LMB(s) at YYYYYYYY
>
> It appears to be caused by "minimum_order" being uninitialized, due to
> init_ras_IRQ() executing before hugetlb_init().
>
> To correct this, extract the part of init_ras_IRQ() that enables
> hotplug event processing and place it in the machine_late_initcall
> phase, which is guaranteed to be after hugetlb_init() is called.
>
> Signed-off-by: Sam Bobroff <sam.bobroff at au1.ibm.com>
> ---
>  arch/powerpc/platforms/pseries/ras.c | 29 +++++++++++++++++++++--------
>  1 file changed, 21 insertions(+), 8 deletions(-)
>
> diff --git a/arch/powerpc/platforms/pseries/ras.c b/arch/powerpc/platforms/pseries/ras.c
> index 81d8614e7379..ba284949af06 100644
> --- a/arch/powerpc/platforms/pseries/ras.c
> +++ b/arch/powerpc/platforms/pseries/ras.c
> @@ -66,6 +66,26 @@ static int __init init_ras_IRQ(void)
>                 of_node_put(np);
>         }
>
> +       /* EPOW Events */
> +       np = of_find_node_by_path("/event-sources/epow-events");
> +       if (np != NULL) {
> +               request_event_sources_irqs(np, ras_epow_interrupt, "RAS_EPOW");
> +               of_node_put(np);
> +       }
> +
> +       return 0;
> +}
> +machine_subsys_initcall(pseries, init_ras_IRQ);
> +
> +/*
> + * Enable the hotplug interrupt late because processing them may touch other
> + * devices or systems (e.g. hugepages) that have not been initialized at the
> + * subsys stage.
> + */
> +int __init init_ras_hotplug_IRQ(void)
> +{
> +       struct device_node *np;
> +
>         /* Hotplug Events */
>         np = of_find_node_by_path("/event-sources/hot-plug-events");
>         if (np != NULL) {
> @@ -75,16 +95,9 @@ static int __init init_ras_IRQ(void)
>                 of_node_put(np);
>         }
>
> -       /* EPOW Events */
> -       np = of_find_node_by_path("/event-sources/epow-events");
> -       if (np != NULL) {
> -               request_event_sources_irqs(np, ras_epow_interrupt, "RAS_EPOW");
> -               of_node_put(np);
> -       }
> -
>         return 0;
>  }
> -machine_subsys_initcall(pseries, init_ras_IRQ);
> +machine_late_initcall(pseries, init_ras_hotplug_IRQ);
>

Seems reasonable to me, the other RAS events internal error and epow
seem like they are in the right place.

Acked-by: Balbir Singh <bsingharora at gmail.com>


More information about the Linuxppc-dev mailing list