[Skiboot] [RFC PATCH] skiboot machine check handler

Mahesh J Salgaonkar mahesh at linux.vnet.ibm.com
Thu Jan 16 18:03:04 AEDT 2020


On 2019-12-11 20:01:18 Wed, Nicholas Piggin wrote:
> Provide facilities to decode machine checks into human readable
> strings, with only sufficient information required to deal with
> them sanely.
> 
> The old machine check stuff was over engineered. The philosophy
> here is that OPAL should correct anything it possibly can, what
> it can't handle but the OS might be able to do something with
> (e.g., uncorrected memory error or SLB multi-hit), it passes back
> to Linux. Anything else, the OS doesn't care. It doesn't want a
> huge struct of severities and levels and originators etc that it
> can't do anything with -- just provide human readable strings
> for what happened and what was done with it.
> 
> A Linux driver for this will be able to cope with new processors.
> 
> This also uses the same facility to decode machine checks in OPAL
> boot.
> 
> The code is a bit in flux because it's sitting on top of a few
> other RFC patches and not quite complete, just wanted opinions
> about it.

opal_handle_mce() may have to be treated as special opal call. For MCE
that occurs in OPAL context, Linux making opal call will clobber
original opal call stack which hit MCE. Same is true with nested MCE in
OPAL. Should it just continue using same r1 to avoid clobbering or have
a separate stack for mce opal call ?

Thanks,
-Mahesh.



More information about the Skiboot mailing list