[Skiboot] [PATCH 0/2] Enable reporting of frozen NVLink bricks

Stewart Smith stewart at linux.vnet.ibm.com
Mon Jan 15 17:39:10 AEDT 2018


Alistair Popple <alistair at popple.id.au> writes:
> When a GPU does an invalid access via NVLink2 it can cause the NPU to
> freeze the associated PE. An interrupt is raised when this occurs, but
> no interrupt handler is registered. This series fixes a bug in the
> existing NPU2 interrupt setup and adds an interrupt handler to report the
> error as an EEH event.
>
> This is similar to what is done for NVLink1 and allows the operating system
> to report the error instead of it being ignored.
>
> Alistair Popple (2):
>   npu2.c: Fix XIVE IRQ alignment
>   npu2.c: Add PE error detection
>
>  hw/npu2.c           | 57 ++++++++++++++++++++++++++++++++++++++++++++++++++---
>  include/npu2-regs.h | 17 +---------------
>  2 files changed, 55 insertions(+), 19 deletions(-)

Series merged to master as of 695bb562a315d4402fe3e82e93ed72265cefa8db
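
For anyone reading the archive without the patches to hand, the flow the
cover letter describes (the PE gets frozen, an interrupt fires, and the
handler flags an EEH event for the OS) can be sketched roughly as below.
This is only an illustrative model and not the code in hw/npu2.c: the
struct, the bit layout of the freeze status, MAX_PES and the function
name are all assumptions made up for the example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical model; none of these names come from hw/npu2.c. */
#define MAX_PES 16

struct npu_model {
    uint64_t pe_freeze_status; /* assumed layout: bit n set => PE n frozen */
    uint64_t eeh_pending;      /* PEs already flagged to the OS as EEH events */
};

/*
 * Error interrupt handler for the model: find PEs that are frozen but not
 * yet reported, log them, and mark them pending so the OS can pick them up.
 * Returns the number of newly reported PEs.
 */
static int npu_model_err_interrupt(struct npu_model *npu)
{
    uint64_t fresh;
    int pe, count = 0;

    assert(npu != NULL);

    /* Only consider the PEs this model knows about. */
    fresh = npu->pe_freeze_status & ((1ULL << MAX_PES) - 1);
    /* Skip PEs that were reported by an earlier interrupt. */
    fresh &= ~npu->eeh_pending;

    for (pe = 0; pe < MAX_PES; pe++) {
        if (fresh & (1ULL << pe)) {
            printf("PE %d frozen, flagging EEH event\n", pe);
            count++;
        }
    }

    npu->eeh_pending |= fresh;
    return count;
}
```

Tracking what has already been reported keeps a repeated interrupt for the
same frozen PE from raising a duplicate event; whether the real handler
needs that depends on how the hardware latches the status, which this
sketch does not attempt to model.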

-- 
Stewart Smith
OPAL Architect, IBM.


