[Skiboot] [PATCH 0/2] Enable reporting of frozen NVLink bricks
stewart at linux.vnet.ibm.com
Mon Jan 15 17:39:10 AEDT 2018
Alistair Popple <alistair at popple.id.au> writes:
> When a GPU does an invalid access via NVLink2 it can cause the NPU to
> freeze the associated PE. An interrupt is raised when this occurs; however,
> no interrupt handler is registered. This series fixes a bug with the
> existing NPU2 interrupt setup and adds an interrupt handler to report the
> error as an EEH event.
> This is similar to what is done for NVLink1 and allows the operating system
> to report the error instead of it being ignored.
>
> Alistair Popple (2):
>   npu2.c: Fix XIVE IRQ alignment
>   npu2.c: Add PE error detection
>
>  hw/npu2.c           | 57 ++++++++++++++++++++++++++++++++++++++++++++++++++---
>  include/npu2-regs.h | 17 +---------------
>  2 files changed, 55 insertions(+), 19 deletions(-)
Series merged to master as of 695bb562a315d4402fe3e82e93ed72265cefa8db
OPAL Architect, IBM.