[Skiboot] [PATCH skiboot] npu2: Reset NVLinks when resetting a GPU

Stewart Smith stewart at linux.ibm.com
Mon Jun 3 12:04:26 AEST 2019


Alexey Kardashevskiy <aik at ozlabs.ru> writes:
> Resetting a V100 GPU brings its NVLinks down, and if an NPU then tries to
> use those links, an HMI occurs. We were lucky not to observe this: bare
> metal does not normally reset a GPU, and when GPUs are passed through they
> usually come before NPUs in the QEMU command line or Libvirt XML, so the
> NPUs are naturally reset first. However, simply changing the device order
> triggers HMIs.
>
> This defines a bus control filter for a PCI slot with a GPU with NVLinks
> so that when the host system issues a secondary bus reset to the slot, it
> also resets the associated NVLinks.
>
> Signed-off-by: Alexey Kardashevskiy <aik at ozlabs.ru>
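
For readers of the archive, the mechanism described in the commit message can be sketched roughly as below. This is a minimal standalone illustration, not the code from the patch: the struct, the function names and the reset counter are hypothetical, and it assumes the NVLinks are reset when the Secondary Bus Reset bit is written as 1. Only the Bridge Control register offset (0x3E) and the Secondary Bus Reset bit (bit 6) come from the standard PCI-to-PCI bridge config space layout.

```c
#include <stdbool.h>
#include <stdint.h>

/* Standard PCI-to-PCI bridge config space: Bridge Control register. */
#define BRIDGE_CONTROL_OFFSET     0x3E
#define BRIDGE_CTL_SEC_BUS_RESET  0x0040  /* bit 6: Secondary Bus Reset */

/* Hypothetical per-slot state for a GPU that has NVLinks attached. */
struct gpu_slot {
	int nvlink_count;   /* number of NVLinks associated with the GPU */
	int nvlinks_reset;  /* running total of link resets performed */
};

/* Hypothetical stand-in for the real NVLink reset procedure. */
static void reset_nvlinks(struct gpu_slot *slot)
{
	slot->nvlinks_reset += slot->nvlink_count;
}

/*
 * Filter applied to config-space writes aimed at the slot's bridge.
 * Returns true when the write asserts Secondary Bus Reset, in which
 * case the associated NVLinks are reset as well.
 */
static bool bridge_ctl_write_filter(struct gpu_slot *slot, uint32_t offset,
				    uint32_t len, uint32_t data)
{
	/* Only 2-byte writes to the Bridge Control register are of interest. */
	if (offset != BRIDGE_CONTROL_OFFSET || len != 2)
		return false;

	/* Ignore writes that do not set the reset bit. */
	if (!(data & BRIDGE_CTL_SEC_BUS_RESET))
		return false;

	reset_nvlinks(slot);
	return true;
}
```

The point of hooking the bridge register write, rather than relying on device ordering, is that the links get reset no matter which device the host or guest resets first.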

Merged to master as of 7c977c734e1c4d3be9a036a075798530d352d8e3. Sorry
for the delay.

Does this need to also go to stable?

-- 
Stewart Smith
OPAL Architect, IBM.


