[Skiboot] skiboot 6.0.6 released!

Stewart Smith stewart at linux.ibm.com
Thu Jul 19 19:57:51 AEST 2018


skiboot-6.0.6
*************

skiboot 6.0.6 was released on Thursday July 19th, 2018. It replaces
skiboot-6.0.5 as the current stable release in the 6.0.x series.

It is recommended that 6.0.6 be used instead of any previous 6.0.x
version, especially where NVLINK2 GPUs and/or Mellanox CX5 adapters
are in use.

Over skiboot-6.0.5 we have several important performance-related bug
fixes and one stability bug fix:

* phb4/CAPI: Reallocate PEC2 DMA-Read engines to improve GPU-Direct
  bandwidth

  We allocate an additional 16 and 8 DMA-Read engines to stack0 and
  stack1 on PEC2, respectively. This is needed to improve the bandwidth
  available to the Mellanox CX5 adapter when it reads GPU memory
  (GPU-Direct).

  If the kernel cxl driver requests the maximum possible number of DMA
  read engines when calling enable_capi_mode(), and the card is
  attached to the PEC2/stack0 slot, then we assume it is a Mellanox CX5
  adapter. We then allocate 16 and 8 extra DMA read engines to stack0
  and stack1, respectively, on PEC2. This is done by populating the
  XPEC_PCI_PRDSTKOVR and XPEC_NEST_READ_STACK_OVERRIDE registers as
  suggested by the h/w team.
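
  The decision described above can be sketched in C roughly as
  follows. This is a minimal illustration, not the skiboot
  implementation: the enum, the struct and the function name are all
  hypothetical, and the actual register writes are omitted.

  ```c
  /* Hypothetical sketch: none of these names come from skiboot. */
  enum dma_engine_request {
      DMA_ENGINES_DEFAULT,  /* driver asked for the default allocation */
      DMA_ENGINES_MAX       /* driver asked for the maximum possible */
  };

  struct extra_read_engines {
      unsigned int stack0;  /* extra DMA read engines for PEC2/stack0 */
      unsigned int stack1;  /* extra DMA read engines for PEC2/stack1 */
  };

  /*
   * Return how many extra DMA read engines to give each PEC2 stack.
   * Extra engines are granted only when the maximum was requested and
   * the card sits in the PEC2/stack0 slot (assumed to be a CX5).
   */
  struct extra_read_engines
  extra_engines_for_slot(enum dma_engine_request req,
                         unsigned int pec, unsigned int stack)
  {
      struct extra_read_engines extra = { 0, 0 };

      if (req == DMA_ENGINES_MAX && pec == 2 && stack == 0) {
          extra.stack0 = 16;
          extra.stack1 = 8;
      }
      return extra;
  }
  ```

  In this sketch, a maximum request from a card on PEC2/stack0 yields
  16 and 8 extra engines, and every other combination yields none.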

* phb4: Disable nodal scoped DMA accesses when PB pump mode is
  enabled

  By default, when a PCIe device issues a read request via the PHB, it
  is first issued with nodal scope. When accessing GPU memory, the NPU
  does not know at the time of response whether the requested memory
  page is off-node. Therefore every read of GPU memory by a PHB is
  retried with a larger scope, which hurts both bandwidth and latency.

  On smaller boxes, which have pump mode enabled, nodal and group
  scoped reads are treated the same and both types of request are
  broadcast to one chip. Therefore we can avoid the retry by disabling
  nodal scope on the PHB for these boxes. On larger boxes, which have
  pump mode disabled, nodal (single chip) and group (multiple chip)
  scoped reads are treated differently. We therefore leave nodal scope
  enabled on large boxes, so that not every PHB request is broadcast
  to multiple chips.
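
  The policy above reduces to a single check on pump mode. A minimal
  sketch in C, using hypothetical names and assuming that a PHB with
  nodal scope disabled issues its first attempt at group scope:

  ```c
  #include <stdbool.h>

  /* Hypothetical sketch: these names are illustrative, not skiboot's. */
  enum dma_read_scope {
      SCOPE_NODAL,  /* single chip */
      SCOPE_GROUP   /* multiple chips */
  };

  /* Nodal scope is disabled only on boxes with PB pump mode enabled. */
  bool phb_disable_nodal_scope(bool pump_mode_enabled)
  {
      return pump_mode_enabled;
  }

  /* Scope of the first read attempt under the assumption stated above. */
  enum dma_read_scope phb_first_read_scope(bool pump_mode_enabled)
  {
      return phb_disable_nodal_scope(pump_mode_enabled) ? SCOPE_GROUP
                                                        : SCOPE_NODAL;
  }
  ```

  With pump mode enabled the first attempt already uses the larger
  scope, so the retry for GPU memory reads is avoided; with pump mode
  disabled the default nodal-first behaviour is kept.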

* npu2/hw-procedures: Enable parity and credit overflow checks

  Enable these error checking features by setting the appropriate bits
  in our one-off initialization of each “NTL Misc Config 2” register.

  The exception is NDL RX parity checking, which should be disabled
  during the link training procedures.
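
  As an illustration of the bit handling involved, here is a minimal
  sketch in C. The bit positions and names are made up for the
  example and do not reflect the real register layout; the sketch also
  assumes that NDL RX parity checking is simply left cleared at
  initialization time.

  ```c
  #include <stdint.h>

  /* Hypothetical bit assignments, NOT the real register layout. */
  #define CHK_NTL_PARITY       (UINT64_C(1) << 0)
  #define CHK_CREDIT_OVERFLOW  (UINT64_C(1) << 1)
  #define CHK_NDL_RX_PARITY    (UINT64_C(1) << 2)

  /*
   * Compute the one-off initialization value: turn on the parity and
   * credit overflow checks, but keep NDL RX parity checking off so it
   * does not interfere with link training.
   */
  uint64_t misc_config2_init(uint64_t reg)
  {
      reg |= CHK_NTL_PARITY | CHK_CREDIT_OVERFLOW;
      reg &= ~CHK_NDL_RX_PARITY;
      return reg;
  }
  ```

  Applied to a register value of zero, this sets only the two
  hypothetical check bits and leaves the NDL RX parity bit clear.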

-- 
Stewart Smith
OPAL Architect, IBM.


