[OpenPower-Firmware] op-build v2.0.5 released

Thu Jul 19 22:34:34 AEST 2018

Release Notes for OpenPower Firmware v2.0.5
*******************************************

op-build v2.0.5 was released on Thursday July 19th, 2018 and replaces
op-build v2.0.4 as the current stable release in the 2.0.x series.

It is recommended that v2.0.4 be used over any previous v2.0.x version
due to the bug fixes contained within.

Updated Packages
================

+-----------------------+-----------------------+-----------------------+------------------------------------------------------+
| Package               | Old Version           | New Version           | Platforms                                            |
+=======================+=======================+=======================+======================================================+
| skiboot               | v6.0.5                | v6.0.6                | openpower_mambo, firestone, firenze, garrison,       |
|                       |                       |                       | zaius, p9dsu, palmetto, pseries, vesnin,             |
|                       |                       |                       | witherspoon, habanero, openpower_p9_mambo, zz,       |
|                       |                       |                       | barreleye, romulus                                   |
+-----------------------+-----------------------+-----------------------+------------------------------------------------------+

Skiboot changes
===============

* phb4/CAPI: Reallocate PEC2 DMA-Read engines to improve GPU-Direct
  bandwidth

  We reallocate additional 16/8 DMA-Read engines allocated to stack0/1
  on PEC2 respectively. This is needed to improve bandwidth available
  to the Mellanox CX5 adapter when trying to read GPU memory (GPU-
  Direct).

  If kernel cxl driver indicates a request to allocate maximum
  possible DMA read engines when calling enable_capi_mode() and card
  is attached to PEC2/stack0 slot then we assume its a Mellanox CX5
  adapter. We then allocate additional 16/8 extra DMA read engines to
  stack0 and stack1 respectively on PEC2. This is done by populating
  the XPEC_PCI_PRDSTKOVR and XPEC_NEST_READ_STACK_OVERRIDE as
  suggested by the h/w team.

* phb4: Disable nodal scoped DMA accesses when PB pump mode is
  enabled

  By default when a PCIe device issues a read request via the PHB it
  is first issued with nodal scope. When accessing GPU memory the NPU
  does not know at the time of response if the requested memory page
  is off node or not. Therefore every read of GPU memory by a PHB is
  retried with larger scope which introduces bandwidth and latency
  issues.

  On smaller boxes which have pump mode enabled nodal and group scoped
  reads are treated the same and both types of request are broadcast
  to one chip. Therefore we can avoid the retry by disabling nodal
  scope on the PHB for these boxes. On larger boxes nodal (single
  chip) and group (multiple chip) scoped reads are treated
  differently. Therefore we avoid disabling nodal scope on large boxes
  which have pump mode disabled to avoid all PHB requests being
  broadcast to multiple chips.

* npu2/hw-procedures: Enable parity and credit overflow checks

  Enable these error checking features by setting the appropriate bits
  in our one-off initialization of each “NTL Misc Config 2” register.

  The exception is NDL RX parity checking, which should be disabled
  during the link training procedures.

-- 
Stewart Smith
OPAL Architect, IBM.