[PATCH] hwmon: (peci/dimmtemp) Do not provide fake thresholds data

Winiarska, Iwona iwona.winiarska at intel.com
Tue Jan 28 03:40:52 AEDT 2025


On Thu, 2025-01-23 at 15:20 +0300, Paul Fertser wrote:
> When an Icelake or Sapphire Rapids CPU isn't providing the maximum and
> critical thresholds for particular DIMM the driver should return an
> error to the userspace instead of giving it stale (best case) or wrong
> (the structure contains all zeros after kzalloc() call) data.
> 
> The issue can be reproduced by binding the peci driver while the host is
> fully booted and idle, this makes PECI interaction unreliable enough.
> 
> Fixes: 73bc1b885dae ("hwmon: peci: Add dimmtemp driver")
> Fixes: 621995b6d795 ("hwmon: (peci/dimmtemp) Add Sapphire Rapids support")
> Cc: stable at vger.kernel.org
> Signed-off-by: Paul Fertser <fercerpav at gmail.com>

Hi!

Thank you for the patch.
Did you have a chance to test it with OpenBMC dbus-sensors?
In general, the change looks okay to me, but since it modifies the behavior
(applications will need to handle this, and returning an error will happen more
often) we need to confirm that it does not cause any regressions for userspace.

Once we are able to confirm that:

Reviewed-by: Iwona Winiarska <iwona.winiarska at intel.com>

Thanks
-Iwona

> ---
>  drivers/hwmon/peci/dimmtemp.c | 10 ++++------
>  1 file changed, 4 insertions(+), 6 deletions(-)
> 
> diff --git a/drivers/hwmon/peci/dimmtemp.c b/drivers/hwmon/peci/dimmtemp.c
> index d6762259dd69..fbe82d9852e0 100644
> --- a/drivers/hwmon/peci/dimmtemp.c
> +++ b/drivers/hwmon/peci/dimmtemp.c
> @@ -127,8 +127,6 @@ static int update_thresholds(struct peci_dimmtemp *priv,
> int dimm_no)
>  		return 0;
>  
>  	ret = priv->gen_info->read_thresholds(priv, dimm_order, chan_rank,
> &data);
> -	if (ret == -ENODATA) /* Use default or previous value */
> -		return 0;
>  	if (ret)
>  		return ret;
>  
> @@ -509,11 +507,11 @@ read_thresholds_icx(struct peci_dimmtemp *priv, int
> dimm_order, int chan_rank, u
>  
>  	ret = peci_ep_pci_local_read(priv->peci_dev, 0, 13, 0, 2, 0xd4,
> &reg_val);
>  	if (ret || !(reg_val & BIT(31)))
> -		return -ENODATA; /* Use default or previous value */
> +		return -ENODATA;
>  
>  	ret = peci_ep_pci_local_read(priv->peci_dev, 0, 13, 0, 2, 0xd0,
> &reg_val);
>  	if (ret)
> -		return -ENODATA; /* Use default or previous value */
> +		return -ENODATA;
>  
>  	/*
>  	 * Device 26, Offset 224e0: IMC 0 channel 0 -> rank 0
> @@ -546,11 +544,11 @@ read_thresholds_spr(struct peci_dimmtemp *priv, int
> dimm_order, int chan_rank, u
>  
>  	ret = peci_ep_pci_local_read(priv->peci_dev, 0, 30, 0, 2, 0xd4,
> &reg_val);
>  	if (ret || !(reg_val & BIT(31)))
> -		return -ENODATA; /* Use default or previous value */
> +		return -ENODATA;
>  
>  	ret = peci_ep_pci_local_read(priv->peci_dev, 0, 30, 0, 2, 0xd0,
> &reg_val);
>  	if (ret)
> -		return -ENODATA; /* Use default or previous value */
> +		return -ENODATA;
>  
>  	/*
>  	 * Device 26, Offset 219a8: IMC 0 channel 0 -> rank 0



More information about the openbmc mailing list