[PATCH] powerpc/papr_scm: Fix nvdimm event mappings
kajoljain
kjain at linux.ibm.com
Tue Jun 28 16:04:51 AEST 2022
On 6/27/22 12:05, Michael Ellerman wrote:
> Hi Kajol,
>
> A few comments below ...
Hi Michael,
Thanks for reviewing the patch. I will make the changes suggested by
you and send version 2 of this patch.
Thanks,
Kajol Jain
>
> Kajol Jain <kjain at linux.ibm.com> writes:
>> Commit 4c08d4bbc089 ("powerpc/papr_scm: Add perf interface support")
>> adds performance monitoring support for papr-scm nvdimm devices via
> ^
> We're talking about a commit that's already happened so we should use
> past tense, so "added".
>
>> perf interface. It also adds one array in papr_scm_priv
> "added"
>> structure called "nvdimm_events_map", to dynamically save the stat_id
>> for events specified in nvdimm driver code "nd_perf.c".
>>
>> Right now the mapping is done based on the result of
>> H_SCM_PERFORMANCE_STATS hcall, when all the stats are
>> requested. Currently there is an assumption, that a
>> certain stat will always be found at a specific offset
>> in the stat buffer.
> ^
> "returned by the hypervisor."
>
> To make it clear where the stat buffer comes from, and that it's out of
> our control.
>
>> The assumption may not be true or documented as part of PAPR
>> documentation.
>
> That reads as the assumption "may not be documented as part of PAPR". I
> think what you mean is the assumption *is not* documented by PAPR, and
> although it happens to be true on current systems it may not be true in
> future.
>
>> Fixing it, by adding a static mapping for nvdimm events to
> Fix it
>> corresponding stat-id, and removing the map from
>> papr_scm_priv structure.
>>
>> Fixes: 4c08d4bbc089 ("powerpc/papr_scm: Add perf interface support")
>> Reported-by: Aneesh Kumar K.V <aneesh.kumar at linux.ibm.com>
>> Signed-off-by: Kajol Jain <kjain at linux.ibm.com>
>> ---
>> arch/powerpc/platforms/pseries/papr_scm.c | 59 ++++++++++-------------
>> 1 file changed, 25 insertions(+), 34 deletions(-)
>>
>> diff --git a/arch/powerpc/platforms/pseries/papr_scm.c b/arch/powerpc/platforms/pseries/papr_scm.c
>> index 181b855b3050..5434c654a797 100644
>> --- a/arch/powerpc/platforms/pseries/papr_scm.c
>> +++ b/arch/powerpc/platforms/pseries/papr_scm.c
>> @@ -350,6 +347,26 @@ static ssize_t drc_pmem_query_stats(struct papr_scm_priv *p,
>> #ifdef CONFIG_PERF_EVENTS
>> #define to_nvdimm_pmu(_pmu) container_of(_pmu, struct nvdimm_pmu, pmu)
>>
>> +static const char * const nvdimm_events_map[] = {
>> + "N/A",
>> + "CtlResCt",
>> + "CtlResTm",
>> + "PonSecs ",
>> + "MemLife ",
>> + "CritRscU",
>> + "HostLCnt",
>> + "HostSCnt",
>> + "HostSDur",
>> + "HostLDur",
>> + "MedRCnt ",
>> + "MedWCnt ",
>> + "MedRDur ",
>> + "MedWDur ",
>> + "CchRHCnt",
>> + "CchWHCnt",
>> + "FastWCnt",
>> +};
>
> The order of the strings in that array becomes ABI. Because it defines
> the mapping from perf_event.attr.config (perf user ABI) to the actual
> event we request from the hypervisor.
>
> So I'd like that made more explicit by using designated initialisers, eg:
>
> static const char * const nvdimm_events_map[] = {
> [1] = "CtlResCt",
> [2] = "CtlResTm",
> ...
>
> That way an accidental reordering of the array won't break anything.
Yes make sense. Will do update it.
>
> You shouldn't need to specify 0 either as it's not used.
>
>> @@ -370,7 +387,7 @@ static int papr_scm_pmu_get_value(struct perf_event *event, struct device *dev,
>>
>> stat = &stats->scm_statistic[0];
>> memcpy(&stat->stat_id,
>> - &p->nvdimm_events_map[event->attr.config * sizeof(stat->stat_id)],
>> + nvdimm_events_map[event->attr.config],
>> sizeof(stat->stat_id));
>
> It's not clear that this won't index off the end of the array.
>
> There is a check in papr_scm_pmu_event_init(), but I'd probably be
> happier if we did an explicit check in here as well, eg:
>
> if (event->attr.config >= ARRAY_SIZE(nvdimm_events_map))
> return -EINVAL;
>
>
>> stat->stat_val = 0;
>>
>> @@ -460,10 +477,9 @@ static void papr_scm_pmu_del(struct perf_event *event, int flags)
>>
>> static int papr_scm_pmu_check_events(struct papr_scm_priv *p, struct nvdimm_pmu *nd_pmu)
>> {
>> - struct papr_scm_perf_stat *stat;
>> struct papr_scm_perf_stats *stats;
>> u32 available_events;
>> - int index, rc = 0;
>> + int rc = 0;
>
> You shouldn't need to initialise rc here. It's not used until the call
> to drc_pmem_query_stats() below.
Ok sure.
>
>> available_events = (p->stat_buffer_len - sizeof(struct papr_scm_perf_stats))
>> / sizeof(struct papr_scm_perf_stat);
>> @@ -473,34 +489,12 @@ static int papr_scm_pmu_check_events(struct papr_scm_priv *p, struct nvdimm_pmu
>> /* Allocate the buffer for phyp where stats are written */
>> stats = kzalloc(p->stat_buffer_len, GFP_KERNEL);
>> if (!stats) {
>> - rc = -ENOMEM;
>> - return rc;
>> + return -ENOMEM;
>> }
>>
>> /* Called to get list of events supported */
>> rc = drc_pmem_query_stats(p, stats, 0);
>> - if (rc)
>> - goto out;
>>
>> - /*
>> - * Allocate memory and populate nvdimm_event_map.
>> - * Allocate an extra element for NULL entry
>> - */
>> - p->nvdimm_events_map = kcalloc(available_events + 1,
>> - sizeof(stat->stat_id),
>> - GFP_KERNEL);
>> - if (!p->nvdimm_events_map) {
>> - rc = -ENOMEM;
>> - goto out;
>> - }
>> -
>> - /* Copy all stat_ids to event map */
>> - for (index = 0, stat = stats->scm_statistic;
>> - index < available_events; index++, ++stat) {
>> - memcpy(&p->nvdimm_events_map[index * sizeof(stat->stat_id)],
>> - &stat->stat_id, sizeof(stat->stat_id));
>> - }
>> -out:
>> kfree(stats);
>> return rc;
>> }
>
> cheers
More information about the Linuxppc-dev
mailing list