[PATCH V2] powerpc/perf: Enable PMU counters post partition migration if PMU is active
Michael Ellerman
mpe at ellerman.id.au
Fri Oct 22 11:19:10 AEDT 2021
Nathan Lynch <nathanl at linux.ibm.com> writes:
> Athira Rajeev <atrajeev at linux.vnet.ibm.com> writes:
>> During Live Partition Migration (LPM), it is observed that perf
>> counter values reports zero post migration completion. However
>> 'perf stat' with workload continues to show counts post migration
>> since PMU gets disabled/enabled during sched switches. But incase
>> of system/cpu wide monitoring, zero counts were reported with 'perf
>> stat' after migration completion.
>>
>> Example:
>> ./perf stat -e r1001e -I 1000
>> time counts unit events
>> 1.001010437 22,137,414 r1001e
>> 2.002495447 15,455,821 r1001e
>> <<>> As seen in next below logs, the counter values shows zero
>> after migration is completed.
>> <<>>
>> 86.142535370 129,392,333,440 r1001e
>> 87.144714617 0 r1001e
>> 88.146526636 0 r1001e
>> 89.148085029 0 r1001e
>
> Confirmed in my environment:
>
> 51.099987985 300,338 cache-misses
> 52.101839374 296,586 cache-misses
> 53.116089796 263,150 cache-misses
> 54.117949249 232,290 cache-misses
> 55.602029375 68,700,421,711 cache-misses
> 56.610073969 0 cache-misses
> 57.614732000 0 cache-misses
>
> I wonder what it means that there is a very unlikely huge value before
> the counter stops working -- I believe your example has this phenomenon
> too.
AFAICS the patch is not reading the PMC values before the migration, so
I suspect we're losing some counts just before the migration and then
the delta is going negative somewhere, leading to an implausibly large
count.
cheers
More information about the Linuxppc-dev
mailing list