Oops (NULL pointer) with 'perf record' of selftest 'null_syscall'
Madhavan Srinivasan
maddy at linux.ibm.com
Wed Jun 16 16:33:44 AEST 2021
On 6/16/21 11:56 AM, Christophe Leroy wrote:
>
>
> Le 16/06/2021 à 05:40, Athira Rajeev a écrit :
>>
>>
>>> On 16-Jun-2021, at 8:53 AM, Madhavan Srinivasan
>>> <maddy at linux.ibm.com> wrote:
>>>
>>>
>>> On 6/15/21 8:35 PM, Christophe Leroy wrote:
>>>> For your information, I'm getting the following Oops. Detected with
>>>> 5.13-rc6, it also oopses on 5.12 and 5.11.
>>>> Runs ok on 5.10. I'm starting bisecting now.
>>>
>>>
>>> Thanks for reporting, got the issue. What has happened in this case
>>> is that, pmu device is not registered
>>> and trying to access the instruction point which will land in
>>> perf_instruction_pointer(). And recently I have added
>>> a workaround patch for power10 DD1 which has caused this breakage.
>>> My bad. We are working on a fix patch
>>> for the same and will post it out. Sorry again.
>>>
>>
>> Hi Christophe,
>>
>> Can you please try with below patch in your environment and test if
>> it works for you.
>>
>> From 55d3afc9369dfbe28a7152c8e9f856c11c7fe43d Mon Sep 17 00:00:00 2001
>> From: Athira Rajeev <atrajeev at linux.vnet.ibm.com>
>> Date: Tue, 15 Jun 2021 22:28:11 -0400
>> Subject: [PATCH] powerpc/perf: Fix crash with
>> 'perf_instruction_pointer' when
>> pmu is not set
>>
>> On systems without any specific PMU driver support registered, running
>> perf record causes oops:
>>
>> [ 38.841073] NIP [c00000000013af54]
>> perf_instruction_pointer+0x24/0x100
>> [ 38.841079] LR [c0000000003c7358] perf_prepare_sample+0x4e8/0x820
>> [ 38.841085] --- interrupt: 300
>> [ 38.841088] [c00000001cf03440] [c0000000003c6ef8]
>> perf_prepare_sample+0x88/0x820 (unreliable)
>> [ 38.841096] [c00000001cf034a0] [c0000000003c76d0]
>> perf_event_output_forward+0x40/0xc0
>> [ 38.841104] [c00000001cf03520] [c0000000003b45e8]
>> __perf_event_overflow+0x88/0x1b0
>> [ 38.841112] [c00000001cf03570] [c0000000003b480c]
>> perf_swevent_hrtimer+0xfc/0x1a0
>> [ 38.841119] [c00000001cf03740] [c0000000002399cc]
>> __hrtimer_run_queues+0x17c/0x380
>> [ 38.841127] [c00000001cf037c0] [c00000000023a5f8]
>> hrtimer_interrupt+0x128/0x2f0
>> [ 38.841135] [c00000001cf03870] [c00000000002962c]
>> timer_interrupt+0x13c/0x370
>> [ 38.841143i] [c00000001cf038d0] [c000000000009ba4]
>> decrementer_common_virt+0x1a4/0x1b0
>> [ 38.841151] --- interrupt: 900 at copypage_power7+0xd4/0x1c0
>>
>> During perf record session, perf_instruction_pointer() is called to
>> capture the sample ip. This function in core-book3s accesses
>> ppmu->flags.
>> If a platform specific PMU driver is not registered, ppmu is set to NULL
>> and accessing its members results in a crash. Fix this crash by checking
>> if ppmu is set.
>>
>> Signed-off-by: Athira Rajeev <atrajeev at linux.vnet.ibm.com>
>> Reported-by: Christophe Leroy <christophe.leroy at csgroup.eu>
>
> Fixes: 2ca13a4cc56c ("powerpc/perf: Use regs->nip when SIAR is zero")
> Cc: stable at vger.kernel.org
> Tested-by: Christophe Leroy <christophe.leroy at csgroup.eu>
Thanks, but just wonder what is the system config and processor version
in which you got this fail.
Reason, we do have generic-pmu which should kick-in in absence of a
platform specific driver.
Maddy
>
>> ---
>> arch/powerpc/perf/core-book3s.c | 2 +-
>> 1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> diff --git a/arch/powerpc/perf/core-book3s.c
>> b/arch/powerpc/perf/core-book3s.c
>> index 16d4d1b6a1ff..816756588cb7 100644
>> --- a/arch/powerpc/perf/core-book3s.c
>> +++ b/arch/powerpc/perf/core-book3s.c
>> @@ -2254,7 +2254,7 @@ unsigned long perf_instruction_pointer(struct
>> pt_regs *regs)
>> bool use_siar = regs_use_siar(regs);
>> unsigned long siar = mfspr(SPRN_SIAR);
>> - if (ppmu->flags & PPMU_P10_DD1) {
>> + if (ppmu && ppmu->flags & PPMU_P10_DD1) {
>> if (siar)
>> return siar;
>> else
>>
More information about the Linuxppc-dev
mailing list