[PATCH] perf/core: Fix the mask in perf_output_sample_regs
Madhavan Srinivasan
maddy at linux.vnet.ibm.com
Tue Aug 16 15:29:05 AEST 2016
On Thursday 11 August 2016 05:57 PM, Peter Zijlstra wrote:
> Sorry, found it in my inbox while clearing out backlog..
>
> On Sun, Jul 03, 2016 at 11:31:58PM +0530, Madhavan Srinivasan wrote:
>> When decoding the perf_regs mask in perf_output_sample_regs(),
>> we loop through the mask using find_first_bit and find_next_bit functions.
>> While the existing code works fine in most cases,
>> the logic is broken for 32-bit big-endian kernels.
>> When reading the u64 mask using (u32 *)(&val)[0], find_*_bit() assumes it gets
>> the lower 32 bits of the u64 but instead gets the upper 32 bits, which is wrong.
>> The proposed fix is to swap the words of the u64 to handle this case.
>> This is _not_ an endianness swap.
> But it looks an awful lot like it..
I hit this issue when testing my perf_arch_regs patchset. Yes, that is
exactly why I added that comment to the commit message.
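
To make the word-order problem concrete, here is a minimal userspace sketch.
It is not part of the patch: the helper names are hypothetical, and uint32_t
stands in for a 32-bit unsigned long so that it behaves the same on any host.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical helper: walk a two-word bitmap the way the find_*_bit()
 * helpers would on a 32-bit kernel, where word 0 holds bits 0..31 and
 * word 1 holds bits 32..63. Stores the set bit numbers in out[] and
 * returns how many were found.
 */
static size_t collect_set_bits(const uint32_t words[2], int out[64])
{
	size_t n = 0;

	for (int bit = 0; bit < 64; bit++)
		if (words[bit / 32] & (UINT32_C(1) << (bit % 32)))
			out[n++] = bit;
	return n;
}

/*
 * What a cast of &mask to a word pointer yields on 32-bit big endian:
 * the most significant word sits at the lower address, so it comes first.
 */
static void u64_as_be32_words(uint64_t mask, uint32_t words[2])
{
	words[0] = (uint32_t)(mask >> 32);
	words[1] = (uint32_t)mask;
}

/* The word order the bitmap helpers expect: low word first. */
static void u64_to_bitmap32(uint64_t mask, uint32_t words[2])
{
	words[0] = (uint32_t)mask;
	words[1] = (uint32_t)(mask >> 32);
}
```

With mask = 0x5 (bits 0 and 2 set), the big-endian view reports bits 32 and
34, while the low-word-first layout reports bits 0 and 2. Only the order of
the two words changes, the bytes inside each word are left alone, which is
why this is not an endianness swap.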
>
>> +++ b/kernel/events/core.c
>> @@ -5205,8 +5205,10 @@ perf_output_sample_regs(struct perf_output_handle *handle,
>>  			struct pt_regs *regs, u64 mask)
>>  {
>>  	int bit;
>> +	DECLARE_BITMAP(_mask, 64);
>>
>> -	for_each_set_bit(bit, (const unsigned long *) &mask,
>> +	bitmap_from_u64(_mask, mask);
>> +	for_each_set_bit(bit, _mask,
>>  			 sizeof(mask) * BITS_PER_BYTE) {
>>  		u64 val;
>> +++ b/lib/bitmap.c
>> +void bitmap_from_u64(unsigned long *dst, u64 mask)
>> +{
>> +	dst[0] = mask & ULONG_MAX;
>> +
>> +	if (sizeof(mask) > sizeof(unsigned long))
>> +		dst[1] = mask >> 32;
>> +}
>> +EXPORT_SYMBOL(bitmap_from_u64);
> Looks small enough for an inline.
>
> Alternatively you can go all the way and add bitmap_from_u64array(), but
> that seems massive overkill.
Ok will make it inline and resend.
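
Roughly, the inline version could look like the userspace approximation below.
This is only a sketch, not the resend itself: uint64_t stands in for u64, the
BITS_PER_LONG and BITS_TO_LONGS macros are local stand-ins for the kernel ones,
and bitmap_test_bit() is a hypothetical helper added just to check the result.

```c
#include <limits.h>
#include <stdint.h>

/* Userspace stand-ins for the kernel macros (assumptions for this sketch). */
#define BITS_PER_LONG	(sizeof(unsigned long) * CHAR_BIT)
#define BITS_TO_LONGS(n)	(((n) + BITS_PER_LONG - 1) / BITS_PER_LONG)

/*
 * Copy a 64-bit mask into an unsigned long bitmap, low word first.
 * When unsigned long is 64 bits wide, dst[0] holds everything and the
 * second store is compiled out.
 */
static inline void bitmap_from_u64(unsigned long *dst, uint64_t mask)
{
	dst[0] = mask & ULONG_MAX;

	if (sizeof(mask) > sizeof(unsigned long))
		dst[1] = mask >> 32;
}

/* Hypothetical helper: return 1 if the given bit is set in the bitmap. */
static inline int bitmap_test_bit(const unsigned long *map, unsigned int bit)
{
	return (map[bit / BITS_PER_LONG] >> (bit % BITS_PER_LONG)) & 1UL;
}
```

A caller would declare the destination as unsigned long m[BITS_TO_LONGS(64)],
which gives one word on 64-bit and two words on 32-bit, and then walk it with
the usual bit helpers.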
Maddy
>
> Tedious stuff.. I can't come up with anything prettier :/
>