[PATCH kernel] vfio/spapr: Add trace points for map/unmap
Robin Murphy
robin.murphy at arm.com
Thu Nov 30 03:43:46 AEDT 2017
On 29/11/17 16:32, Alex Williamson wrote:
> On Thu, 23 Nov 2017 15:13:37 +1100
> Alexey Kardashevskiy <aik at ozlabs.ru> wrote:
>
>> On 17/11/17 17:58, Alexey Kardashevskiy wrote:
>>> On 17/11/17 11:13, Alex Williamson wrote:
>>>> On Tue, 14 Nov 2017 10:47:12 +1100
>>>> Alexey Kardashevskiy <aik at ozlabs.ru> wrote:
>>>>
>>>>> On 27/10/17 14:00, Alexey Kardashevskiy wrote:
>>>>>> This adds trace_map/trace_unmap tracepoints to spapr driver. Type1 already
>>>>>> uses these via the IOMMU API (iommu_map/__iommu_unmap).
>>>>>>
>>>>>> Signed-off-by: Alexey Kardashevskiy <aik at ozlabs.ru>
>>>>
>>>> Is this really legitimate to include tracepoints from a different
>>>> subsystem?> The vfio type1 backend gets these trace points by virtue of
>>>> it actually using the IOMMU API, it doesn't call them itself. I'm kind
>>>> of surprised these are actually available to be called from a module.
>>>
>>> They are explicitly exported:
>>>
>>> EXPORT_TRACEPOINT_SYMBOL_GPL(map);
>>> EXPORT_TRACEPOINT_SYMBOL_GPL(unmap);
>>>
>>> I would think this is for things like drivers/vfio/vfio_iommu_spapr_tce.c ,
>>> why else?...
>>>
>>>
>>>> I suspect the way to do this is probably to define our own tracepoints
>>>> in the vfio/spapr backend or insert tracepoints into the IOMMU layers
>>>> that that code calls into rather than masquerading as tracepoints from
>>>> a different subsystem. Right?
>>>
>>> This makes sense too. But it is going to be just cut-n-paste and some
>>> confusion -
>>> /sys/kernel/debug/tracing/events/iommu/map will still be present but
>>> won't work and
>>> /sys/kernel/debug/tracing/events/vfio/vfio_iommu_spapr_tce/map will.
>
> But iommu/map does work, it's just not called by the vfio spapr tce
> backend, which is absolutely correct. In fact, that's part of my
> reservation about this approach is that it could be interpreted as
> implying a call path that doesn't exist on this arch.
Come to think of it, the inverse also applies - if users are led to
expect x86-like behaviour where the iommu/map tracepoint just happens to
reflect VFIO calls, they'll likely also get a nasty surprise on
platforms like arm64 where regular DMA API calls by drivers may be
hammering iommu_map() in the background.
Robin.
>>> Judges? :)
>>
>>
>> Still nak? I discussed this locally, the conclusion was it is a matter of
>> taste and this proposal is not that disgusting. Thanks.
>
> Is that our goal now, proposals that aren't too disgusting? I think
> it's an opportunity to introduce our own tracing infrastructure and it
> feels a lot more correct to do that than to generalize existing trace
> points from other subsystems. Thanks,
>
> Alex
>
>
>>>>>> ---
>>>>>>
>>>>>> Example:
>>>>>> qemu-system-ppc-8655 [096] 724.662740: unmap: IOMMU: iova=0x000000003ffff000 size=4096 unmapped_size=4096
>>>>>> qemu-system-ppc-8656 [104] 724.970912: map: IOMMU: iova=0x0800000000000000 paddr=0x00007ffef7ff0000 size=65536
>>>>>> ---
>>>>>> drivers/vfio/vfio_iommu_spapr_tce.c | 12 ++++++++++--
>>>>>> 1 file changed, 10 insertions(+), 2 deletions(-)
>>>>>>
>>>>>> diff --git a/drivers/vfio/vfio_iommu_spapr_tce.c b/drivers/vfio/vfio_iommu_spapr_tce.c
>>>>>> index 63112c36ab2d..4531486c77c6 100644
>>>>>> --- a/drivers/vfio/vfio_iommu_spapr_tce.c
>>>>>> +++ b/drivers/vfio/vfio_iommu_spapr_tce.c
>>>>>> @@ -22,6 +22,7 @@
>>>>>> #include <linux/vmalloc.h>
>>>>>> #include <linux/sched/mm.h>
>>>>>> #include <linux/sched/signal.h>
>>>>>> +#include <trace/events/iommu.h>
>>>>>>
>>>>>> #include <asm/iommu.h>
>>>>>> #include <asm/tce.h>
>>>>>> @@ -502,17 +503,19 @@ static int tce_iommu_clear(struct tce_container *container,
>>>>>> struct iommu_table *tbl,
>>>>>> unsigned long entry, unsigned long pages)
>>>>>> {
>>>>>> - unsigned long oldhpa;
>>>>>> + unsigned long oldhpa, unmapped, firstentry = entry, totalpages = pages;
>>>>>> long ret;
>>>>>> enum dma_data_direction direction;
>>>>>>
>>>>>> - for ( ; pages; --pages, ++entry) {
>>>>>> + for (unmapped = 0; pages; --pages, ++entry) {
>>>>>> direction = DMA_NONE;
>>>>>> oldhpa = 0;
>>>>>> ret = iommu_tce_xchg(tbl, entry, &oldhpa, &direction);
>>>>>> if (ret)
>>>>>> continue;
>>>>>>
>>>>>> + ++unmapped;
>>>>>> +
>>>>>> if (direction == DMA_NONE)
>>>>>> continue;
>>>>>>
>>>>>> @@ -523,6 +526,9 @@ static int tce_iommu_clear(struct tce_container *container,
>>>>>>
>>>>>> tce_iommu_unuse_page(container, oldhpa);
>>>>>> }
>>>>>> + trace_unmap(firstentry << tbl->it_page_shift,
>>>>>> + totalpages << tbl->it_page_shift,
>>>>>> + unmapped << tbl->it_page_shift);
>>>>>>
>>>>>> return 0;
>>>>>> }
>>>>>> @@ -965,6 +971,8 @@ static long tce_iommu_ioctl(void *iommu_data,
>>>>>> direction);
>>>>>>
>>>>>> iommu_flush_tce(tbl);
>>>>>> + if (!ret)
>>>>>> + trace_map(param.iova, param.vaddr, param.size);
>>>>>>
>>>>>> return ret;
>>>>>> }
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>>
>>
>>
>
> _______________________________________________
> iommu mailing list
> iommu at lists.linux-foundation.org
> https://lists.linuxfoundation.org/mailman/listinfo/iommu
>
More information about the Linuxppc-dev
mailing list