[Skiboot] [PATCH v2 0/3] Support PCI Virtual Device

Alistair Popple alistair at popple.id.au
Thu Sep 8 17:04:19 AEST 2016


On Thu, 8 Sep 2016 03:46:45 PM Gavin Shan wrote:
> On Wed, Sep 07, 2016 at 06:04:16PM +1000, Gavin Shan wrote:
> >On Wed, Sep 07, 2016 at 04:50:10PM +1000, Alistair Popple wrote:
> >>Hi Gavin,
> >>
> >>On Thu, 11 Aug 2016 12:12:35 PM Gavin Shan wrote:
> >>> The patches support PCI virtual device. The NVLinks are presented to 
Linux
> >>> kernel via PCI virtual (emulated) device. As we're going to support 
NVLink2
> >>> pretty soon, we need support PCI virtual device in common layer so that 
it
> >>> can be shared by NVLINK1/2.
> >>> 
> >>> Changelog
> >>> =========
> >>> v1 -> v2:
> >>>    * Rebased to master branch per requested by Stewart. No code changes
> >>>      since v1. The last commit is f102c9d19078 ("Merge skiboot-5.3.1
> >>>      release notes").
> >>>    * Had quick test on Garrison1 and no obvious issues found.
> >>
> >>We should run actual GPU workloads to test any NPU changes (eg. a CUDA 
> >>bandwidth test - let me know if you're not sure how). A simple boot test 
is 
> >>not enough as it won't exercise many of the code paths that have been 
changed 
> >>here. It seems patch 3 breaks something as the nVidia driver no longer 
works 
> >>with nvlink on master - could you please take a look at this? Thanks!
> >>
> >>Stewart - we may need to revert "[PATCH v2 3/3] hw/npu: Use PCI virtual 
> >>device" if we can't get this fixed soon - will let you know.
> >>
> >
> >Yeah, agree. I will talk to you about how to run CUDA bandwidth
> >test tomorrow morning when you have available time. After that,
> >I will investigate what's broken by PATCH[3/3]. I assume the
> >issue is existing on Garrison only.
> >
> 
> Another patch was sent to fix something incorrect in PCI config filter. With
> it's applied, the bandwidth testing succeeded. Alistair, please help confirm
> the result is close to the expected one? It seems we have some performance
> lost and I can sort it out later.

That's strange. There's no obvious reason why these patches should impact 
nvlink bandwidth at all.

> https://patchwork.ozlabs.org/patch/667255/
> 
> ---
> 
> garrison1# 
/home/alistair/NVIDIA_CUDA-7.5_Samples/1_Utilities/bandwidthTest/bandwidthTest 
--memory=pinned
> [CUDA Bandwidth Test] - Starting...
> Running on...
> 
>  Device 0: Graphics Device
>  Quick Mode
> 
>  Host to Device Bandwidth, 1 Device(s)
>  PINNED Memory Transfers
>    Transfer Size (Bytes)	Bandwidth(MB/s)
>    33554432			29467.5
> 
>  Device to Host Bandwidth, 1 Device(s)
>  PINNED Memory Transfers
>    Transfer Size (Bytes)	Bandwidth(MB/s)
>    33554432			21323.7
> 
>  Device to Device Bandwidth, 1 Device(s)
>  PINNED Memory Transfers
>    Transfer Size (Bytes)	Bandwidth(MB/s)
>    33554432			345036.7
> 
> Result = PASS
> 
> NOTE: The CUDA Samples are not meant for performance measurements. Results 
may vary when GPU Boost is enabled.
> 
> Thanks,
> Gavin
> 



More information about the Skiboot mailing list