[Skiboot] [PATCH v2 0/3] Support PCI Virtual Device
Gavin Shan
gwshan at linux.vnet.ibm.com
Thu Sep 8 15:46:45 AEST 2016
On Wed, Sep 07, 2016 at 06:04:16PM +1000, Gavin Shan wrote:
>On Wed, Sep 07, 2016 at 04:50:10PM +1000, Alistair Popple wrote:
>>Hi Gavin,
>>
>>On Thu, 11 Aug 2016 12:12:35 PM Gavin Shan wrote:
>>> The patches support PCI virtual device. The NVLinks are presented to Linux
>>> kernel via PCI virtual (emulated) device. As we're going to support NVLink2
>>> pretty soon, we need support PCI virtual device in common layer so that it
>>> can be shared by NVLINK1/2.
>>>
>>> Changelog
>>> =========
>>> v1 -> v2:
>>> * Rebased to master branch per requested by Stewart. No code changes
>>> since v1. The last commit is f102c9d19078 ("Merge skiboot-5.3.1
>>> release notes").
>>> * Had quick test on Garrison1 and no obvious issues found.
>>
>>We should run actual GPU workloads to test any NPU changes (eg. a CUDA
>>bandwidth test - let me know if you're not sure how). A simple boot test is
>>not enough as it won't exercise many of the code paths that have been changed
>>here. It seems patch 3 breaks something as the nVidia driver no longer works
>>with nvlink on master - could you please take a look at this? Thanks!
>>
>>Stewart - we may need to revert "[PATCH v2 3/3] hw/npu: Use PCI virtual
>>device" if we can't get this fixed soon - will let you know.
>>
>
>Yeah, agree. I will talk to you about how to run CUDA bandwidth
>test tomorrow morning when you have available time. After that,
>I will investigate what's broken by PATCH[3/3]. I assume the
>issue is existing on Garrison only.
>
Another patch has been sent to fix an error in the PCI config filter. With
it applied, the bandwidth test succeeds. Alistair, could you please confirm
whether the result is close to the expected one? It seems we have some
performance loss, which I can sort out later.
https://patchwork.ozlabs.org/patch/667255/
---
garrison1# /home/alistair/NVIDIA_CUDA-7.5_Samples/1_Utilities/bandwidthTest/bandwidthTest --memory=pinned
[CUDA Bandwidth Test] - Starting...
Running on...

 Device 0: Graphics Device
 Quick Mode

 Host to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)    Bandwidth(MB/s)
   33554432                 29467.5

 Device to Host Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)    Bandwidth(MB/s)
   33554432                 21323.7

 Device to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)    Bandwidth(MB/s)
   33554432                 345036.7

Result = PASS
NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
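
To compare a run like the one above against a run on a known-good build, the three bandwidth figures can be pulled out of the text and diffed. The following is only a minimal sketch in Python, assuming the output keeps the layout shown above; the helper names parse_bandwidth and relative_change are hypothetical and are not part of the CUDA samples or skiboot.

```python
import re


def parse_bandwidth(text):
    """Map each transfer direction to its reported bandwidth in MB/s.

    Hypothetical helper: assumes each "<direction> Bandwidth, N Device(s)"
    header is followed later by one "<bytes> <MB/s>" data row.
    """
    results = {}
    direction = None
    for raw in text.splitlines():
        line = raw.strip()
        header = re.match(r"(.+) Bandwidth, \d+ Device\(s\)$", line)
        if header:
            direction = header.group(1)
            continue
        row = re.match(r"(\d+)\s+([\d.]+)$", line)
        if row and direction is not None:
            results[direction] = float(row.group(2))
            direction = None
    return results


def relative_change(before, after):
    """Fractional change per direction; a negative value means a slowdown."""
    return {k: (after[k] - before[k]) / before[k] for k in before if k in after}
```

No baseline figures are given in this thread, so the "before" values would have to come from a run on a build without the patches.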
Thanks,
Gavin