[Skiboot] [PATCH v2 0/3] Support PCI Virtual Device
Alistair Popple
alistair at popple.id.au
Thu Sep 8 11:03:39 AEST 2016
On Wed, 7 Sep 2016 06:13:39 PM Stewart Smith wrote:
> Alistair Popple <alistair at popple.id.au> writes:
> > On Thu, 11 Aug 2016 12:12:35 PM Gavin Shan wrote:
> >> The patches support PCI virtual device. The NVLinks are presented to
Linux
> >> kernel via PCI virtual (emulated) device. As we're going to support
NVLink2
> >> pretty soon, we need support PCI virtual device in common layer so that
it
> >> can be shared by NVLINK1/2.
> >>
> >> Changelog
> >> =========
> >> v1 -> v2:
> >> * Rebased to master branch per requested by Stewart. No code changes
> >> since v1. The last commit is f102c9d19078 ("Merge skiboot-5.3.1
> >> release notes").
> >> * Had quick test on Garrison1 and no obvious issues found.
> >
> > We should run actual GPU workloads to test any NPU changes (eg. a CUDA
> > bandwidth test - let me know if you're not sure how). A simple boot test
is
> > not enough as it won't exercise many of the code paths that have been
changed
> > here. It seems patch 3 breaks something as the nVidia driver no longer
works
> > with nvlink on master - could you please take a look at this? Thanks!
>
> I'm thinking we need something in op-test-framework that runs things
> like a CUDA workload (if possible), FWTS and a smoke test with HTX.
I believe at least one version of HTX includes some CUDA tests.
> However... that does run into the whole problem of having an OS to run
> all of these packages on, which creates a bit more work to make it
> happen.
Which for nVidia is even more fun as we need to have the out-of-tree driver
complied against the kernel. Although buildroot does have a package for
that...
> Do we have a victim^Wvolunteer to maybe work on that? :)
... which leads me to dob Sam B in :)
I know at one point he was working on a buildroot based system for testing
qemu/kvm. Not sure how far he got but it might be a useful base as having
automated testing for this stuff would be wonderful.
Regards,
Alistair
>
> > Stewart - we may need to revert "[PATCH v2 3/3] hw/npu: Use PCI virtual
> > device" if we can't get this fixed soon - will let you know.
>
> ack.
>
>
More information about the Skiboot
mailing list