[PATCH 3/6] KVM: PPC: Book3E: Increase FPU laziness
Scott Wood
scottwood at freescale.com
Thu Jul 4 03:17:34 EST 2013
On 07/03/2013 11:59:45 AM, Alexander Graf wrote:
>
> On 03.07.2013, at 17:41, Caraman Mihai Claudiu-B02008 wrote:
>
> >>>>> Increase FPU laziness by calling kvmppc_load_guest_fp() just
> before
> >>>>> returning to guest instead of each sched in. Without this
> improvement
> >>>>> an interrupt may also claim floting point corrupting guest
> state.
> >>>>
> >>>> Not sure I follow. Could you please describe exactly what's
> happening?
> >>>
> >>> This was already discussed on the list, I will forward you the
> thread.
> >>
> >> The only thing I've seen in that thread was some pathetic
> theoretical
> >> case where an interrupt handler would enable fp and clobber state
> >> carelessly. That's not something I'm worried about.
> >
> > Neither me though I don't find it pathetic. Please refer it to
> Scott.
>
> If from Linux's point of view we look like a user space program with
> active floating point registers, we don't have to worry about this
> case. Kernel code that would clobber that fp state would clobber
> random user space's fp state too.
This patch makes it closer to how it works with a user space program.
Or rather, it reduces the time window when we don't (and can't) act
like a normal userspace program -- and ensures that we have interrupts
disabled during that window. An interrupt can't randomly clobber FP
state; it has to call enable_kernel_fp() just like KVM does.
enable_kernel_fp() clears the userspace MSR_FP to ensure that the state
it saves gets restored before userspace uses it again, but that won't
have any effect on guest execution (especially in HV-mode). Thus
kvmppc_load_guest_fp() needs to be atomic with guest entry.
Conceptually it's like taking an automatic FP unavailable trap when we
enter the guest, since we can't be lazy in HV-mode.
> >> I really don't see where this patch improves anything tbh. It
> certainly
> >> makes the code flow more awkward.
> >
> > I was pointing you to this: The idea of FPU/AltiVec laziness that
> the kernel
> > is struggling to achieve is to reduce the number of store/restore
> operations.
> > Without this improvement we restore the unit each time we are sched
> it. If an
> > other process take the ownership of the unit (on SMP it's even
> worse but don't
> > bother with this) the kernel store the unit state to qemu task.
> This can happen
> > multiple times during handle_exit().
> >
> > Do you see it now?
>
> Yup. Looks good. The code flow is very hard to follow though - there
> are a lot of implicit assumptions that don't get documented anywhere.
> For example the fact that we rely on giveup_fpu() to remove MSR_FP
> from our thread.
That's not new to this patch...
-Scott
More information about the Linuxppc-dev
mailing list