[PATCH 0/2] Faster MMU lookups for Book3s v3
avi at redhat.com
Thu Jul 1 23:42:14 EST 2010
On 07/01/2010 03:52 PM, Alexander Graf wrote:
>>> Don't you use lazy spte updates?
>> We do, but given enough time, the guest will touch its entire memory.
> Oh, so that's the major difference. On PPC we have the HTAB with a
> fraction of all the mapped pages in it. We don't have a notion of a full
> page table for a guest process. We always only have a snapshot of some
> mappings and shadow those lazily.
> So at worst, we have HPTEG_CACHE_NUM shadow pages mapped, which would be
> (1 << 15) * 4k, which again would be at most 128MB of guest memory. We
> can't hold more mappings than that anyways, so chances are low we have a
> mapping for each hva.
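The arithmetic quoted above can be checked with a minimal C sketch. HPTEG_CACHE_NUM and the 4k page size come from the thread; GUEST_PAGE_SIZE, max_shadowed_bytes() and shadow_coverage_percent() are hypothetical names used only for illustration, not identifiers from the patch.

```c
#include <assert.h>
#include <stdint.h>

/* Values taken from the discussion: (1 << 15) cache entries, 4k pages. */
#define HPTEG_CACHE_NUM (1u << 15)
#define GUEST_PAGE_SIZE 4096u /* hypothetical name for the 4k page size */

/* Upper bound on guest memory that can be shadowed at any one time. */
static uint64_t max_shadowed_bytes(void)
{
    return (uint64_t)HPTEG_CACHE_NUM * GUEST_PAGE_SIZE;
}

/*
 * Share of guest RAM that can be shadowed at once, in whole percent
 * (rounded down, capped at 100).
 */
static unsigned shadow_coverage_percent(uint64_t guest_ram_bytes)
{
    assert(guest_ram_bytes > 0);

    uint64_t max = max_shadowed_bytes();
    if (max >= guest_ram_bytes)
        return 100;
    return (unsigned)(max * 100 / guest_ram_bytes);
}
```

Under these assumptions a guest with 1GB of RAM can have at most about an eighth of its memory shadowed at any moment, which is why a mapping for an arbitrary hva is unlikely to exist.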
Doesn't that seriously impact performance? A guest that recycles pages
from its lru will touch pages at random from its entire address space.
On bare metal that isn't a problem (I imagine) due to large tlbs. But
virtualized on 4K pages that means the htlb will be thrashed.
>>> But then again I probably do need an rmap for the mmu_notifier magic,
>>> right? But I'd rather have that code path be slow and the
>>> dirty bitmap invalidation fast than the other way around. Swapping is
>>> slow either way.
>> It's not just swapping, it's also page ageing. That needs to be
>> fast. Does ppc have a hardware-set referenced bit? If so, you need a
>> fast rmap for mmu notifiers.
> Page ageing is difficult. The HTAB has a hardware set referenced bit,
> but we don't have a guarantee that the entry is still there when we look
> for it. Something else could have overwritten it by then, but the entry
> could still be lingering around in the TLB.
Whoever's dropping the HTAB entry needs to update the host struct page, and
also reflect the bit into the guest's HTAB, no?
In fact, on x86 shadow, we don't have an spte for a gpte that is not
accessed, precisely so we know the exact point in time when the accessed
bit is set.
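A toy model of that invariant, as a sketch only: the struct and function names below are hypothetical and this is not the KVM code. The only fact taken from the architecture is that the x86 PTE accessed bit is bit 5. The idea is that a shadow entry exists only while the guest entry is already marked accessed, so the first access always faults and the bit is set at exactly that point.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define GPTE_PRESENT  (1u << 0)
#define GPTE_ACCESSED (1u << 5) /* x86 PTE accessed bit */

/* Hypothetical pairing of one guest PTE with its shadow entry. */
struct toy_mapping {
    uint32_t gpte;       /* guest page table entry */
    bool     spte_valid; /* whether a shadow entry currently exists */
};

/* Model of a guest access. Returns true if it took a shadow fault. */
static bool toy_guest_access(struct toy_mapping *m)
{
    assert(m->gpte & GPTE_PRESENT);

    if (m->spte_valid)
        return false;         /* already shadowed: no fault */

    m->gpte |= GPTE_ACCESSED; /* bit set at the moment of first access */
    m->spte_valid = true;     /* only now install the shadow entry */
    return true;
}

/*
 * Model of the guest clearing the accessed bit for page ageing. The
 * shadow entry is dropped so that the next access faults again.
 */
static void toy_guest_clear_accessed(struct toy_mapping *m)
{
    m->gpte &= ~GPTE_ACCESSED;
    m->spte_valid = false;
}
```

In this model, an access after toy_guest_clear_accessed() faults once and sets the bit again, while repeated accesses in between take no fault.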
> So I think the only reasonable way to implement page ageing is to unmap
> pages. And that's slow, because it means we have to map them again on
> access. Bleks. Or we could look for the HTAB entry and only unmap them
> if the entry is moot.
I think it works out if you update struct page when you clear out an HTAB entry.
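Roughly, as a sketch with made-up types (none of these names exist in the kernel or in the patch): when a shadow HTAB entry is dropped, its referenced bit is folded into both the host page and the guest's HTAB entry before the slot is reused, so the information is not lost when something overwrites the entry.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for host struct page and a guest HTAB entry. */
struct toy_host_page  { bool referenced; };
struct toy_guest_hpte { bool referenced; };

/* Hypothetical shadow HTAB entry with back-pointers to both. */
struct toy_shadow_hpte {
    bool valid;
    bool referenced;              /* set by hardware in the real HTAB */
    struct toy_host_page  *page;  /* host page backing the mapping */
    struct toy_guest_hpte *guest; /* guest entry being shadowed */
};

/* Drop a shadow entry, propagating its referenced bit first. */
static void toy_drop_shadow_hpte(struct toy_shadow_hpte *s)
{
    assert(s != NULL);

    if (!s->valid)
        return;

    if (s->referenced) {
        s->page->referenced = true;  /* feeds host page ageing */
        s->guest->referenced = true; /* reflect into the guest's HTAB */
    }

    /*
     * The translation may still linger in the TLB, as noted earlier in
     * the thread; that is not modeled here.
     */
    s->valid = false;
    s->referenced = false;
}
```

With this in place, page ageing only has to look at entries that are still present plus whatever was already recorded in the host page at drop time.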
error compiling committee.c: too many arguments to function