[RESEND PATCH V2 0/3] Allow user to request memory to be locked on page fault
Andrew Morton
akpm at linux-foundation.org
Thu Jun 11 07:59:29 AEST 2015
On Wed, 10 Jun 2015 09:26:47 -0400 Eric B Munson <emunson at akamai.com> wrote:
> mlock() allows a user to control page out of program memory, but this
> comes at the cost of faulting in the entire mapping when it is
s/mapping/locked area/
> allocated. For large mappings where the entire area is not necessary
> this is not ideal.
>
> This series introduces new flags for mmap() and mlockall() that allow a
> user to specify that the covered are should not be paged out, but only
> after the memory has been used the first time.
The comparison with MCL_FUTURE is hiding over in the 2/3 changelog.
It's important so let's copy it here.
: MCL_ONFAULT is preferrable to MCL_FUTURE for the use cases enumerated
: in the previous patch becuase MCL_FUTURE will behave as if each mapping
: was made with MAP_LOCKED, causing the entire mapping to be faulted in
: when new space is allocated or mapped. MCL_ONFAULT allows the user to
: delay the fault in cost of any given page until it is actually needed,
: but then guarantees that that page will always be resident.
I *think* it all looks OK. I'd like someone else to go over it also if
poss.
I guess the 2/3 changelog should have something like
: munlockall() will clear MCL_ONFAULT on all vma's in the process's VM.
It's pretty obvious, but the manpage delta should make this clear also.
Also the changelog(s) and manpage delta should explain that munlock()
clears MCL_ONFAULT.
And now I'm wondering what happens if userspace does
mmap(MAP_LOCKONFAULT) and later does munlock() on just part of that
region. Does the vma get split? Is this tested? Should also be in
the changelogs and manpage.
Ditto mlockall(MCL_ONFAULT) followed by munlock(). I'm not sure that
even makes sense but the behaviour should be understood and tested.
What's missing here is a syscall to set VM_LOCKONFAULT on an arbitrary
range of memory - mlock() for lock-on-fault. It's a shame that mlock()
didn't take a `mode' argument. Perhaps we should add such a syscall -
that would make the mmap flag unneeded but I suppose it should be kept
for symmetry.
More information about the Linuxppc-dev
mailing list