[PATCH v6 17/18] arm64/mm: __always_inline to improve fork() perf

Catalin Marinas catalin.marinas at arm.com
Fri Feb 16 23:34:23 AEDT 2024


On Thu, Feb 15, 2024 at 10:32:04AM +0000, Ryan Roberts wrote:
> As set_ptes() and wrprotect_ptes() become a bit more complex, the
> compiler may choose not to inline them. But this is critical for fork()
> performance. So mark the functions, along with contpte_try_unfold()
> which is called by them, as __always_inline. This is worth ~1% on the
> fork() microbenchmark with order-0 folios (the common case).
> 
> Acked-by: Mark Rutland <mark.rutland at arm.com>
> Signed-off-by: Ryan Roberts <ryan.roberts at arm.com>

Acked-by: Catalin Marinas <catalin.marinas at arm.com>


More information about the Linuxppc-dev mailing list