[PATCH] powerpc/64s: optimise syscall entry for virtual, relocatable case

Balbir Singh bsingharora at gmail.com
Tue Sep 20 14:00:23 AEST 2016



On 15/09/16 19:03, Nicholas Piggin wrote:
> The mflr r10 instruction was left over saving of lr when the code used
> lr to branch to system_call_entry from the exception handler. That was
> changed by 6a404806d to use the count register. The value is never used
> now, so mflr can be removed, and r10 can be used for storage rather than
> spilling to the SPR scratch register.
> 
> The scratch register spill causes a long pipeline stall due to the SPR
> read after write. This change brings getppid syscall cost from 406 to
> 376 cycles on POWER8. getppid for non-relocatable case is 371 cycles.
> 
> Signed-off-by: Nicholas Piggin <npiggin at gmail.com>
> ---
> 
>  arch/powerpc/kernel/exceptions-64s.S | 7 ++-----
>  1 file changed, 2 insertions(+), 5 deletions(-)
> 
> diff --git a/arch/powerpc/kernel/exceptions-64s.S b/arch/powerpc/kernel/exceptions-64s.S
> index df6d45e..2cdd64f 100644
> --- a/arch/powerpc/kernel/exceptions-64s.S
> +++ b/arch/powerpc/kernel/exceptions-64s.S
> @@ -63,15 +63,12 @@ END_FTR_SECTION_IFSET(CPU_FTR_REAL_LE)				\
>  	 * is volatile across system calls.
>  	 */
>  #define SYSCALL_PSERIES_2_DIRECT				\
> -	mflr	r10 ;						\
>  	ld	r12,PACAKBASE(r13) ; 				\
>  	LOAD_HANDLER(r12, system_call_entry) ;			\
>  	mtctr	r12 ;						\
>  	mfspr	r12,SPRN_SRR1 ;					\
> -	/* Re-use of r13... No spare regs to do this */	\
> -	li	r13,MSR_RI ;					\
> -	mtmsrd 	r13,1 ;						\
> -	GET_PACA(r13) ;	/* get r13 back */			\
> +	li	r10,MSR_RI ;					\
> +	mtmsrd 	r10,1 ;						\
>  	bctr ;
>  #else
>  	/* We can branch directly */
> 

The patch makes sense

Acked-by: Balbir Singh <bsingharora at gmail.com>


More information about the Linuxppc-dev mailing list