[PATCH 1/2] powerpc: string: implement optimized memset variants

Michael Ellerman mpe at ellerman.id.au
Tue Apr 4 22:00:07 AEST 2017


"Naveen N. Rao" <naveen.n.rao at linux.vnet.ibm.com> writes:
> (generic) is with Matt's arch-independent patches applied. Profiling 
> indicates that most of the overhead is actually with the lzo 
> decompression...
>
> Also, with a simple module to memset64() a 1GB vmalloc'ed buffer, here 
> are the results:
> generic:	0.245315533 seconds time elapsed	( +-  1.83% )
> optimized:	0.169282701 seconds time elapsed	( +-  1.96% )

Great, that's pretty conclusive.

I'm pretty sure I can take these 2 patches independently of Matt's
series, they just won't be used by much until his series goes in, so
I'll do that unless someone yells.

cheers


More information about the Linuxppc-dev mailing list