[PATCH 1/2] powerpc: string: implement optimized memset variants

Michael Ellerman mpe at ellerman.id.au
Tue Apr 18 16:45:00 AEST 2017


Michael Ellerman <mpe at ellerman.id.au> writes:

> "Naveen N. Rao" <naveen.n.rao at linux.vnet.ibm.com> writes:
>> (generic) is with Matt's arch-independent patches applied. Profiling 
>> indicates that most of the overhead is actually with the lzo 
>> decompression...
>>
>> Also, with a simple module to memset64() a 1GB vmalloc'ed buffer, here 
>> are the results:
>> generic:	0.245315533 seconds time elapsed	( +-  1.83% )
>> optimized:	0.169282701 seconds time elapsed	( +-  1.96% )
>
> Great, that's pretty conclusive.
>
> I'm pretty sure I can take these 2 patches independently of Matt's
> series, they just won't be used by much until his series goes in, so
> I'll do that unless someone yells.

Hmm, just went to merge these, but I don't see Matt's series in
linux-next, so I'll hold off for now.

cheers


More information about the Linuxppc-dev mailing list