[PATCH 1/2] powerpc: string: implement optimized memset variants
PrasannaKumar Muralidharan
prasannatsmkumar at gmail.com
Wed Apr 5 15:51:09 AEST 2017
On 30 March 2017 at 12:46, Naveen N. Rao
<naveen.n.rao at linux.vnet.ibm.com> wrote:
> Also, with a simple module to memset64() a 1GB vmalloc'ed buffer, here
> are the results:
> generic: 0.245315533 seconds time elapsed ( +- 1.83% )
> optimized: 0.169282701 seconds time elapsed ( +- 1.96% )
I wonder what keeps gcc from producing efficient assembly code here.
Could you please post the disassembly of the C implementation of
memset64? Just for information.
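
For context, the generic C version is presumably a plain store loop
along the lines of the sketch below. This is a userspace approximation
written from memory, not a copy of the kernel source, and the name
memset64_generic is made up for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Userspace sketch of a generic memset64: store the 64-bit value v
 * into count consecutive slots starting at s, one store per iteration.
 * The name memset64_generic is hypothetical, not a kernel symbol.
 */
void *memset64_generic(uint64_t *s, uint64_t v, size_t count)
{
	uint64_t *xs = s;

	while (count--)
		*xs++ = v;
	return s;
}
```

Compiling something like this with "gcc -O2 -S" should show what the
compiler generates for the loop, which could then be compared against
the hand-written assembly.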
Thanks,
Prasanna