[PATCH v1 0/3] powerpc: memcmp() optimization

wei.guo.simon at gmail.com wei.guo.simon at gmail.com
Tue Sep 19 20:03:56 AEST 2017


From: Simon Guo <wei.guo.simon at gmail.com>

There is some room to optimize memcmp() in powerpc for following 2 cases:
(1) Even src/dst addresses are not aligned with 8 bytes at the beginning,
memcmp() can align them and go with .Llong comparision mode without
fallback to .Lshort comparision mode do compare buffer byte by byte.
(2) VMX instructions can be used to speed up for large size comparision.

This patch set also updates selftest case to make it compiled.


Simon Guo (3):
  powerpc: Align bytes before fall back to .Lshort in powerpc memcmp
  powerpc: enhance memcmp() with VMX instruction for long bytes
    comparision
  powerpc:selftest update memcmp selftest according to kernel change

 arch/powerpc/include/asm/asm-prototypes.h          |   2 +-
 arch/powerpc/lib/copypage_power7.S                 |   2 +-
 arch/powerpc/lib/memcmp_64.S                       | 165 ++++++++++++++++++++-
 arch/powerpc/lib/memcpy_power7.S                   |   2 +-
 arch/powerpc/lib/vmx-helper.c                      |   2 +-
 .../selftests/powerpc/copyloops/asm/ppc_asm.h      |   2 +-
 .../selftests/powerpc/stringloops/asm/ppc_asm.h    |  31 ++++
 7 files changed, 197 insertions(+), 9 deletions(-)

-- 
1.8.3.1



More information about the Linuxppc-dev mailing list