[PATCH v1 0/3] powerpc: memcmp() optimization
wei.guo.simon at gmail.com
wei.guo.simon at gmail.com
Tue Sep 19 20:03:56 AEST 2017
From: Simon Guo <wei.guo.simon at gmail.com>
There is some room to optimize memcmp() in powerpc for following 2 cases:
(1) Even src/dst addresses are not aligned with 8 bytes at the beginning,
memcmp() can align them and go with .Llong comparision mode without
fallback to .Lshort comparision mode do compare buffer byte by byte.
(2) VMX instructions can be used to speed up for large size comparision.
This patch set also updates selftest case to make it compiled.
Simon Guo (3):
powerpc: Align bytes before fall back to .Lshort in powerpc memcmp
powerpc: enhance memcmp() with VMX instruction for long bytes
comparision
powerpc:selftest update memcmp selftest according to kernel change
arch/powerpc/include/asm/asm-prototypes.h | 2 +-
arch/powerpc/lib/copypage_power7.S | 2 +-
arch/powerpc/lib/memcmp_64.S | 165 ++++++++++++++++++++-
arch/powerpc/lib/memcpy_power7.S | 2 +-
arch/powerpc/lib/vmx-helper.c | 2 +-
.../selftests/powerpc/copyloops/asm/ppc_asm.h | 2 +-
.../selftests/powerpc/stringloops/asm/ppc_asm.h | 31 ++++
7 files changed, 197 insertions(+), 9 deletions(-)
--
1.8.3.1
More information about the Linuxppc-dev
mailing list