glibc: optimized ppc bcopy

Rob Latham rob at
Sat Apr 12 07:04:49 EST 2003

i noticed something when comparing lmbench numbers between os x and
linux on the same hardware:  linux beats os x at every category except
one: Bcopy (libc)

*Local* Communication bandwidths in MB/s - bigger is better
Host                OS  Pipe AF    TCP  File   Mmap  Bcopy  Bcopy  Mem   Mem
                             UNIX      reread reread (libc) (hand) read write
--------- ------------- ---- ---- ---- ------ ------ ------ ------ ---- -----
aragorn   Linux 2.4.20- 198. 216. 90.1  206.5  173.1  122.8  124.2 173. 358.3
os-x         Darwin 6.4 124. 121. 80.5  150.6  178.1  239.8  123.4 178. 411.9

So i looked a bit closer at glibc: there are no optimized powerpc
string or memory operations.  ( later confirmed by the glibc web

I know there are a zillion powerpc variants:  would it be hard to
write assembly that works with all of them?  I know almost zero about
powerpc assembly, but this might be a fun place to start learning.  Of
course, if anyone else has already started such an undertaking, i'll
defer to them and go work on something else.

For those curious, the full lmbench run can be found here:
(linux does quite well :> )


Rob Latham                                        Chicago, IL USA

** Sent via the linuxppc-dev mail list. See

More information about the Linuxppc-dev mailing list