[Cbe-oss-dev] [RFC 3/3] powerpc: copy_4K_page tweaked for Cell
Mark Nelson
markn at au1.ibm.com
Fri Jun 20 12:25:02 EST 2008
On Fri, 20 Jun 2008 07:28:50 am Arnd Bergmann wrote:
> On Thursday 19 June 2008, Mark Nelson wrote:
> > .align 7
> > _GLOBAL(copy_4K_page)
> > dcbt 0,r4 /* Prefetch ONE SRC cacheline */
> >
> > addi r6,r3,-8 /* prepare for stdu */
> > addi r4,r4,-8 /* prepare for ldu */
> >
> > li r10,32 /* copy 32 cache lines for a 4K page */
> > li r12,128+8 /* prefetch distance*/
>
> Since you have a loop here anyway instead of the fully unrolled
> code, why not provide a copy_64K_page function as well, jumping in
> here?
That is a good idea. What effect will that have on how the code
patching will work?
>
> The inline 64k copy_page function otherwise just adds code size,
> as well as being a tiny bit slower. It may even be good to
> have an out-of-line copy_64K_page for the regular code, just
> calling copy_4K_page repeatedly.
Doing that sounds like it'll make the code patching easier.
Thanks!
Mark
More information about the Linuxppc-dev
mailing list