[Cbe-oss-dev] [RFC 0/3] powerpc: memory copy routines tweaked for Cell

Gunnar von Boehn VONBOEHN at de.ibm.com
Thu Jun 19 22:11:22 EST 2008


Hi Arnd,

I have no results for P5/P6, but I did some tests on a JS21, i.e. PPC-970.
On PPC-970 the Cell memcpy is faster than the current Linux routine.
The difference becomes clearly visible when you actually copy memory to
memory and are not only working within the second-level cache.


Kind regards

Gunnar von Boehn




                                                                           
From:    Arnd Bergmann <arnd at arndb.de>
Date:    19/06/2008 13:53
To:      linuxppc-dev at ozlabs.org
Cc:      Mark Nelson <markn at au1.ibm.com>, cbe-oss-dev at ozlabs.org,
         Gunnar von Boehn/Germany/Contr/IBM at IBMDE,
         Michael Ellerman <ellerman at au1.ibm.com>
Subject: Re: [RFC 0/3] powerpc: memory copy routines tweaked for Cell

On Thursday 19 June 2008, Mark Nelson wrote:
> The plan is to use Michael Ellerman's code patching work so that at runtime
> if we're running on a Cell machine the new routines are called but otherwise
> the existing memory copy routines are used.

Have you tried running this code on other platforms to see if it
actually performs worse on any of them? I would guess that the
older code also doesn't work too well on Power 5 and Power 6, so the
Cell-optimized version could give us a significant advantage as well,
albeit less than another CPU-specific version.

             Arnd <><




