82xx performance

Rune Torgersen runet at innovsys.com
Tue Jul 15 02:34:03 EST 2008


Hi

We are looking into switching kernels from 2.6.18 (ppc) to 2.6.25
(powerpc).
I have been trying to run some benchmarks to see how the new kernel
compares to the old one.

So far it is performing worse.

One test I ran was just compiling a 2.6.18 kernel on the system.
The .25 performed 5 to 7 % slower:

2.6.18, make vmlinux
real    74m1.328s
user    68m48.196s
sys     4m35.961s

2.6.25, make vmlinux
real    79m13.361s
user    72m41.318s
sys     5m46.744s

I also ran lmbench3. (slightly outdated, but still works)
Most (if not all) results are worse on .25, especially context
switching.

Is this expected behaviour or is there anything I need to look at in my
config?
(I'll send config if anybody is interested)


                 L M B E N C H  3 . 0   S U M M A R Y
                 ------------------------------------
                 (Alpha software, do not distribute)

Basic system parameters
------------------------------------------------------------------------
------
Host                 OS Description              Mhz  tlb  cache  mem
scal
                                                     pages line   par
load
                                                           bytes
--------- ------------- ----------------------- ---- ----- ----- ------
----
9919_unit  Linux 2.6.25       powerpc-linux-gnu  434    32    32 1.0000
1
9919_unit  Linux 2.6.18       powerpc-linux-gnu  445    32    32 1.0100
1

Processor, Processes - times in microseconds - smaller is better
------------------------------------------------------------------------
------
Host                 OS  Mhz null null      open slct sig  sig  fork
exec sh
                             call  I/O stat clos TCP  inst hndl proc
proc proc
--------- ------------- ---- ---- ---- ---- ---- ---- ---- ---- ----
---- ----
9919_unit  Linux 2.6.25  434 0.47 1.26 10.7 35.6 34.1 1.76 14.3 2646
9964 33.K
9919_unit  Linux 2.6.18  445 0.35 1.24 9.27 22.9 32.7 1.87 13.8 2157
7825 26.K

Basic integer operations - times in nanoseconds - smaller is better
-------------------------------------------------------------------
Host                 OS  intgr intgr  intgr  intgr  intgr
                          bit   add    mul    div    mod
--------- ------------- ------ ------ ------ ------ ------
9919_unit  Linux 2.6.25 2.3300 0.0100   10.7   46.2   56.0
9919_unit  Linux 2.6.18 2.2300 0.0100   10.3   45.4   54.1

Basic float operations - times in nanoseconds - smaller is better
-----------------------------------------------------------------
Host                 OS  float  float  float  float
                         add    mul    div    bogo
--------- ------------- ------ ------ ------ ------
9919_unit  Linux 2.6.25 9.9500   10.1   46.2   66.2
9919_unit  Linux 2.6.18 9.1100 9.0800   45.8   67.1

Basic double operations - times in nanoseconds - smaller is better
------------------------------------------------------------------
Host                 OS  double double double double
                         add    mul    div    bogo
--------- ------------- ------  ------ ------ ------
9919_unit  Linux 2.6.25 9.3400   11.6   78.6  100.2
9919_unit  Linux 2.6.18 9.1600   11.1   77.2   97.8

Context switching - times in microseconds - smaller is better
------------------------------------------------------------------------
-
Host                 OS  2p/0K 2p/16K 2p/64K 8p/16K 8p/64K 16p/16K
16p/64K
                         ctxsw  ctxsw  ctxsw ctxsw  ctxsw   ctxsw
ctxsw
--------- ------------- ------ ------ ------ ------ ------ -------
-------
9919_unit  Linux 2.6.25   20.6   86.2   28.5  103.8   38.7   111.8
57.4
9919_unit  Linux 2.6.18 5.3300   63.2   17.9   73.4   23.1    74.9
26.2

*Local* Communication latencies in microseconds - smaller is better
---------------------------------------------------------------------
Host                 OS 2p/0K  Pipe AF     UDP  RPC/   TCP  RPC/ TCP
                        ctxsw       UNIX         UDP         TCP conn
--------- ------------- ----- ----- ---- ----- ----- ----- ----- ----
9919_unit  Linux 2.6.25  20.6  68.8 131. 353.1 533.4 461.7       1269
9919_unit  Linux 2.6.18 5.330  36.1 87.8 225.3 402.7 331.8 520.1 970.

File & VM system latencies in microseconds - smaller is better
------------------------------------------------------------------------
-------
Host                 OS   0K File      10K File     Mmap    Prot   Page
100fd
                        Create Delete Create Delete Latency Fault  Fault
selct
--------- ------------- ------ ------ ------ ------ ------- -----
------- -----
9919_unit  Linux 2.6.25  222.3  172.4 1003.0  350.5   41.5K 1.734
10.5  18.0
9919_unit  Linux 2.6.18  181.5  144.3  789.3  293.9   23.9K
7.09560  19.3

*Local* Communication bandwidths in MB/s - bigger is better
------------------------------------------------------------------------
-----
Host                OS  Pipe AF    TCP  File   Mmap  Bcopy  Bcopy  Mem
Mem
                             UNIX      reread reread (libc) (hand) read
write
--------- ------------- ---- ---- ---- ------ ------ ------ ------ ----
-----
9919_unit  Linux 2.6.25 34.2 34.7 21.5   55.5  161.8   79.9   79.2 160.
116.1
9919_unit  Linux 2.6.18 40.1 37.4 29.7   60.0  165.8   80.6   81.1 165.
117.8

Memory latencies in nanoseconds - smaller is better
    (WARNING - may not be correct, check graphs)
------------------------------------------------------------------------
------
Host                 OS   Mhz   L1 $   L2 $    Main mem    Rand mem
Guesses
--------- -------------   ---   ----   ----    --------    --------
-------
9919_unit  Linux 2.6.25   434 4.8150  174.6  183.3       511.8    No L2
cache?
9919_unit  Linux 2.6.18   445 4.6880  174.1  175.4       497.5    No L2
cache?



More information about the Linuxppc-dev mailing list