[Cbe-oss-dev] spu-top
Yury Serdyuk
Yury at serdyuk.botik.ru
Fri Nov 21 21:37:23 EST 2008
>>>spu-top: Context View
>>>Cpu(s) load avg: 0.09, 0.13, 0.22
>>>Spu(s) load avg: 3.43, 1.60, 1.18
>>>Cpu(s): 0.1%us, 0.3%sys, 0.2%wait, 0.0%nice, 99.4%idle
>>>Spu(s): 49.9%us, 0.1%sys, 0.0%wait, 50.0%idle
>>>
>>> PID TID USERNAME S F %SPU SPE TIME BINARY
>>> 24429 24443 user002 U 0.2 4 14.230 mono
>>> 24429 24442 user002 U 0.2 5 14.231 mono
>>> 24429 24441 user002 U 0.2 6 14.232 mono
>>> 24429 24440 user002 U 0.2 7 14.232 mono
>>> 24429 24439 user002 L 0.0 -1 14.305 mono
>>> 24429 24438 user002 L 0.0 -1 14.305 mono
>>> 24429 24437 user002 L 0.0 -1 14.306 mono
>>> 24429 24436 user002 L 0.0 -1 14.308 mono
>>> 24429 24435 user002 U 0.2 0 14.342 mono
>>> 24429 24434 user002 U 0.2 1 14.343 mono
>>> 24429 24433 user002 U 0.2 2 14.345 mono
>>> 24429 24432 user002 U 0.2 3 14.347 mono
>>>
>>>
>>In fact, the real workload is 100% for each SPU (except for processes with "-1"),
>>and the total workload (for 8 SPUs) is near 50%:
>>
>>
>>
>>>Spu(s): 49.9%us, 0.1%sys, 0.0%wait, 50.0%idle
>>>
>>>
>
>That still sounds reasonable, you probably do a lot of context switches.
>
I don't think that is the cause, for the following reason.
First, I tried a well-known matrix multiplication program
(http://tu-dresden.de/die_tu_dresden/zentrale_einrichtungen/zih/forschung/architektur_und_leistungsanalyse_von_hochleistungsrechnern/cell/matmul/index_html/document_view?body_language=en)
on machines with different SDKs.
Here are the results:
1) QS22 blade, Fedora Core 7, SDK 3.0:
> $ uname -a
> Linux cell8i 2.6.22-5.20070920bsc #1 SMP Tue Sep 25 10:49:16 CEST 2007
> ppc64 ppc64 ppc64 GNU/Linux
An application run:
> @cell8i matmul]$ ./matmul -m 6144 -s 1
spu-top during the run:
> spu-top: Context View
> Cpu(s) load avg: 0.3%, 0.1%, 0.0%
> spu-top: Context View
> Cpu(s) load avg: 0.4%, 0.1%, 0.0%
> Spu(s) load avg: 20.2%, 9.8%, 3.8%
> Cpu(s): 24.9%us, 0.4%sys, 0.0%wait, 0.0%nice, 74.7%idle
> Spu(s): 6.2%us, 0.0%sys, 0.0%wait, 93.8%idle
>
> PID TID USERNAME S F %SPU SPE TIME BINARY
> 30078 30079 user002 U 100.0 7 3.234 matmul
2) QS22 blade, Fedora Core 9, SDK 3.1:
> $ uname -a
> Linux cell8i-3 2.6.25.14-108.20080910bsc.ppc64 #1 SMP Fri Sep 12
> 11:44:36 CEST 2008 ppc64 ppc64 ppc64 GNU/Linux
The same application run:
> cell8i-3 matmul]$ ./matmul -m 6144 -s 1
spu-top:
> spu-top: Context View
> Cpu(s) load avg: 0.56, 0.18, 0.06
> Spu(s) load avg: 0.10, 0.03, 0.01
> Cpu(s): 16.4%us, 0.3%sys, 0.0%wait, 0.0%nice, 83.3%idle
> Spu(s): 6.2%us, 0.0%sys, 0.0%wait, 93.8%idle
>
> PID TID USERNAME S F %SPU SPE TIME BINARY
> 12887 12888 user002 U 0.0 7 4.328 matmul
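
As a rough consistency check of the two listings, here is a minimal sketch of how the summary "Spu(s): ...%us" value relates to the per-context %SPU column. It assumes the blade exposes 16 SPEs (inferred from 6.2% being about 1/16; spu-top does not print this number), and the function name is hypothetical, not part of spu-top. With SDK 3.0 the single context at 100.0% agrees with the 6.2% summary; with SDK 3.1 the same summary is shown, but the per-context column reads 0.0.

```python
# Sketch only: relates per-context %SPU values to the aggregate "Spu(s)" line.
# TOTAL_SPES = 16 is an assumption inferred from 6.2% ~ 1/16.

def aggregate_spu_percent(per_context_percent, total_spes):
    """Average utilisation over all SPEs; SPEs with no context count as 0%."""
    if total_spes <= 0:
        raise ValueError("total_spes must be positive")
    return sum(per_context_percent) / total_spes

TOTAL_SPES = 16  # assumed number of SPEs on the blade

# SDK 3.0 listing: one matmul context reported at 100.0 %SPU.
sdk30 = aggregate_spu_percent([100.0], TOTAL_SPES)

# SDK 3.1 listing: the same context reported at 0.0 %SPU.
sdk31 = aggregate_spu_percent([0.0], TOTAL_SPES)

# Earlier mono run: 8 contexts, each really keeping its SPE 100% busy.
mono = aggregate_spu_percent([100.0] * 8, TOTAL_SPES)

print(sdk30, sdk31, mono)
```

Under that assumption the summary line is plausible in both cases (one busy SPE out of 16, or eight out of 16 for the mono run), and only the per-context %SPU column under SDK 3.1 disagrees with it.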
Thanks.
Yury
P.S.
There are also some kernel issues that are present on QS blades but not
on PlayStation 3 (please see my next message).