How to debug a hung multi-core system....

Morrison, Tom tmorrison at empirix.com
Fri May 22 01:23:30 EST 2009


Just had a short conference with several co-workers to go over the
results.

We think that LT0 (the one that maps the kernel) has been corrupted:

       Entry  EPN          RPN    TID  TMASK   WIMGE  TSIZ U0:3  X0:1
       ---------------------------------------------------------------
       LT0  C0000000     00000000 00     0FF     04     9     0     0

       PID  TS  PROT SHEN   UR   UW   UX   SR   SW   SX  TIDZ VAL
       ---------------------------------------------------------------
       0    0    P    P    E    E    D    E    E    D    D    V

This is absolutely wrong. This is the TLB entry for the kernel, and as
you can see it does NOT have execute privileges (and in fact user space
HAS execute privileges for this area, the complete opposite of what it
should be).
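
For anyone who wants to check a dump like this mechanically, here is a
rough sketch in Python. It only parses the permission row as printed
above. The meanings E = enabled, D = disabled, V = valid are my reading
of the debugger output, and the rule "a kernel mapping needs SX enabled
and no user access" is an assumption, not something the tool documents.
The function names are made up for illustration.

```python
# Sketch: sanity-check the permission row of a debugger TLB dump.
# Assumptions (not confirmed by the tool): E = enabled, D = disabled,
# V = valid; a kernel mapping should have SX enabled and UR/UW/UX disabled.

HEADER = "PID  TS  PROT SHEN   UR   UW   UX   SR   SW   SX  TIDZ VAL"
ROW = "0    0    P    P    E    E    D    E    E    D    D    V"


def parse_row(header, row):
    """Pair each column name with the value printed beneath it."""
    names = header.split()
    values = row.split()
    if len(names) != len(values):
        raise ValueError("header and row have different column counts")
    return dict(zip(names, values))


def kernel_mapping_problems(entry):
    """Return a list of reasons the entry looks wrong for a kernel mapping."""
    problems = []
    if entry["VAL"] != "V":
        problems.append("entry is not valid")
    if entry["SX"] != "E":
        problems.append("supervisor execute (SX) is disabled")
    for bit in ("UR", "UW", "UX"):
        if entry[bit] == "E":
            problems.append("user access %s is enabled" % bit)
    return problems


entry = parse_row(HEADER, ROW)
problems = kernel_mapping_problems(entry)
for problem in problems:
    print(problem)
```

Feeding it the header and row of each TLB entry from successive dumps
would at least show which entries changed between a good boot and the
hung state.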

This is why it is stuck AT that instruction (we can't even single-step
from that location).

(One of) the first problem(s) is: how and when did this TLB entry get
corrupted?

Tom



