[Cbe-oss-dev] intermittent trouble on startup on a QS22 with 2.6.30

Marcus Daniels mdaniels at lanl.gov
Sat Dec 12 04:46:43 EST 2009


On 12/11/09 4:52 AM, Arnd Bergmann wrote:
> On Friday 11 December 2009, Marcus G. Daniels wrote:
>    
>> I copied the apnet.[hc] from the 2.6.32 tree into the 2.6.30 tree and
>> rebuilt modules.  Before I had only copied the axon driver.  The apnet
>> interface now don't freeze as above.   The remaining mystery is why
>> 2.6.32 fails as it did..
>>      
> As I mentioned, I did not test the code, so it's not that surprising.
> Apparently, the NUMA layout changed between 2.6.30 and 2.6.30,
> please try with this patch:
>
> --- a/arch/powerpc/platforms/cell/pmu.c
> +++ b/arch/powerpc/platforms/cell/pmu.c
> @@ -381,7 +381,7 @@ static int __init cbe_init_pm_irq(void)
>   	unsigned int irq;
>   	int rc, node;
>
> -	for_each_node(node) {
> +	for_each_online_node(node) {
>   		irq = irq_create_mapping(NULL, IIC_IRQ_IOEX_PMI |
>   					       (node<<  IIC_IRQ_NODE_SHIFT));
>   		if (irq == NO_IRQ) {
>    
That works!   I can use netcat/telnet to send data over the apnet 
interfaces.

I do get the following message from dmesg on startup, but just once..

Wonderful, thank you!

Marcus

apnet0: TX descriptors mapped at 0xd000080082004100
apnet0: TX stopped, remote ring not ready!
apnet1: TX descriptors mapped at 0xd000080082804100
apnet1: TX stopped, remote ring not ready!
NETDEV WATCHDOG: apnet0 (): transmit queue 0 timed out
------------[ cut here ]------------
Badness at net/sched/sch_generic.c:261
NIP: c0000000004d9d6c LR: c0000000004d9d68 CTR: 0000000000000001
REGS: c00000000ffe7ac0 TRAP: 0700   Not tainted  (2.6.32)
MSR: 9000000000029032 <EE,ME,CE,IR,DR>  CR: 28000024  XER: 20000000
TASK = c0000000fe812980[0] 'swapper' THREAD: c0000000fe118000 CPU: 3
GPR00: c0000000004d9d68 c00000000ffe7d40 c0000000009d7cb0 000000000000003d
GPR04: 0000000000000000 ffffffffffffffff 0000000000000001 c00000000088c46c
GPR08: 000000000001ffff 0000000000000000 c000000000aa229c 0000000000000001
GPR12: 0000000028000042 c000000000a72c00 ffffffffffffffff ffffffffffffffff
GPR16: ffffffffffffffff ffffffffffffffff ffffffffffffffff ffffffffffffffff
GPR20: 01020304cabebabe ffffffffffffffff c000000000a83e60 0000000000000001
GPR24: 0000000000000003 c0000001fca88000 0000000000000100 0000000000000003
GPR28: 0000000000000001 0000000000000000 c00000000095d9e0 c0000001fca88000
NIP [c0000000004d9d6c] .dev_watchdog+0x1d4/0x308
LR [c0000000004d9d68] .dev_watchdog+0x1d0/0x308
Call Trace:
[c00000000ffe7d40] [c0000000004d9d68] .dev_watchdog+0x1d0/0x308 (unreliable)
[c00000000ffe7e20] [c0000000000b4bd8] .run_timer_softirq+0x25c/0x368
[c00000000ffe7ee0] [c0000000000ac538] .__do_softirq+0x138/0x258
[c00000000ffe7f90] [c00000000002aa00] .call_do_softirq+0x14/0x24
[c0000000fe11b810] [c00000000000db7c] .do_softirq+0x88/0xf0
[c0000000fe11b8b0] [c0000000000ac74c] .irq_exit+0x54/0xb0
[c0000000fe11b930] [c000000000027d30] .timer_interrupt+0x1b0/0x1e0
[c0000000fe11b9c0] [c000000000068e5c] .cbe_system_reset_exception+0x74/0xb0
[c0000000fe11ba40] [c000000000028e14] .system_reset_exception+0x44/0xd8
[c0000000fe11bac0] [c000000000003414] system_reset_common+0x114/0x180
--- Exception: 100 at .cbe_power_save+0x98/0xb4
     LR = .cpu_idle+0x10c/0x1d0
[c0000000fe11bdb0] [0000000000000000] (null) (unreliable)
[c0000000fe11be40] [c000000000013f54] .cpu_idle+0x10c/0x1d0
[c0000000fe11bed0] [c00000000057124c] .start_secondary+0x3a0/0x3e0
[c0000000fe11bf90] [c0000000000082e0] .start_secondary_prolog+0x10/0x14
Instruction dump:
40be0040 38810070 7fe3fb78 38a00040 4bfe0be1 60000000 7fa6eb78 7c651b78
e87e8058 7fe4fb78 4bbcb5a9 60000000 <0fe00000> 38000001 e93e8050 90090000
apnet0: no IPv6 routers present
apnet1: no IPv6 routers present



More information about the cbe-oss-dev mailing list