2.6.11 e1000 EEH MMIO failure

Linas Vepstas linas at austin.ibm.com
Wed May 4 08:46:32 EST 2005


Recent e1000 code has some new kind of whiz-bang watchdog timer
code that is causing the device to DMA off into hyperspace,
thus triggering the EEH code.  It's not clear to me if the
2.6.11 kernel has this code.

Am cc'ing two people who should know....

--linas

On Tue, May 03, 2005 at 01:02:12AM -0400, Sonny Rao was heard to remark:
> I'm guessing this means a bad e1000 card but I wanted to check with
> the experts.  The box is a p690 w/ some expansion drawers attached, 
> and is running a pretty-much stock 2.6.11 kernel, system is booted in
> SMP mode. 
> 
> Could it be related to e1000 errata "23" mentioned earlier on the
> mailing list?
> 
> Here are the messages:
> 
> Intel(R) PRO/1000 Network Driver - version 5.6.10.1-k2
> Copyright (c) 1999-2004 Intel Corporation.
> 
> <snip>
> 
> e1000: eth3: e1000_probe: Intel(R) PRO/1000 Network Connection
> PCI: Enabling device: (000a:01:01.0), cmd 143
> e1000: eth4: e1000_probe: Intel(R) PRO/1000 Network Connection
> PCI: Enabling device: (000a:01:01.1), cmd 143
> e1000: eth5: e1000_probe: Intel(R) PRO/1000 Network Connection
> PCI: Enabling device: (000e:21:01.0), cmd 143
> e1000: eth6: e1000_probe: Intel(R) PRO/1000 Network Connection
> PCI: Enabling device: (0011:21:01.0), cmd 143
> e1000: eth7: e1000_probe: Intel(R) PRO/1000 Network Connection
> RTAS: event: 15, Type: Retry, Severity: 2
> EEH: MMIO failure (2) on device: ethernet /pci at 3ffe7f0a000/pci at 2,2/ethernet at 1
> Call Trace:
> [c00000103873a910] [c000000000631630] 0xc000000000631630 (unreliable)
> [c00000103873a990] [c000000000036a6c] .eeh_dn_check_failure+0x2e4/0x334
> [c00000103873aa70] [c000000000036c20] .eeh_check_failure+0x164/0x1b0
> [c00000103873ab10] [d0000000002a6b04] .e1000_check_for_link+0x5ac/0x664 [e1000]
> [c00000103873abd0] [d00000000029a5e0] .e1000_watchdog+0x48/0x79c [e1000]
> [c00000103873ac90] [c00000000005f558] .run_timer_softirq+0x15c/0x280
> [c00000103873ad60] [c00000000005a3c4] .__do_softirq+0xdc/0x1c8
> [c00000103873ae20] [c00000000005a538] .do_softirq+0x88/0x8c
> [c00000103873aeb0] [c000000000011520] .timer_interrupt+0x294/0x35c
> [c00000103873afb0] [c00000000000a2b8] decrementer_common+0xb8/0x100
> --- Exception: 901 at ._spin_unlock_irqrestore+0x1c/0x28
>     LR = .rtas_call+0x1a4/0x2b4
> [c00000103873b2a0] [c0000000001e8128] .snprintf+0x30/0x44 (unreliable)
> [c00000103873b2e0] [c00000000003421c] .rtas_call+0x110/0x2b4
> [c00000103873b3a0] [c0000000000366ec] .read_slot_reset_state+0x94/0xac
> [c00000103873b420] [c000000000036890] .eeh_dn_check_failure+0x108/0x334
> [c00000103873b500] [c000000000036c20] .eeh_check_failure+0x164/0x1b0
> [c00000103873b5a0] [d00000000029f174] .e1000_up+0x404/0x40c [e1000]
> [c00000103873b650] [d00000000029f5cc] .e1000_open+0x54/0xc0 [e1000]
> [c00000103873b6e0] [c0000000002fec84] .dev_open+0x118/0x13c
> [c00000103873b780] [c0000000002fcef8] .dev_change_flags+0x19c/0x1d4
> [c00000103873b820] [c000000000357878] .devinet_ioctl+0x66c/0x820
> [c00000103873b930] [c000000000358794] .inet_ioctl+0x260/0x2e0
> [c00000103873b9c0] [c0000000002f03a0] .sock_ioctl+0x28c/0x418
> [c00000103873ba70] [c0000000000c7564] .do_ioctl+0x124/0x13c
> [c00000103873bb10] [c0000000000c777c] .vfs_ioctl+0x200/0x4e0
> [c00000103873bbc0] [c0000000000c7ab8] .sys_ioctl+0x5c/0xa4
> [c00000103873bc70] [c00000000001e8c0] .dev_ifsioc+0x8c/0x348
> [c00000103873bd50] [c0000000000e7d24] .compat_sys_ioctl+0x46c/0x4c4
> [c00000103873be30] [c00000000000d500] syscall_exit+0x0/0x18
> e1000: eth7: e1000_watchdog: NIC Link is Up 1000 Mbps Full Duplex
> RTAS: event: 16, Type: Retry, Severity: 2
> EEH: MMIO failure (2) on device: ethernet /pci at 3ffe7f0a000/pci at 2,2/ethernet at 1
> Call Trace:
> [c00000103873b3a0] [c000000000631630] 0xc000000000631630 (unreliable)
> [c00000103873b420] [c000000000036a6c] .eeh_dn_check_failure+0x2e4/0x334
> [c00000103873b500] [c000000000036c20] .eeh_check_failure+0x164/0x1b0
> [c00000103873b5a0] [d00000000029f174] .e1000_up+0x404/0x40c [e1000]
> [c00000103873b650] [d00000000029f5cc] .e1000_open+0x54/0xc0 [e1000]
> [c00000103873b6e0] [c0000000002fec84] .dev_open+0x118/0x13c
> [c00000103873b780] [c0000000002fcef8] .dev_change_flags+0x19c/0x1d4
> [c00000103873b820] [c000000000357878] .devinet_ioctl+0x66c/0x820
> [c00000103873b930] [c000000000358794] .inet_ioctl+0x260/0x2e0
> [c00000103873b9c0] [c0000000002f03a0] .sock_ioctl+0x28c/0x418
> [c00000103873ba70] [c0000000000c7564] .do_ioctl+0x124/0x13c
> [c00000103873bb10] [c0000000000c777c] .vfs_ioctl+0x200/0x4e0
> [c00000103873bbc0] [c0000000000c7ab8] .sys_ioctl+0x5c/0xa4
> [c00000103873bc70] [c00000000001e8c0] .dev_ifsioc+0x8c/0x348
> [c00000103873bd50] [c0000000000e7d24] .compat_sys_ioctl+0x46c/0x4c4
> [c00000103873be30] [c00000000000d500] syscall_exit+0x0/0x18
> EEH: MMIO failure (2), notifiying device 0011:21:01.0 
> EEH: MMIO failure (2), notifiying device 0011:21:01.0 
> PCI: Enabling device: (0014:01:01.0), cmd 143
> e1000: eth8: e1000_probe: Intel(R) PRO/1000 Network Connection
> PCI: Enabling device: (0014:01:01.1), cmd 143
> e1000: eth9: e1000_probe: Intel(R) PRO/1000 Network Connection
> PCI: Enabling device: (0017:01:01.0), cmd 143
> e1000: eth10: e1000_probe: Intel(R) PRO/1000 Network Connection
> 
> Sonny
> _______________________________________________
> Linuxppc64-dev mailing list
> Linuxppc64-dev at ozlabs.org
> https://ozlabs.org/cgi-bin/mailman/listinfo/linuxppc64-dev
> 



More information about the Linuxppc64-dev mailing list