device not available because of BAR 0 collisions
Benjamin Herrenschmidt
benh at kernel.crashing.org
Fri Apr 29 06:55:41 EST 2011
On Thu, 2011-04-28 at 13:29 -0400, Steven A. Falco wrote:
> On 04/27/2011 03:51 PM, Steven A. Falco wrote:
> > On 04/26/2011 07:39 PM, Benjamin Herrenschmidt wrote:
> >> On Tue, 2011-04-26 at 09:38 -0400, Steven A. Falco wrote:
> >>> On 04/25/2011 08:01 PM, Benjamin Herrenschmidt wrote:
> >>>> On Mon, 2011-04-25 at 16:10 -0400, Steven A. Falco wrote:
> >>>>> I'm getting an error message when trying to talk to some custom
> >>>>> hardware:
> >>>>>
> >>>>> dx83xx 0001:43:00.0: device not available because of BAR 0
> >>>>> [0xa1000000-0xa1ffffff] collisions
>
> I believe I've gotten to the root cause of this issue. Turns out the
> ASIC is reporting a class code of 0000, in violation of the PCI spec.
>
> As a result, Linux is refusing to set up the ASIC, and so it remains
> with the settings made in U-Boot, which are in conflict with the
> allocation Linux makes for the bridge the ASIC is connected to.
>
> So Ben, you are absolutely right that there was some left-over
> configuration.
Hrm. that's odd still, do you know where in linux is the code that tests
for the class code and decides to not setup the device resources ? I
can't see that...
> My choices appear to be:
>
> 1) Fix the ASIC (yeah, right)
>
> 2) Force Linux to use the U-Boot settings
>
> 3) Hack Linux to set up a device with a bogus class.
>
> I'm not sure why this hardware works in x86 - I guess it is less
> fussy.
x86 probably just re-uses whatever setting the BIOS does, but I'm still
a bit surprised by your class code story.
I supose you can do a PCI header quirk that overrides the class code in
struct pci_dev. Something like:
static void __devinit quirk_your_asic_class(struct pci_dev *dev)
{
dev->class = foobar;
}
DECLARE_PCI_FIXUP_HEADER(PCI_VENDOR_xxx, PCI_DEVICE_ID_yyy, quirk_your_asic_class);
But I'd like to figure out where that is tested bcs I haven't found so
far...
> Steve
>
> >>>>>
> >>>>> I see in setup-res.c that this message comes out when there is no
> >>>>> parent for
> >>>>> a device resource.
> >>>>
> >>>> .../...
> >>>>
> >>>> It mostly happens in arch/powerpc/kernel/pci-common.c and the generic
> >>>> setup-res.c
> >>>>
> >>>> Try #define DEBUG at the top (before the #includes) of pci-common.c and
> >>>> pci_32.c (remove the exiting #undef in the last one) and send us the
> >>>> full dmesg log, along with the output of cat /proc/iomem
> >>
> >> Have you set any specific flags ? IE. Modified the value of
> >> ppc_pci_flags from what the 4xx code sets originally ?
> >
> > For fun, I just tried changing:
> >
> > ppc_pci_set_flags(PPC_PCI_REASSIGN_ALL_RSRC);
> >
> > to:
> >
> > ppc_pci_set_flags(PPC_PCI_PROBE_ONLY);
> >
> > I realize that is the exact opposite of what you were suggesting, but
> > please bear with me for a bit.
> >
> > I also changed the PCIE 0 ranges from:
> >
> > ranges = <0x02000000 0x00000000 0x80000000 0x90000000 0x00000000 0x10000000
> > 0x01000000 0x00000000 0x00000000 0xe8010000 0x00000000 0x00010000>;
> >
> > ranges = <0x02000000 0x00000000 0x90000000 0x90000000 0x00000000 0x10000000
> > 0x01000000 0x00000000 0x00000000 0xe8010000 0x00000000 0x00010000>;
> >
> > I changed the ranges not because I wanted a 1:1 map, but because 90000000 is
> > what U-Boot chooses when it scans PCIe 1.
> >
> > At this point, everything is working. Here is /proc/iomap:
> >
> > 90000000-9fffffff : /plb/pciex at 0c0000000
> > 90000000-94ffffff : PCI Bus 0001:41
> > 90000000-9001ffff : 0001:41:00.0
> > 90100000-94ffffff : PCI Bus 0001:42
> > 90100000-92ffffff : PCI Bus 0001:43
> > 91000000-91ffffff : 0001:43:00.0 //<--- was missing before
> > 92000000-92ffffff : 0001:43:00.0 //<--- was missing before
> > 93000000-94ffffff : PCI Bus 0001:44
> > 93000000-93ffffff : 0001:44:00.0 //<--- was missing before
> > 94000000-94ffffff : 0001:44:00.0 //<--- was missing before
> > e0000000-e7ffffff : /plb/pciex at 0a0000000
> > e0000000-e7ffffff : PCI Bus 0000:01
> > e0000000-e00fffff : 0000:01:00.0
> > e0100000-e01fffff : 0000:01:00.0
> > e4000000-e7ffffff : 0000:01:00.0
> > ef600200-ef600207 : serial
> > ef600300-ef600307 : serial
> > ef600600-ef600606 : spi_ppc4xx_of
> > ef6c0000-ef6cffff : dwc_otg.0
> > ef6c0000-ef6cffff : dwc_otg
> > fc000000-ffffffff : fc000000.nor_flash
> >
> > Now I see the bars for the ASICs (flagged above). I could stop here,
> > and declare success, but I don't really like this solution, because it
> > requires me to be sure the dts has the same bus addresses that U-Boot
> > will choose. Seems risky.
> >
> > Tentative conclusion: Either I still have something set wrong in my dts
> > or there is a bug in the Linux PCI bus mapping code.
> >
> > Steve
> >
> >>
> >> It does look to me like some of your device BARs have been setup already
> >> by the firmware in a way that conflict with the way you configure your
> >> ranges, and the kernel doesn't appear to detect nor try to remap that
> >> which would happen if you have the "probe only" flag set.
> >>
> >> IE. On your c0000000 bus, you have memory at 90000000 CPU space mapped
> >> to 80000000 PCI space. However, when probing, the kernel finds:
> >>
> >> pci 0001:41:00.0: reg 10 32bit mmio: [0x90000000-0x9001ffff]
> >>
> >> IE. A BAR was already set with a value of 90000000 PCI-side which is out
> >> of the bounds you have for your bus.
> >>
> >> Maybe you really want to configure that second bus to have CPU 90000000
> >> mapped to 90000000 PCI-side ? (IE. a 1:1 mapping). That would be
> >> something to fix in your "ranges" property.
> >>
> >> Cheers,
> >> Ben.
> >
> >
>
>
More information about the Linuxppc-dev
mailing list