[PATCH 2.6.14] mm: 8xx MM fix for

Tom Rini trini at kernel.crashing.org
Tue Nov 8 05:22:11 EST 2005


On Mon, Nov 07, 2005 at 07:14:15PM +0100, Joakim Tjernlund wrote:
> > -----Original Message-----
> > From: Tom Rini [mailto:trini at kernel.crashing.org] 
> > Sent: 07 November 2005 16:52
> > To: Marcelo Tosatti
> > Cc: Joakim Tjernlund; Pantelis Antoniou; Dan Malek; 
> > linuxppc-embedded at ozlabs.org; gtolstolytkin at ru.mvista.com
> > Subject: Re: [PATCH 2.6.14] mm: 8xx MM fix for
> > 
> > On Mon, Nov 07, 2005 at 08:16:18AM -0200, Marcelo Tosatti wrote:
> > > Joakim!
> > > 
> > > On Mon, Nov 07, 2005 at 03:32:52PM +0100, Joakim Tjernlund wrote:
> > > > Hi Marcelo
> > > > 
> > > > [SNIP] 
> > > > > The root of the problem are the changes against the 8xx TLB 
> > > > > handlers introduced
> > > > > during v2.6. What happens is the TLBMiss handlers load the 
> > > > > zeroed pte into
> > > > > the TLB, causing the TLBError handler to be invoked (thats 
> > > > > two TLB faults per 
> > > > > pagefault), which then jumps to the generic MM code to 
> > setup the pte.
> > > > > 
> > > > > The bug is that the zeroed TLB is not invalidated (the 
> > same reason
> > > > > for the "dcbst" misbehaviour), resulting in infinite 
> > TLBError faults.
> > > > > 
> > > > > Dan, I wonder why we just don't go back to v2.4 behaviour.
> > > > 
> > > > This is one reason why it is the way it is:
> > > > 
> > http://ozlabs.org/pipermail/linuxppc-embedded/2005-January/016382.html
> > > > This details are little fuzzy ATM, but I think the reason for the
> > > > current
> > > > impl. was only that it was less intrusive to impl.
> > > 
> > > Ah, I see. I wonder if the bug is processor specific: we 
> > don't have such
> > > changes in our v2.4 tree and never experienced such problem.
> > > 
> > > It should be pretty easy to hit it right? (instruction 
> > pagefaults should
> > > fail).
> > > 
> > > Grigori, Tom, can you enlight us about the issue on the URL 
> > above. How
> > > can it be triggered?
> > 
> > So after looking at the code in 2.6.14 and current git, I think the
> > above URL isn't relevant, unless there was a change I missed (which
> > could totally be possible) that reverted the patch there and 
> > fixed that
> > issue in a different manner.  But since I didn't figure that 
> > out until I
> > had finished researching it again:
> 
> I wasn't clear enough. What I meant was that the above patch made me
> think and
> the result was that I came up with a simpler fix, the "two exception"
> fix that
> is in current kernels. See
> http://linux.bkbits.net:8080/linux-2.6/diffs/arch/ppc/kernel/head_8xx.S@
> 1.19?nav=index.html|src/.|src/arch|src/arch/ppc|src/arch/ppc/kernel|hist
> /arch/ppc/kernel/head_8xx.S
> It appears this fix has some other issues :(
> 
> How do the other ppc arches do? I am guessing that they don't double
> fault, but bails
> out to do_page_fault from the TLB Miss handler, like 8xx used to do.

Assuming Dan doesn't come up with a more simple & better fix, maybe we
should go back to the original patch I made?

-- 
Tom Rini
http://gate.crashing.org/~trini/



More information about the Linuxppc-embedded mailing list