wmb vs mmiowb
Linus Torvalds
torvalds at linux-foundation.org
Thu Aug 23 04:07:32 EST 2007
On Wed, 22 Aug 2007, Nick Piggin wrote:
>
> It took me more than a glance to see what the difference is supposed to be
> between wmb() and mmiowb(). I think especially because mmiowb isn't really
> like a write barrier.
Well, it is, but it isn't. Not on its own - but together with a "normal"
barrier it is.
> wmb is supposed to order all writes coming out of a single CPU, so that's
> pretty simple.
No. wmb orders all *normal* writes coming out of a single CPU.
It may not do anything at all for "uncached" IO writes that aren't part of
the cache coherency, and that are handled using totally different queues
(both inside and outside of the CPU)!
Now, on x86, the CPU actually tends to order IO writes *more* than it
orders any other writes (they are mostly entirely synchronous, unless the
area has been marked as write merging), but at least on PPC, it's the
other way around: without the cache as a serialization entry, you end up
having a totally separate queueu to serialize, and a regular-memory write
barrier does nothing at all to the IO queue.
So think of the IO write queue as something totally asynchronous that has
zero connection to the normal write ordering - and then think of mmiowb()
as a way to *insert* a synchronization point.
In particular, the normal synchronization primitives (spinlocks, mutexes
etc) are guaranteed to synchronize only normal memory accesses. So if you
do MMIO inside a spinlock, since the MMIO writes are totally asyncronous
wrt the normal memory accesses, the MMIO write can escape outside the
spinlock unless you have somethign that serializes the MMIO accesses with
the normal memory accesses.
So normally you'd see "mmiowb()" always *paired* with a normal memory
barrier! The "mmiowb()" ends up synchronizing the MMIO writes with the
normal memory accesses, and then the normal memory barrier acts as a
barrier for subsequent writes.
Of course, the normal memory barrier would usually be a "spin_unlock()" or
something like that, not a "wmb()". In fact, I don't think the powerpc
implementation (as an example of this) will actually synchronize with
anything *but* a spin_unlock().
> It really seems like it is some completely different concept from a
> barrier. And it shows, on the platform where it really matters (sn2), where
> the thing actually spins.
I agree that it probably isn't a "write barrier" per se. Think of it as a
"tie two subsystems together" thing.
(And it doesn't just matter on sn2. It also matters on powerpc64, although
I think they just set a flag and do the *real* sync in the spin_unlock()
path).
Side note: the thing that makes "mmiowb()" even more exciting is that it's
not just the CPU, it's the fabric outside the CPU that matters too. That's
why the sn2 needs this - but the powerpc example shows a case where the
ordering requirement actually comes from the CPU itself.
Linus
More information about the Linuxppc-dev
mailing list