Re: [PATCH] synchronize_irq needs a barrier
From: Benjamin Herrenschmidt
Date: Wed Oct 17 2007 - 22:41:25 EST
On Wed, 2007-10-17 at 19:12 -0700, Linus Torvalds wrote:
>
> So, what exactly does it protect against? At a minimum, this needs a
> comment in the changelog, and probably preferably in the source code too.
I replied to Andrew, but I agree, it's worth a comment, I'll add one.
> The thing is, synchronize_irq() can only protect against interrupts that
> are *already*running* on another CPU, and the caller must have made sure
> that no new interrupts are coming in (or at least that whatever new
> interrupts that come in will not pick up a certain piece of data).
>
> So I can imagine that the smb_mb() is there so that the caller - who has
> cleared some list or done something like that - will have any preceding
> writes that it did be serialized with actually checking the old state of
> "desc->status".
Yes.
> Fair enough - I can see that a smp_mb() is useful. But I think it needs
> documenting as such, and preferably with an example of how this actually
> happened in the first place (do you have one?)
One could argue that it's the caller that should do the mb() but that
would really be asking for trouble. Since most usage scenario require
it, and it's not a hot path, I prefer having it here.
Note that some kind of read barrier or compiler barrier should be needed
regardless, or we are just not sync'ing with anything at all (we may
have loaded the value ages ago and thus operate on a totally stale
value). I prefer a full barrier to also ensure all previous stores are
pushed out.
> The synchronize_irq() users that are really fundamental (ie the irq
> datastructures themselves) all should use the irq descriptor spinlock, and
> should *not* be needing any of this, since they would have serialized with
> whoever actually set the IRQ_INPROGRESS bit in the first place.
That isn't always the case. You can have very efficient lock-less fast
path and use synchronize_irq as a kind of RCU-like way to have the slow
path do the right thing.
> So in *that* sense, I think the memory barrier is useless, and I can't
> make up my mind whether it's good or bad. Which is why I'd really like to
> have an example scenario spelled out where it makes a difference..
Well, typically, I can clear a pointer and want to make sure my IRQ
handler isn't using it anymore before freeing the memory. Or set a "HW
is off" flag. Having spinlocks all over isn't always the best approach
on things that are really hot path, it's fair enough to use lockless
approaches like that by moving the burden to the slow path that does
synchronize_irq.
In general, I tend to think that for this function to make any sense
(that is, to synchronize anything at all), it needs a barrier or you are
just making a decision based on a totally random value of desc->status
since it can have been re-ordered, speculatively loaded, pre-fetched,
whatever'ed... :-).
Want a respin with a comment ?
Ben.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/