Re: [RFC PATCH v2] memory-barriers: remove smp_mb__after_unlock_lock()
From: Paul E. McKenney
Date: Mon Jul 13 2015 - 18:53:55 EST
On Tue, Jul 14, 2015 at 12:15:03AM +0200, Peter Zijlstra wrote:
> On Mon, Jul 13, 2015 at 01:16:42PM -0700, Paul E. McKenney wrote:
> > On Mon, Jul 13, 2015 at 03:41:53PM -0400, Peter Hurley wrote:
> > > > Does that answer the question, or am I missing the point?
> > >
> > > Yes, it shows that smp_mb__after_unlock_lock() has no purpose, since it
> > > is defined only for PowerPC and your test above just showed that for
> > > the sequence
>
> The only purpose is to provide transitivity, but the documentation fails
> to explicitly call that out.
It does say that it is a full barrier, but I added explicit mention of
transitivity.
> > >
> > > store a
> > > UNLOCK M
> > > LOCK N
> > > store b
> > >
> > > a and b is always observed as an ordered pair {a,b}.
> >
> > Not quite.
> >
> > This is instead the sequence that is of concern:
> >
> > store a
> > unlock M
> > lock N
> > load b
>
> So its late and that table didn't parse, but that should be ordered too.
> The load of b should not be able to escape the lock N.
>
> If only because LWSYNC is a valid RMB and any LOCK implementation must
> load the lock state to observe it unlocked.
If you actually hold a given lock, then yes, you will observe anything
previously done while holding that same lock, even if you don't use
smp_mb__after_unlock_lock(). The smp_mb__after_unlock_lock() comes into
play when code not holding a lock needs to see the ordering. RCU needs
this because of the strong ordering that grace periods must provide:
regardless of who started or ended the grace period, anything on any
CPU preceding a given grace period is fully ordered before anything on
any CPU following that same grace period. It is not clear to me that
anything else would need such strong ordering.
> > > Additionally, the assertion in Documentation/memory_barriers.txt that
> > > the sequence above can be reordered as
> > >
> > > LOCK N
> > > store b
> > > store a
> > > UNLOCK M
> > >
> > > is not true on any existing arch in Linux.
> >
> > It was at one time and might be again.
>
> What would be required to make this true? I'm having a hard time seeing
> how things can get reordered like that.
You are right, I failed to merge current and past knowledge. At one time,
Itanium was said to allow things to bleed into lock-based critical sections.
However, we now know that ld,acq and st,rel really do full ordering.
Compilers might one day do this sort of reordering, but I would guess
that Linux kernel builds would disable this sort of thing. Something
about wanting critical sections to remain small.
Thanx, Paul
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/