Re: [PATCH tip/core/locking 4/4] Documentation/memory-barriers.txt:Document ACCESS_ONCE()

From: Paul E. McKenney
Date: Tue Dec 10 2013 - 12:44:17 EST


On Tue, Dec 10, 2013 at 04:20:06PM +0100, Ingo Molnar wrote:
>
> * Paul E. McKenney <paulmck@xxxxxxxxxxxxxxxxxx> wrote:
>
> > > [...]
> > >
> > > > + (*) The compiler is within its rights to reorder memory accesses unless
> > > > + you tell it not to. For example, consider the following interaction
> > > > + between process-level code and an interrupt handler:
> > > > +
> > > > + void process_level(void)
> > > > + {
> > > > + msg = get_message();
> > > > + flag = true;
> > > > + }
> > > > +
> > > > + void interrupt_handler(void)
> > > > + {
> > > > + if (flag)
> > > > + process_message(msg);
> > > > + }
> > > > +
> > > > + There is nothing to prevent the the compiler from transforming
> > > > + process_level() to the following, in fact, this might well be a
> > > > + win for single-threaded code:
> > > > +
> > > > + void process_level(void)
> > > > + {
> > > > + flag = true;
> > > > + msg = get_message();
> > > > + }
> > > > +
> > > > + If the interrupt occurs between these two statement, then
> > > > + interrupt_handler() might be passed a garbled msg. Use ACCESS_ONCE()
> > > > + to prevent this as follows:
> > > > +
> > > > + void process_level(void)
> > > > + {
> > > > + ACCESS_ONCE(msg) = get_message();
> > > > + ACCESS_ONCE(flag) = true;
> > > > + }
> > > > +
> > > > + void interrupt_handler(void)
> > > > + {
> > > > + if (ACCESS_ONCE(flag))
> > > > + process_message(ACCESS_ONCE(msg));
> > > > + }
> > >
> > > Looking at this, I find myself wondering why you couldn't just put a
> > > barrier() between the two statements in process_level()? ACCESS_ONCE()
> > > seems like a heavy hammer to just avoid reordering of two assignments.
> > > What am I missing, and what could be added here to keep the other folks as
> > > dense as me from missing the same thing?
> >
> > You could use barrier() from an ordering viewpoint. However,
> > ACCESS_ONCE() is often lighter weight than barrier(). ACCESS_ONCE()
> > affects only that one access, while barrier() forces the compiler to
> > forget pretty much anything it previously gleaned from any region of
> > memory, including private locations that no one else touches.
> >
> > I am adding a sentence saying that pure ordering can be provided by
> > barrier(), though often at higher cost.
>
> I suspect a related question would be, is the compiler allowed to
> reorder:
>
>
> x = ACCESS_ONCE(a);
> y = ACCESS_ONCE(b);
>
> ?
>
> This wording:
>
> + [...] Howevever, ACCESS_ONCE() can be thought of as a weak form

And I guess I should fix the "Howevever" as well. Or make up a meaning
for it, this being English and all...

> +for barrier() that affects only the specific accesses flagged by the
> +ACCESS_ONCE().
>
> Does not seem to be obvious enough to me - does it affect accesses to
> the variables referenced (but still allows accesses to separate
> variables reordered), or does it affect compiler-ordering of all
> ACCESS_ONCE() instances, instructing the compiler to preserve program
> order?

I cover this in the bullet item about reordering memory accesses:

(*) The compiler is within its rights to reorder memory accesses unless
you tell it not to. For example, consider the following interaction
between process-level code and an interrupt handler:

void process_level(void)
{
msg = get_message();
flag = true;
}

void interrupt_handler(void)
{
if (flag)
process_message(msg);
}

There is nothing to prevent the the compiler from transforming
process_level() to the following, in fact, this might well be a
win for single-threaded code:

void process_level(void)
{
flag = true;
msg = get_message();
}

If the interrupt occurs between these two statement, then
interrupt_handler() might be passed a garbled msg. Use ACCESS_ONCE()
to prevent this as follows:

void process_level(void)
{
ACCESS_ONCE(msg) = get_message();
ACCESS_ONCE(flag) = true;
}

void interrupt_handler(void)
{
if (ACCESS_ONCE(flag))
process_message(ACCESS_ONCE(msg));
}

Note that the ACCESS_ONCE() wrappers in interrupt_handler()
are needed if this interrupt handler can itself be interrupted
by something that also accesses 'flag' and 'msg', for example,
a nested interrupt or an NMI. Otherwise, ACCESS_ONCE() is not
needed in interrupt_handler() other than for documentation purposes.
(Note also that nested interrupts do not typically occur in modern
Linux kernels, in fact, if an interrupt handler returns with
interrupts enabled, you will get a WARN_ONCE() splat.)

This effect could also be achieved using barrier(), but ACCESS_ONCE()
is more selective: With ACCESS_ONCE(), the compiler need only forget
the contents of the indicated memory located, while with barrier()
the compiler must discard the value of all memory locations that
it has currented cached in any machine registers.

Does that cover it?

> Also, it's not clear what happens if non-ACCESS_ONCE() access to a
> variable is mixed with ACCESS_ONCE() access.

Different versions of the standard say different things here, and the
committee seems quite reluctant to tighten up volatile accesses. I
believe that the best approach is to say that the compiler is free to
order ACCESS_ONCE() with non-ACCESS_ONCE() code. How about the
following?

You should assume that the compiler can move ACCESS_ONCE()
past code not containing ACCESS_ONCE(), barrier(), or similar
primitives.

Thanx, Paul

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/