Re: [PATCH] random: use correct memory barriers for crng_node_pool
From: Eric Biggers
Date: Tue Sep 22 2020 - 15:09:43 EST
On Tue, Sep 22, 2020 at 11:31:00AM -0700, Paul E. McKenney wrote:
> > Also, it's not just the p == &global_variable case. Consider:
> >
> > struct a { struct b *b; };
> > struct b { ... };
> >
> > Thread 1:
> >
> > /* one-time initialized data shared by all instances of b */
> > static struct c *c;
> >
> > void init_b(struct a *a)
> > {
> > if (!c)
> > c = alloc_c();
> >
> > smp_store_release(&a->b, kzalloc(sizeof(struct b)));
> > }
> >
> > Thread 2:
> >
> > void use_b_if_present(struct a *a)
> > {
> > struct b *b = READ_ONCE(a->b);
> >
> > if (b) {
> > c->... # crashes because c still appears to be NULL
> > }
> > }
> >
> >
> > So when the *first* "b" is allocated, the global data "c" is initialized. Then
> > when using a "b", we expect to be able to access "c". But there's no
> > data dependency from "b" to "c"; it's a control dependency only.
> > So smp_load_acquire() is needed, not READ_ONCE().
> >
> > And it can be an internal implementation detail of "b"'s subsystem whether it
> > happens to use global data "c".
>
> Given that "c" is static, these two subsystems must be in the same
> translation unit. So I don't see how this qualifies as being internal to
> "b"'s subsystem.
You're missing the point here. b and c could easily be allocated by a function
alloc_b() that's in another file.
> Besides which, control dependencies should be used only by LKMM experts
> at this point.
What does that even mean? Control dependencies are everywhere.
> > This sort of thing is why people objected to the READ_ONCE() optimization during
> > the discussion at
> > https://lkml.kernel.org/linux-fsdevel/20200717044427.68747-1-ebiggers@xxxxxxxxxx/T/#u.
> > Most kernel developers aren't experts in the LKMM, and they want something
> > that's guaranteed to be correct without having to to think really hard about it
> > and make assumptions about the internal implementation details of other
> > subsystems, how compilers have implemented the C standard, and so on.
>
> And smp_load_acquire()is provided for that reason. Its name was
> even based on the nomenclature used in the C standard and elsewhere.
> And again, control dependencies are for LKMM experts, as they are very
> tricky to get right.
How does a developer know that the code they're calling in another subsystem
wasn't written by one of these "experts" and therefore has a control dependency?
>
> But in the LKMM documentation, you are likely to find LKMM experts who
> want to optimize all the way, particularly in cases like the one-time
> init pattern where all the data is often local. And the best basis for
> READ_ONCE() in one-time init is not a control dependency, but rather
> ordering of accesses to a single variable from a single task combined
> with locking, both of which are quite robust and much easier to use,
> especially in comparison to control dependencies.
>
> My goal for LKMM is not that each and every developer have a full
> understanding of every nook and cranny of that model, but instead that
> people can find the primitives supporting the desired point in the
> performance/simplicity tradoff space. And yes, I have more writing
> to do to make more progress towards that goal.
So are you saying people should use smp_load_acquire(), or are you saying people
should use READ_ONCE()?
- Eric