Re: [PATCH] random: use correct memory barriers for crng_node_pool

22 Sep 2020


      On Tue, Sep 22, 2020 at 11:31:00AM -0700, Paul E. McKenney wrote:
...
...
Also, it's not just the p == &global_variable case.  Consider:
struct a { struct b *b; };
struct b { ... };
Thread 1:
/* one-time initialized data shared by all instances of b */
   static struct c *c;
void init_b(struct a *a)
   {
   	if (!c)
   		c = alloc_c();
smp_store_release(&a->b, kzalloc(sizeof(struct b)));

}
Thread 2:
void use_b_if_present(struct a *a)
   {
   	struct b *b = READ_ONCE(a->b);
if (b) {
	c->... # crashes because c still appears to be NULL
}

}
So when the *first* "b" is allocated, the global data "c" is initialized.  Then
when using a "b", we expect to be able to access "c".  But there's no
data dependency from "b" to "c"; it's a control dependency only.
So smp_load_acquire() is needed, not READ_ONCE().
And it can be an internal implementation detail of "b"'s subsystem whether it
happens to use global data "c".
Given that "c" is static, these two subsystems must be in the same
translation unit.  So I don't see how this qualifies as being internal to
"b"'s subsystem.
You're missing the point here.  b and c could easily be allocated by a function
alloc_b() that's in another file.
...
Besides which, control dependencies should be used only by LKMM experts
at this point.
What does that even mean?  Control dependencies are everywhere.
...
...
This sort of thing is why people objected to the READ_ONCE() optimization during
the discussion at
https://lkml.kernel.org/linux-fsdevel/20200717044427.68747-1-ebiggers@kernel....
Most kernel developers aren't experts in the LKMM, and they want something
that's guaranteed to be correct without having to to think really hard about it
and make assumptions about the internal implementation details of other
subsystems, how compilers have implemented the C standard, and so on.
And smp_load_acquire()is provided for that reason.  Its name was
even based on the nomenclature used in the C standard and elsewhere.
And again, control dependencies are for LKMM experts, as they are very
tricky to get right.
How does a developer know that the code they're calling in another subsystem
wasn't written by one of these "experts" and therefore has a control dependency?
...
But in the LKMM documentation, you are likely to find LKMM experts who
want to optimize all the way, particularly in cases like the one-time
init pattern where all the data is often local.  And the best basis for
READ_ONCE() in one-time init is not a control dependency, but rather
ordering of accesses to a single variable from a single task combined
with locking, both of which are quite robust and much easier to use,
especially in comparison to control dependencies.
My goal for LKMM is not that each and every developer have a full
understanding of every nook and cranny of that model, but instead that
people can find the primitives supporting the desired point in the
performance/simplicity tradoff space.  And yes, I have more writing
to do to make more progress towards that goal.
So are you saying people should use smp_load_acquire(), or are you saying people
should use READ_ONCE()?
- Eric

2025

2024

2023

2022

2021

2020

2019

2018

2017

Re: [PATCH] random: use correct memory barriers for crng_node_pool