[PATCH 0/5] V3: Clarify/standardize memory barriers for ipc

From: Manfred Spraul
Date: Sun Oct 20 2019 - 08:33:37 EST


Hi,

Updated series, based on input from Davidlohr and Peter Zijlstra:

- I've dropped the documentation update for wake_q_add, as what it
states is normal: When you call a function and pass a parameter
to a structure, you as caller are responsible to ensure that the
parameter is valid, and remains valid for the duration of the
function call, including any tearing due to memory reordering.
In addition, I've switched ipc to wake_q_add_safe().

- The patch to Documentation/memory_barriers.txt now as first change.
@Davidlohr: You proposed to have 2 paragraphs: First, one for
add/subtract, then one for failed cmpxchg. I didn't like that:
We have one rule (can be combined with non-mb RMW ops), and then
examples what are non-mb RMW ops. Listing special cases just ask
for issues later.
What I don't know is if there should be examples at all in
Documentation/memory_barriers, or just
"See Documentation/atomic_t.txt for examples of RMW ops that
do not contain a memory barrier"

- For the memory barrier pairs in ipc/<whatever>, I have now added
/* See ABC_BARRIER for purpose/pairing */ as standard comment,
and then a block near the relevant structure where purpose, pairing
races, ... are explained. I think this makes it easier to read,
compared to adding it to both the _release and _acquire branches.

Description/purpose:

The memory barriers in ipc are not properly documented, and at least
for some architectures insufficient:
Reading the xyz->status is only a control barrier, thus
smp_acquire__after_ctrl_dep() was missing in mqueue.c and msg.c
sem.c contained a full smp_mb(), which is not required.

Patches:
Patch 1: Documentation for smp_mb__{before,after}_atomic().

Patch 2: Remove code duplication inside ipc/mqueue.c

Patch 3-5: Update the ipc code, especially add missing
smp_mb__after_ctrl_dep() and switch to wake_q_add_safe().

Clarify that smp_mb__{before,after}_atomic() are compatible with all
RMW atomic operations, not just the operations that do not return a value.

Open issues:
- More testing. I did some tests, but doubt that the tests would be
sufficient to show issues with regards to incorrect memory barriers.

What do you think?

--
Manfred