Re: [RFC] rust: types: Add read_once and write_once

From: Boqun Feng
Date: Thu Oct 26 2023 - 10:22:05 EST


On Thu, Oct 26, 2023 at 01:16:25PM +0200, Peter Zijlstra wrote:
> On Thu, Oct 26, 2023 at 11:36:10AM +0100, Gary Guo wrote:
>
> > There's two reasons that we are using volatile read/write as opposed to
> > relaxed atomic:
> > * Rust lacks volatile atomics at the moment. Non-volatile atomics are
> > not sufficient because the compiler is allowed (although they
> > currently don't) optimise atomics. If you have two adjacent relaxed
> > loads, they could be merged into one.
>
> Ah yes, that would be problematic, eg, if lifted out of a loop things
> could go sideways fast.
>

Maybe we can workaround this limitation by using compiler barriers, i.e.

compiler_fence(SeqCst);
load(Relaxed);
compiler_fence(Acquire);

this is slightly stronger than a volatile atomic.

> > * Atomics only works for integer types determined by the platform. On
> > some 32-bit platforms you wouldn't be able to use 64-bit atomics at
> > all, and on x86 you get less optimal sequence since volatile load is
> > permitted to tear while atomic load needs to use LOCK CMPXCHG8B.
>
> We only grudgingly allowed u64 READ_ONCE() on 32bit platforms because
> the fallout was too numerous to fix. Some of them are probably bugs.
>
> Also, I think cmpxchg8b without lock prefix would be sufficient, but
> I've got too much of a head-ache to be sure. Worse is that we still
> support targets without cmpxchg8b.
>
> It might be interesting to make the Rust side more strict in this regard
> and see where/when we run into trouble.
>

Sounds good to me. If the compiler barriers make sense for now, then
we can do:

pub unsafe fn read_once_usize(ptr: *const usize) -> usize {
core::sync::atomic::compiler_fence(SeqCst);
let r = unsafe { *ptr.cast::<AtomicUsize>() }.load(Relaxed);
core::sync::atomic::compiler_fence(Acquire);
r
}

and if the other side (i.e. write) is also atomic (e.g. WRITE_ONCE()),
we don't have data race.

However, there are still cases where data races are ignored in C code,
for example inode::i_state: reads out of locks race with writes inside
locks, since writes are done by plain accesses. Nothing can be done to
fix that from Rust side only, and fixing the C side is a separate topic.

Thoughts?

Regards,
Boqun

> > * Atomics doesn't work for complex structs. Although I am not quite sure
> > of the value of supporting it.
>
> So on the C side we mandate the size is no larger than machine word,
> with the exception of the u64 on 32bit thing. We don't mandate strict
> integer types because things like pte_t are wrapper types.
>
>