Re: [PATCH 00/14] alpha: cleanups for 6.10

From: Paul E. McKenney
Date: Wed May 29 2024 - 18:09:23 EST


On Wed, May 29, 2024 at 07:50:28PM +0100, Maciej W. Rozycki wrote:
> On Tue, 28 May 2024, Paul E. McKenney wrote:
>
> > > > > This topic came up again when Paul E. McKenney noticed that
> > > > > parts of the RCU code already rely on byte access and do not
> > > > > work on alpha EV5 reliably, so I refreshed my series now for
> > > > > inclusion into the next merge window.
> > > >
> > > > Hrrrm? That sounds like like Paul ran tests on EV5, did he?
> > >
> > > What exactly is required to make it work?
> >
> > Whatever changes are needed to prevent the data corruption that can
> > currently result in code generated by single-byte stores. For but one
> > example, consider a pair of tasks (or one task and an interrupt handler
> > in the CONFIG_SMP=n case) do a single-byte store to a pair of bytes
> > in the same machine word. As I understand it, in code generated for
> > older Alphas, both "stores" will load the word containing that byte,
> > update their own byte, and store the updated word.
> >
> > If two such single-byte stores run concurrently, one or the other of those
> > two stores will be lost, as in overwritten by the other. This is a bug,
> > even in kernels built for single-CPU systems. And a rare bug at that, one
> > that tends to disappear as you add debug code in an attempt to find it.
>
> Thank you for the detailed description of the problematic scenario.
>
> I hope someone will find it useful, however for the record I have been
> familiar with the intricacies of the Alpha architecture as well as their
> implications for software for decades now. The Alpha port of Linux was
> the first non-x86 Linux platform I have used and actually (and I've chased
> that as a matter of interest) my first ever contribution to Linux was for
> Alpha platform code:
>
> On Mon, 30 Mar 1998, Jay.Estabrook@xxxxxxxxxxx wrote:
>
> > Hi, sorry about the delay in answering, but you'll be happy to know, I took
> > your patches and merged them into my latest SMP patches, and submitted them
> > to Linus just last night. He promises them to (mostly) be in 2.1.92, so we
> > can look forward to that... :-)
>
> so I find the scenario you have described more than obvious.

Glad that it helped.

> Mind that the read-modify-write sequence that software does for sub-word
> write accesses with original Alpha hardware is precisely what hardware
> would have to do anyway and support for that was deliberately omitted by
> the architecture designers from the ISA to give it performance advantages
> quoted in the architecture manual. The only difference here is that with
> hardware read-modify-write operations atomicity for sub-word accesses is
> guaranteed by the ISA, however for software read-modify-write it has to be
> explictly coded using the usual load-locked/store-conditional sequence in
> a loop. I don't think it's a big deal really, it should be trivial to do
> in the relevant accessors, along with the memory barriers that are needed
> anyway for EV56+ and possibly other ports such as the MIPS one.

There shouldn't be any memory barriers required, and don't EV56+ have
single-byte loads and stores?

> What I have been after actually is: can you point me at a piece of code
> in our tree that will cause an issue with a non-BWX Alpha as described in
> your scenario, so that I have a starting point? Mind that I'm completely
> new to RCU as I didn't have a need to research it before (though from a
> skim over Documentation/RCU/rcu.rst I understand what its objective is).

See the uses of the fields of the current->rcu_read_unlock_special.b
anonymous structure for the example that led us here. And who knows how
many other pieces of the Linux kernel that assume that it is possible
to atomically store a single byte.

Many of which use a normal C-language store, in which case there are
no accessors. This can be a problem even in the case where there are no
data races to either byte, because the need for the read-modify-write
sequence on older Alpha systems results in implicit data races at the
machine-word level.

> FWIW even if it was only me I think that depriving the already thin Alpha
> port developer base of any quantity of the remaining manpower, by dropping
> support for a subset of the hardware available, and then a subset that is
> not just as exotic as the original i386 became to the x86 platform at the
> time support for it was dropped, is only going to lead to further demise
> and eventual drop of the entire port.

Yes, support has been dropped for some of the older x86 CPUs as well,
for example, Linux-kernel support for multiprocessor 80386 systems was
dropped a great many years ago, in part because those CPUs do not have
a cmpxchg instruction. So it is not like we are picking on Alpha.

> And I think it would be good if we kept the port, just as we keep other
> ports of historical significance only, for educational reasons if nothing
> else, such as to let people understand based on an actual example, once
> mainstream, the implications of weakly ordered memory systems.

I don't know of any remaining issues with the newer Alpha systems that do
support single-byte and double-byte load and store instructions, and so
I am not aware of any reason for dropping Linux-kernel support for them.

Thanx, Paul