RE: "Verifying and Optimizing Compact NUMA-Aware Locks on Weak Memory Models"

From: Jonas Oberhauser
Date: Fri Sep 09 2022 - 07:46:01 EST

Next message: Manikandan Jagatheesan: "Performance Regression in Linux Kernel 5.19"
Previous message: xkernel . wang: "[PATCH v4] staging: r8188eu: fix potential memory leak in rtw_os_xmit_resource_alloc()"
Next in thread: Hernan Luis Ponce de Leon: "RE: "Verifying and Optimizing Compact NUMA-Aware Locks on Weak Memory Models""
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Hi Boqun, hi everyone,

> > > > - some babbling about a missing propagation -- ISTR Linux if stuffed
> > > > full of them, specifically we require stores to auto propagate
> > > > without help from barriers
> > >
> > > Not a missing propagation; a late one.
> > >
> > > Don't understand what you mean by "auto propagate without help
> > > from barriers".

after a quick look at the claim in question, it seems to be about a different kind of issue. The problem alluded to seems not to be that the store doesn't propagate in finite time to the other core. The tools used by the authors of that work already assume every (marked) access propagates given enough time.

What they mean seems to be that a prop relation followed only by wmb (not mb) doesn't enforce the order of some writes to the same location, leading to the claimed hang in qspinlock (at least as far as LKMM is concerned). Without having looked at the code or trying to understand the bug in more detail, this may be similar to the bug fixed by Ingo in qspinlock in 2018 where the initialization of the locked bit in the node could overwrite the setting of the locked bit in the other thread, unless a release barrier/wmb is added at some point.

That being said:

> Should we add something somewhere in our model, maybe in the explanation.txt?
> Plus, I think we cannot express this in Herd because Herd uses graph-based model (axiomatic model) instead of an operational model to describe the model: axiomatic model cannot describe "something will eventually happen". There was also some discussion in the zulip steam of Rust unsafe-code-guidelines.

It is possible to express this kind of progress property in a graph based model. Some work on this was done in the RISC-V manual, as well as for some stronger graph based models ("making weak memory models fair", https://dl.acm.org/doi/10.1145/3485475 ) and in a more practical way for verification of liveness ( https://dl.acm.org/doi/abs/10.1145/3445814.3446748 ).

>From a theoretical perspective, the suggestion made in "making weak memory models fair" would be to define prefix-finiteness of all the explicitly acyclic relations of LKMM, which includes the sc-per-loc relation and thus in particular fr. If fr is prefix-finite, meaning each store has only finitely many reads observing an older value, the infinite behaviour in the small example posted elsewhere in this thread becomes forbidden.

>From an engineering perspective, I think the only issue is that cat *currently* does not have any syntax for this, nor does herd currently implement the await model checking techniques proposed in those works (c.f. Theorem 5.3. in the "making weak memory models fair" paper, which says that for this kind of loop, iff the mo-maximal reads in some graph are read in a loop iteration that does not exit the loop, the loop can run forever). However GenMC and I believe also Dat3M and recently also Nidhugg support such techniques. It may not even be too much effort to implement something like this in herd if desired.

Best regards,
jonas

Next message: Manikandan Jagatheesan: "Performance Regression in Linux Kernel 5.19"
Previous message: xkernel . wang: "[PATCH v4] staging: r8188eu: fix potential memory leak in rtw_os_xmit_resource_alloc()"
Next in thread: Hernan Luis Ponce de Leon: "RE: "Verifying and Optimizing Compact NUMA-Aware Locks on Weak Memory Models""
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]