Re: [RFC PATCH] tools/memory-model: Remove (dep ; rfi) from ppo

From: Paul E. McKenney
Date: Wed Mar 06 2019 - 12:24:56 EST


On Thu, Mar 07, 2019 at 12:46:05AM +0900, Akira Yokosawa wrote:
> On Tue, 26 Feb 2019 16:04:50 +0100, Peter Zijlstra wrote:
> > On Tue, Feb 26, 2019 at 06:28:45AM -0800, Paul E. McKenney wrote:
> >
> >> Yes, this all is a bit on the insane side from a kernel viewpoint.
> >> But the paper you found does not impose this; it has instead been there
> >> for about 20 years, back before C and C++ admitted to the existence
> >> of concurrency. But of course compilers are getting more aggressive,
> >> and yes, some of the problems show up in single-threaded code.
> >
> > But that paper is from last year!! It has Peter Sewell on, I'm sure he's
> > heard of concurrency.
> >
> >> The usual response is "then cast the pointers to intptr_t!" but of
> >> course that breaks type checking.
> >
> > I tried laundering the pointer through intptr_t, but I can't seem to
> > unbreak it.
> >
> >
> > root@ivb-ep:~/tmp# gcc-8 -O2 -fno-strict-aliasing -o ptr ptr.c ; ./ptr
> > p=0x55aacdc80034 q=0x55aacdc80034
> > x=1 y=2 *p=11 *q=2
> > root@ivb-ep:~/tmp# cat ptr.c
> > #include <stdio.h>
> > #include <string.h>
> > #include <stdint.h>
> > int y = 2, x = 1;
> > int main (int argc, char **argv) {
> > intptr_t P = (intptr_t)&x;
> > intptr_t Q = (intptr_t)&y;
> > P += sizeof(int);
> > int *q = &y;
> > printf("p=%p q=%p\n", (int*)P, (int*)Q);
> > if (P == Q) {
> > int *p = (int *)P;
> > *p = 11;
> > printf("x=%d y=%d *p=%d *q=%d\n", x, y, *p, *q);
> > }
> > }
> >
>
> So, I'm looking at the macro RELOC_HIDE() defined in include/linux/compiler-gcc.h.
>
> It says:
>
> --------
> /*
> * This macro obfuscates arithmetic on a variable address so that gcc
> * shouldn't recognize the original var, and make assumptions about it.
> *
> * This is needed because the C standard makes it undefined to do
> * pointer arithmetic on "objects" outside their boundaries and the
> * gcc optimizers assume this is the case. In particular they
> * assume such arithmetic does not wrap.
> *
> [...]
> */
> #define RELOC_HIDE(ptr, off) \
> ({ \
> unsigned long __ptr; \
> __asm__ ("" : "=r"(__ptr) : "0"(ptr)); \
> (typeof(ptr)) (__ptr + (off)); \
> })
> --------
>
> Looks like this macro has existed ever since the origin of Linus' git repo.
>
> And the optimization "bug" discussed in this thread can be suppressed by
> this macro.
>
> For example,
>
> $ gcc -O2 -o reloc_hide reloc_hide.c; ./reloc_hide
> x=1 y=11 *p=11 *q=11
> $ cat reloc_hide.c
> #include <stdio.h>
> #include <stdint.h>
>
> #define RELOC_HIDE(ptr, off) \
> ({ \
> uintptr_t __ptr; \
> __asm__ ("" : "=r"(__ptr) : "0"(ptr)); \
> (typeof(ptr)) (__ptr + (off)); \
> })
>
> int y = 2, x = 1;
> int main (int argc, char **argv) {
> int *p = RELOC_HIDE(&x, sizeof(*p));
> int *q = RELOC_HIDE(&y, 0);
> if (p == q) {
> *p = 11;
> printf("x=%d y=%d *p=%d *q=%d\n", x, y, *p, *q);
> }
> }
>
> Note that "uintptr_t" is used in this version of RELOC_HIDE() for user-land
> code.
>
> Am I the only one who was not aware of this gcc-specific macro?

I have seen it before, but had forgotten it. ;-)

But people on the committee seem to agree that inline assembly should
"launder" pointers, along with atomic and volatile accesses. The case
of revalidating pointers fetched during a previous critical section for
a given lock is very much in play, but then again, we don't have any
known good use cases identified.

Thanx, Paul