Re: [PATCH] tracepoint: Do not warn on EEXIST or ENOENT
From: Steven Rostedt
Date: Sat Jun 26 2021 - 11:43:04 EST
On Sun, 27 Jun 2021 00:13:17 +0900
Tetsuo Handa <penguin-kernel@xxxxxxxxxxxxxxxxxxx> wrote:
> On 2021/06/26 23:18, Steven Rostedt wrote:
> > On Sat, 26 Jun 2021 22:58:45 +0900
> > Tetsuo Handa <penguin-kernel@xxxxxxxxxxxxxxxxxxx> wrote:
> >
> >> syzbot is hitting WARN_ON_ONCE() at tracepoint_add_func() [1], but
> >> func_add() returning -EEXIST and func_remove() returning -ENOENT are
> >> not kernel bugs that can justify crashing the system.
> >
> > There should be no path that registers a tracepoint twice. That's a bug
> > in the kernel. Looking at the link below, I see the backtrace:
> >
> > Call Trace:
> > tracepoint_probe_register_prio kernel/tracepoint.c:369 [inline]
> > tracepoint_probe_register+0x9c/0xe0 kernel/tracepoint.c:389
> > __bpf_probe_register kernel/trace/bpf_trace.c:2154 [inline]
> > bpf_probe_register+0x15a/0x1c0 kernel/trace/bpf_trace.c:2159
> > bpf_raw_tracepoint_open+0x34a/0x720 kernel/bpf/syscall.c:2878
> > __do_sys_bpf+0x2586/0x4f40 kernel/bpf/syscall.c:4435
> > do_syscall_64+0x3a/0xb0 arch/x86/entry/common.c:47
> >
> > So BPF is allowing the user to register the same tracepoint more than
> > once? That looks to be a bug in the BPF code where it shouldn't be
> > allowing user space to register the same tracepoint multiple times.
>
> I didn't catch your question.
>
> (1) func_add() can reject an attempt to add same tracepoint multiple times
> by returning -EINVAL to the caller.
> (2) But tracepoint_add_func() (the caller of func_add()) is calling WARN_ON_ONCE()
> if func_add() returned -EINVAL.
That's because (before BPF) there's no place in the kernel that tries
to register the same tracepoint multiple times, and was considered a
bug if it happened, because there's no ref counters to deal with adding
them multiple times.
If the tracepoint is already registered (with the given function and
data), then something likely went wrong.
> (3) And tracepoint_add_func() is triggerable via request from userspace.
Only via BPF correct?
I'm not sure how it works, but can't BPF catch that it is registering
the same tracepoint again?
We could add this patch, but then we need to add the WARN_ON_ONCE() to
all the other callers, because for the other callers it's a bug if the
tracepoint was registered twice with the same callback and data.
Or we can add another interface that wont warn, and BPF can use that.
> (4) tracepoint_probe_register_prio() serializes tracepoint_add_func() call
> triggered by concurrent request from userspace using tracepoints_mutex mutex.
You keep saying user space. Is it a BPF program?
> (5) But tracepoint_add_func() does not check whether same tracepoint multiple
> is already registered before calling func_add().
Because it's considered a bug in the kernel if that is the case.
> (6) As a result, tracepoint_add_func() receives -EINVAL from func_add(), and
> calls WARN_ON_ONCE() and the system crashes due to panic_on_warn == 1.
>
> Why this is a bug in the BPF code? The BPF code is not allowing userspace to
> register the same tracepoint multiple times. I think that tracepoint_add_func()
Then how is the same tracepoint being registered multiple times?
You keep saying "user space" but the only way user space is doing this
is through BPF. Or am I missing something?
> is stupid enough to crash the kernel instead of rejecting when an attempt to
> register the same tracepoint multiple times is made.
Because its a bug in the kernel, and WARN_ON_ONCE() is what is used
when you detect something that is considered a bug in the kernel. If
you don't want warnings to crash the kernel, you don't add
"panic_on_warning".
If BPF is expected to register the same tracepoint with the same
callback and data more than once, then let's add a call to do that
without warning. Like I said, other callers expect the call to succeed
unless it's out of memory, which tends to cause other problems.
FYI, this warning has caught bugs in my own code, that triggered my
tests to fail, and had me go fix that bug before pushing it further.
And my tests fail only on a full warning.
-- Steve