Re: NULL pointer deref when running BPF monitor program (6.11.0-rc1)

From: Jiri Olsa
Date: Wed Oct 09 2024 - 16:44:32 EST


On Wed, Oct 02, 2024 at 06:30:30PM +0200, Jiri Olsa wrote:
> On Tue, Aug 20, 2024 at 11:05:07AM -0400, Steven Rostedt wrote:
> > On Tue, 20 Aug 2024 12:17:31 +0200
> > Jiri Olsa <olsajiri@xxxxxxxxx> wrote:
> >
> > > > Could it be possible that the verifier could add to the exception table for
> > > > all accesses to tracepoint arguments? Then if there's a NULL pointer
> > > > dereference, the kernel will not crash but the exception can be sent to the
> > > > user space process instead? That is, it sends SIGSEGV to the task accessing
> > > > NULL when it shouldn't.
> > >
> > > hm, but that would mean a random process that happened to trigger
> > > the tracepoint would segfault, right? I don't think we can do that
> >
> > Better than a kernel crash, isn't it? I thought the guarantee of BPF was
> > not to ever crash the kernel. Crashing user space may be bad, but not
> > always fatal, and something that can be fixed by fixing the BPF program that
> > was loaded.
> >
> > >
> > > it seems better to teach verifier which tracepoint arguments can be NULL
> > > and deny loading of a bpf program that does not check such an argument properly
> >
> > These are not mutually exclusive. I think you want both. Adding annotation
> > is going to be a whack-a-mole game as new tracepoints will always be
> > created with new possibly NULL parameters and even old tracepoints can add
> > that too. There's nothing to stop that.
> >
> > The exception table logic will prevent any missed checks from causing a
> > kernel crash, and your annotations will keep user space from crashing.
> >
> > -- Steve
>
> sorry for the delay.. reviving this after plumbers and other stuff that got in the way
>
> Steven,
> we were discussing this at plumbers and you had an idea on doing this
> automatically through objtool.. IIRC you meant tracking instructions
> that carry argument pointers for NULL checks
>
> AFAICS we'd need to do roughly:
>   - for each tracepoint we'd need to interpret one of the functions
>     where the TP_fast_assign macro gets expanded:
>       perf_trace_##call
>       trace_custom_event_raw_event_##call
>       trace_event_raw_event_##call
>   - we can't tell at this point which argument is a kernel object,
>     so we'd need to check all arguments (assuming we can get their count)
>   - store argument info (if it has a NULL check) into some ELF tables and
>     use those later in the bpf verifier
>   - it's all arch specific
>
> on first look it seems hard and fragile (given it's arch specific)
> but I might easily be wrong about the above.. do you have an idea on how
> this could work?

Hi Josh,
we'd like to have information on which of a tracepoint's arguments can be NULL

Steven had an idea that objtool could help with that by doing something like
what's described above.. would you have any thoughts on that?
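
just to make the "argument info" part concrete, below is a rough userspace
sketch of what such a per-tracepoint table and its lookup might look like.
Everything in it is an assumption for illustration: the struct, the
tp_arg_may_be_null() helper, the table contents and the example_tp_*
names are made up, and none of it is existing objtool or verifier code.

```c
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical record, one per tracepoint: bit i of null_mask is set
 * when argument i was seen being NULL-checked, i.e. it may be NULL.
 */
struct tp_nullable_args {
	const char *tp_name;
	unsigned int nargs;
	unsigned long null_mask;
};

/* Made-up sample data, not taken from any real tracepoint. */
static const struct tp_nullable_args tp_table[] = {
	{ "example_tp_a", 3, 0x2 },	/* argument 1 may be NULL */
	{ "example_tp_b", 2, 0x0 },	/* no argument was NULL-checked */
};

/*
 * Returns 1 if the argument may be NULL, 0 if no NULL check was seen,
 * and -1 if the tracepoint is not in the table or arg is out of range.
 */
static int tp_arg_may_be_null(const char *tp_name, unsigned int arg)
{
	size_t i;

	for (i = 0; i < sizeof(tp_table) / sizeof(tp_table[0]); i++) {
		const struct tp_nullable_args *e = &tp_table[i];

		if (strcmp(e->tp_name, tp_name))
			continue;
		if (arg >= e->nargs)
			return -1;
		return (int)((e->null_mask >> arg) & 1);
	}
	return -1;
}
```

a consumer would presumably have to treat the -1 (unknown) case the same
as "may be NULL" to stay on the safe side.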

thanks,
jirka