Re: [next] i386: kunit: ASSERTION FAILED at mm/kfence/kfence_test.c:547

From: Petr Mladek
Date: Mon May 02 2022 - 04:30:46 EST


On Mon 2022-05-02 09:51:46, Marco Elver wrote:
> On Sat, Apr 30, 2022 at 11:14PM +0206, John Ogness wrote:
> [...]
> > I am not familiar with how this works. Is the tracepoint getting set on
> > call_console_drivers()? Or on call_console_driver()?
>
> It's at the start of call_console_drivers(). See trace_console_rcuidle()
> call.
>
> > If so, there are a couple problems with that. First off, the prototype
> > for that function has changed. Second, that function is called when text
> > is printed, but this is not when the text was created. With the
> > kthreads, the printing can be significantly delayed.
> >
> > Since printk() is now lockless and console printing is delayed, it
> > becomes a bit tricky to parse the records in the existing code using a
> > tracepoint.
> >
> > I wonder if creating a NOP function for the kfence probe to attach to
> > would be more appropriate. In printk_sprint() we get the text after
> > space has been reserved, but before the text is committed to the
> > ringbuffer. This is guaranteed to be called from within the printk()
> > context.
>
> I think we just need to fix the existing tracepoint, since it has
> changed its semantics vs. what it was intended to be:
>
> | commit 95100358491abaa2e9a5483811370059bbca4645
> | Author: Johannes Berg <johannes.berg@xxxxxxxxx>
> | Date: Thu Nov 24 20:03:08 2011 +0100
> |
> | printk/tracing: Add console output tracing
> |
> | Add a printk.console trace point to record any printk
> | messages into the trace, regardless of the current
> | console loglevel. This can help correlate (existing)
> | printk debugging with other tracing.
> |
> | Link: http://lkml.kernel.org/r/1322161388.5366.54.camel@xxxxxxxxxxxxxxxxxxxxx
> |
> | Acked-by: Frederic Weisbecker <fweisbec@xxxxxxxxx>
> | Cc: Christoph Hellwig <hch@xxxxxxxxxxxxx>
> | Cc: Ingo Molnar <mingo@xxxxxxxxxx>
> | Acked-by: Peter Zijlstra <a.p.zijlstra@xxxxxxxxx>
> | Acked-by: Thomas Gleixner <tglx@xxxxxxxxxxxxx>
> | Signed-off-by: Johannes Berg <johannes.berg@xxxxxxxxx>
> | Signed-off-by: Steven Rostedt <rostedt@xxxxxxxxxxx>
>
> Specifically using it to "correlate (existing) printk debugging with
> other tracing" is now broken.

It is pity that it was done in the console code in the first place.

printk() always used console_trylock() and the message was flushed
to the console only when the trylock succeeded. And it was always
deferred in NMI or when printed via printk_deferred().

> My proposal would be to fix the tracepoint like so:
>
> | --- a/kernel/printk/printk.c
> | +++ b/kernel/printk/printk.c
> | @@ -2002,8 +2002,6 @@ static void call_console_driver(struct console *con, const char *text, size_t le
> | {
> | size_t dropped_len;
> |
> | - trace_console_rcuidle(text, len);
> | -
> | if (con->dropped && dropped_text) {
> | dropped_len = snprintf(dropped_text, DROPPED_TEXT_MAX,
> | "** %lu printk messages dropped **\n",
> | @@ -2178,6 +2176,8 @@ static u16 printk_sprint(char *text, u16 size, int facility,
> | }
> | }
> |
> | + trace_console_rcuidle(text, text_len);
> | +
> | return text_len;
> | }
>
> This fixes the KFENCE and KCSAN tests.
>
> Unless I hear objections, I'll prepare a patch explaining why we need to
> fix the tracepoint.

It makes perfect sense to me.

Note that there is one difference. The original tracepoint stored the
string with the console header (loglevel + timestamp), see
info_print_prefix(). The new code will store "only" the plain message.

But the new code should be more user friendly. The printk() message
will be sorted with the other tracing messages out of box. Users will
not longer need to use the printk timestamp to find when it was
actually printed.

Best Regards,
Petr