Re: [PATCH v2 3/3] tracing/kprobes: Use APIs that matches symbols without .XXX suffix

From: Song Liu
Date: Wed Aug 07 2024 - 15:41:46 EST




> On Aug 7, 2024, at 3:08 AM, Masami Hiramatsu <mhiramat@xxxxxxxxxx> wrote:
>
> On Wed, 7 Aug 2024 00:19:20 +0000
> Song Liu <songliubraving@xxxxxxxx> wrote:
>
>>
>>
>>> On Aug 6, 2024, at 5:01 PM, Masami Hiramatsu <mhiramat@xxxxxxxxxx> wrote:
>>>
>>> On Tue, 6 Aug 2024 20:12:55 +0000
>>> Song Liu <songliubraving@xxxxxxxx> wrote:
>>>
>>>>
>>>>
>>>>> On Aug 6, 2024, at 1:01 PM, Steven Rostedt <rostedt@xxxxxxxxxxx> wrote:
>>>>>
>>>>> On Tue, 6 Aug 2024 16:00:49 -0400
>>>>> Steven Rostedt <rostedt@xxxxxxxxxxx> wrote:
>>>>>
>>>>>>>>> + if (IS_ENABLED(CONFIG_LTO_CLANG) && !addr)
>>>>>>>>> + addr = kallsyms_lookup_name_without_suffix(trace_kprobe_symbol(tk));
>>>>>>>>> +
>>>>>>>>
>>>>>>>> So you do the lookup twice if this is enabled?
>>>>>>>>
>>>>>>>> Why not just use "kallsyms_lookup_name_without_suffix()" the entire time,
>>>>>>>> and it should work just the same as "kallsyms_lookup_name()" if it's not
>>>>>>>> needed?
>>>>>>>
>>>>>>> We still want to give priority to full match. For example, we have:
>>>>>>>
>>>>>>> [root@~]# grep c_next /proc/kallsyms
>>>>>>> ffffffff81419dc0 t c_next.llvm.7567888411731313343
>>>>>>> ffffffff81680600 t c_next
>>>>>>> ffffffff81854380 t c_next.llvm.14337844803752139461
>>>>>>>
>>>>>>> If the goal is to explicitly trace c_next.llvm.7567888411731313343, the
>>>>>>> user can provide the full name. If we always match _without_suffix, all
>>>>>>> of the 3 will match to the first one.
>>>>>>>
>>>>>>> Does this make sense?
>>>>>>
>>>>>> Yes. Sorry, I missed the "&& !addr)" after the "IS_ENABLED()", which looked
>>>>>> like you did the command twice.
>>>>>
>>>>> But that said, does this only have to be for llvm? Or should we do this for
>>>>> even gcc? As I believe gcc can give strange symbols too.
>>>>
>>>> I think most of the issue comes with LTO, as LTO promotes local static
>>>> functions to global functions. IIUC, we don't have GCC built, LTO enabled
>>>> kernel yet.
>>>>
>>>> In my GCC built, we have suffixes like ".constprop.0", ".part.0", ".isra.0",
>>>> and ".isra.0.cold". We didn't do anything about these before this set. So I
>>>> think we are OK not handling them now. We sure can enable it for GCC built
>>>> kernel in the future.
>>>
>>> Hmm, I think it should be handled as it is. This means it should do as
>>> livepatch does. Since I expected user will check kallsyms if gets error,
>>> we should keep this as it is. (if a symbol has suffix, it should accept
>>> symbol with suffix, or user will get confused because they can not find
>>> which symbol is kprobed.)
>>>
>>> Sorry about the conclusion (so I NAK this), but this is a good discussion.
>>
>> Do you mean we do not want patch 3/3, but would like to keep 1/3 and part
>> of 2/3 (remove the _without_suffix APIs)? If this is the case, we are
>> undoing the change by Sami in [1], and thus may break some tracing tools.
>
> What tracing tools may be broke and why?
>
> For this suffix problem, I would like to add another patch to allow probing on
> suffixed symbols. (It seems suffixed symbols are not available at this point)
>
> The problem is that the suffixed symbols maybe a "part" of the original function,
> thus user has to carefully use it.

It appears there are multiple APIs that may need change. For example, on gcc
built kernel, /sys/kernel/tracing/available_filter_functions does not show
the suffix:

[root@(none)]# grep cmos_irq_enable /proc/kallsyms
ffffffff81db5470 t __pfx_cmos_irq_enable.constprop.0
ffffffff81db5480 t cmos_irq_enable.constprop.0
ffffffff822dec6e t cmos_irq_enable.constprop.0.cold

[root@(none)]# grep cmos_irq_enable /sys/kernel/tracing/available_filter_functions
cmos_irq_enable

perf-probe uses _text+<offset> for such cases:

[root@(none)]# cat /sys/kernel/tracing/kprobe_events
p:probe/cmos_irq_enable _text+14374016

I am not sure which APIs do we need to change here.

Thanks,
Song