Re: [PATCH v2] kallsyms: Fix wrong "big" kernel symbol type read from procfs

From: Matthew Wilcox
Date: Fri Oct 11 2024 - 18:01:35 EST


On Fri, Oct 11, 2024 at 10:38:53PM +0800, Zheng Yejian wrote:
> The root cause is that, after commit 73bbb94466fd ("kallsyms: support
> "big" kernel symbols"), ULEB128 was used to encode symbol name length.
> That is, for "big" kernel symbols of which name length is longer than
> 0x7f characters, the length info is encoded into 2 bytes.

Technically, at least two. If we ever have a symbol larger than
16kB, we'll use three bytes.

> +++ b/kernel/kallsyms.c
> @@ -103,8 +103,11 @@ static char kallsyms_get_symbol_type(unsigned int off)
> {
> /*
> * Get just the first code, look it up in the token table,
> - * and return the first char from this token.
> + * and return the first char from this token. If MSB of length
> + * is 1, it is a "big" symbol, so needs an additional byte.
> */
> + if (kallsyms_names[off] & 0x80)
> + off++;

So this "if" should be a "while" for maximum future proofing against the
day that we have a 16kB function ...