Re: [PATCH V1] mm, slub: avoid zeroing kmalloc redzone

From: Vlastimil Babka
Date: Wed Aug 28 2024 - 12:53:22 EST


On 8/25/24 15:05, Feng Tang wrote:
> On Fri, Aug 23, 2024 at 02:24:15PM +0800, Peng Fan (OSS) wrote:
>> From: Peng Fan <peng.fan@xxxxxxx>
>>
>> With commit 946fa0dbf2d8
>> ("mm/slub: extend redzone check to extra allocated kmalloc space than requested"),
>> setting orig_size treats the wasted space (object_size - orig_size) as
>> redzones. But (in check_object()) when orig_size is set to zero, the entire
>> object is perceived as a redzone. To a valid allocated kmalloc space,
>> when init_on_free=1, the wasted space and the orig_size should
>> not be cleared to 0, otherwise there will be kernel dump:

Could we make it more clear?

Since commit 946fa0dbf2d8 ("mm/slub: extend redzone check to extra allocated
kmalloc space than requested"), setting orig_size treats the wasted space
(object_size - orig_size) as a redzone. However with init_on_free=1 we clear
the full object->size, including the redzone.

Additionally we clear the object metadata, including the stored orig_size,
making it zero, which makes check_object() treat the whole object as a
redzone.

These issues lead to the following BUG report with "slub_debug=FUZ
init_on_free=1":

>>
>> [ 0.000000] =============================================================================
>> [ 0.000000] BUG kmalloc-8 (Not tainted): kmalloc Redzone overwritten
>> [ 0.000000] -----------------------------------------------------------------------------
>> [ 0.000000]
>> [ 0.000000] 0xffff000010032858-0xffff00001003285f @offset=2136. First byte 0x0 instead of 0xcc
>> [ 0.000000] FIX kmalloc-8: Restoring kmalloc Redzone 0xffff000010032858-0xffff00001003285f=0xcc
>> [ 0.000000] Slab 0xfffffdffc0400c80 objects=36 used=23 fp=0xffff000010032a18 flags=0x3fffe0000000200(workingset|node=0|zone=0|lastcpupid=0x1ffff)
>> [ 0.000000] Object 0xffff000010032858 @offset=2136 fp=0xffff0000100328c8
>> [ 0.000000]
>> [ 0.000000] Redzone ffff000010032850: cc cc cc cc cc cc cc cc ........
>> [ 0.000000] Object ffff000010032858: cc cc cc cc cc cc cc cc ........
>> [ 0.000000] Redzone ffff000010032860: cc cc cc cc cc cc cc cc ........
>> [ 0.000000] Padding ffff0000100328b4: 00 00 00 00 00 00 00 00 00 00 00 00 ............
>> [ 0.000000] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.11.0-rc3-next-20240814-00004-g61844c55c3f4 #144
>> [ 0.000000] Hardware name: NXP i.MX95 19X19 board (DT)
>> [ 0.000000] Call trace:
>> [ 0.000000] dump_backtrace+0x90/0xe8
>> [ 0.000000] show_stack+0x18/0x24
>> [ 0.000000] dump_stack_lvl+0x74/0x8c
>> [ 0.000000] dump_stack+0x18/0x24
>> [ 0.000000] print_trailer+0x150/0x218
>> [ 0.000000] check_object+0xe4/0x454
>> [ 0.000000] free_to_partial_list+0x2f8/0x5ec
>>
>> To address the issue, use orig_size to clear the used area, and restore
>> the value of orig_size after clearing the remaining area.
>>
>> When CONFIG_SLUB_DEBUG is not defined, get_orig_size() directly returns
>> s->object_size. So when using memset to init the area, the size can simply
>> be orig_size, which equals object_size in that configuration. And
>> orig_size can never be bigger than object_size.
>>
>> Fixes: 946fa0dbf2d8 ("mm/slub: extend redzone check to extra allocated kmalloc space than requested")
>
> Thanks for the fix! I missed testing the 'init_on_free' case back then.
>
> Reviewed-by: Feng Tang <feng.tang@xxxxxxxxx>

By the way Feng, have you noticed the other issue we have, with krealloc()
and redzoning? Want to look at that? Thanks.

https://lore.kernel.org/all/44fa564b-9c8f-4ac2-bce3-f6d2c99b73b7@xxxxxxx/


> with one small nit below
>
>> Signed-off-by: Peng Fan <peng.fan@xxxxxxx>
>> ---
>>
>> RFC->V1:
>> Update commit log (Per Hyeonggon)
>> Use orig_size to do memset(Per Hyeonggon)
>> Add get_orig_size and set_orig_size when CONFIG_SLUB_DEBUG not enabled(kernel test robot)
>> https://lore.kernel.org/all/20240819064115.385086-1-peng.fan@xxxxxxxxxxx/
>>
>> mm/slub.c | 20 ++++++++++++++++++--
>> 1 file changed, 18 insertions(+), 2 deletions(-)
>>
>> diff --git a/mm/slub.c b/mm/slub.c
>> index 94f5a4143825..a5fbeb2835b1 100644
>> --- a/mm/slub.c
>> +++ b/mm/slub.c
>> @@ -1895,6 +1895,15 @@ static inline void inc_slabs_node(struct kmem_cache *s, int node,
>> static inline void dec_slabs_node(struct kmem_cache *s, int node,
>> int objects) {}
>>
>> +static inline unsigned int get_orig_size(struct kmem_cache *s, void *object)
>> +{
>> +	return s->object_size;
>> +}
>> +
>> +static inline void set_orig_size(struct kmem_cache *s, void *object,
>> +				 unsigned int orig_size)
>> +{}
>
> Current get_orig_size() and set_orig_size() are protected by
> CONFIG_SLUB_DEBUG=y macro, and with this patch, they will be called
> in both ON and OFF cases. Maybe we can just lift those existing
> functions out of the "#ifdef CONFIG_SLUB_DEBUG" protection?

I agree. As for the "code is larger for !CONFIG_SLUB_DEBUG" concern, I think
the compiler will eliminate the code anyway, because of the

	if (!slub_debug_orig_size(s))
		return;

and slub_debug_orig_size() calls kmem_cache_debug_flags(), which is
compile-time false for !CONFIG_SLUB_DEBUG.

> Thanks,
> Feng