Re: kobject_init_and_add is easy to misuse
From: James Bottomley
Date: Tue Jun 02 2020 - 11:25:20 EST
On Tue, 2020-06-02 at 05:10 -0700, Matthew Wilcox wrote:
> On Tue, Jun 02, 2020 at 07:50:33PM +0800, Wang Hai wrote:
> > syzkaller reports for memory leak when kobject_init_and_add()
> > returns an error in the function sysfs_slab_add() [1]
> >
> > When this happened, the function kobject_put() is not called for
> > the
> > corresponding kobject, which potentially leads to memory leak.
> >
> > This patch fixes the issue by calling kobject_put() even if
> > kobject_init_and_add() fails.
>
> I think this speaks to a deeper problem with kobject_init_and_add()
> -- the need to call kobject_put() if it fails is not readily apparent
> to most users. This same bug appears in the first three users of
> kobject_init_and_add() that I checked --
> arch/ia64/kernel/topology.c
> drivers/firmware/dmi-sysfs.c
> drivers/firmware/efi/esrt.c
> drivers/scsi/iscsi_boot_sysfs.c
>
> Some do get it right --
> arch/powerpc/kernel/cacheinfo.c
> drivers/gpu/drm/ttm/ttm_bo.c
> drivers/gpu/drm/ttm/ttm_memory.c
> drivers/infiniband/hw/mlx4/sysfs.c
>
> I'd argue that the current behaviour is wrong,
Absolutely agree with this. We have a big meta pattern here where we
introduce functions with tortuous semantics then someone creates a
checker for the semantics and misuses come crawling out of the woodwork
leading to floods of patches, usually for little or never used error
paths, which really don't buy anything apart from theoretical
correctness. Just insisting on simple semantics would have avoided
this.
> that kobject_init_and_add() should call kobject_put() if the add
> fails. This would need a tree-wide audit. But somebody needs to do
> that anyway because based on my random sampling, half of the users
> currently get it wrong.
Well, the semantics of kobject_init() are free on fail, so these are
the ones everyone seems to be using. The semantics of kobject_add are
put on fail. The problem is that put on fail isn't necessarily correct
in the kobject_init() case: the release function may make assumptions
about the object hierarchy which aren't satisfied in the kobject_init()
failure case. This argues that kobject_init_and_add() can't ever have
correct semantics and we should eliminate it.
James