Re: [PATCH] fs/exfat: resolve memory leak from exfat_create_upcase_table()
From: Al Viro
Date: Sun Sep 15 2024 - 03:23:50 EST
On Sun, Sep 15, 2024 at 08:05:46AM +0100, Al Viro wrote:
> Interesting... How does the mainline manage to avoid the
> call of exfat_kill_sb(), which should call_rcu() delayed_free(), which
> calls exfat_free_upcase_table()?
>
> Could you verify that your reproducer does *NOT* hit that
> callchain? AFAICS, the only caller of exfat_load_upcase_table()
> is exfat_create_upcase_table(), called by __exfat_fill_super(),
> called by exfat_fill_super(), passed as callback to get_tree_bdev().
> And if that's the case, ->kill_sb() should be called on failure and
> with non-NULL ->s_fs_info...
>
> Something odd is going on there.
Yecchh... OK, I see what's happening, and the patch is probably
correct, but IMO it's way too subtle. Unless I'm misreading what's
going on there, you have the following:
exfat_load_upcase_table() have 3 failure exits.
One of them is with -ENOMEM; no table allocated and we proceed to
exfat_load_default_upcase_table().
Another is with -EIO. In that case the table is left allocated, the
caller of exfat_load_upcase_table() returns immediately and the normal
logics in ->kill_sb() takes it out.
Finally, there's one with -EINVAL. There the caller proceeds to
exfat_load_default_upcase_table(), which is where the mainline leaks.
That's the case your patch adjusts.
Note that resulting rules for exfat_load_upcase_table()
* should leave for ->kill_sb() to free if failing with -EIO.
* should make sure it's freed on all other failure exits.
At the very least that needs to be documented. However, since the
problem happens when the caller proceeds to exfat_load_default_upcase_table(),
the things would be simpler if you had taken the "need to free what we'd
allocated" logics into the place where that logics is visible. I.e.
ret = exfat_load_upcase_table(sb, sector, num_sectors,
le32_to_cpu(ep->dentry.upcase.checksum));
brelse(bh);
if (ret && ret != -EIO) {
/* clean after exfat_load_upcase_table() */
exfat_free_upcase_table(sbi);
goto load_default;
}
IMO it would be less brittle that way. And commit message needs
the explanation of the leak mechanism - a link to reporter is
nice, but it doesn't explain what's going on.