Re: [PATCH 09/19] VFS: add _async versions of the various directory modifying inode_operations

From: Al Viro
Date: Sat Feb 08 2025 - 20:09:21 EST

Next message: kernel test robot: "drivers/net/ethernet/sfc/ethtool_common.c:170:32: warning: '%-24s' directive output may be truncated writing between 24 and 31 bytes into a region of size between 0 and 25"
Previous message: Cong Wang: "Re: [PATCH v4 00/14] kexec: introduce Kexec HandOver (KHO)"
Next in thread: Al Viro: "Re: [PATCH 09/19] VFS: add _async versions of the various directory modifying inode_operations"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Fri, Feb 07, 2025 at 10:41:34PM +0000, Al Viro wrote:

> I'm sorry, but I don't buy the "complete with no lock on directory"
> part - not without a verifiable proof of correctness of the locking
> scheme. Especially if you are putting rename into the mix.
>
> And your method prototypes pretty much bake that in.
>
> *IF* we intend to try going that way (and I'm not at all convinced
> that it's feasible - locking aside, there's also a shitload of fun
> with fsnotify, audit, etc.), let's make those new methods take
> a single argument - something like struct mkdir_args, etc., with
> inlines for extracting individual arguments out of that. Yes, it's
> ugly, but it allows later changes without a massive headache on
> each calling convention modification.
>
> Said that, an explicit description of locking scheme and a proof of
> correctness (at least on the "it can't deadlock" level) is, IMO,
> a hard requirement for the entire thing, async or no async.
>
> We *do* have such for the current locking scheme.

While we are at it, the locking order is... interesting. You
have
* parent's ->i_rwsem before child's d_update_lock()
* for a child, d_update_lock() before ->i_rwsem
and that - on top of ordering between ->i_rwsem of various
inodes.

Do you actually have a proof that it's deadlock-free?

Next message: kernel test robot: "drivers/net/ethernet/sfc/ethtool_common.c:170:32: warning: '%-24s' directive output may be truncated writing between 24 and 31 bytes into a region of size between 0 and 25"
Previous message: Cong Wang: "Re: [PATCH v4 00/14] kexec: introduce Kexec HandOver (KHO)"
Next in thread: Al Viro: "Re: [PATCH 09/19] VFS: add _async versions of the various directory modifying inode_operations"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]